Availability, Blog and Traffic - Technology Performance Pulse

Migrating Critical Traffic At Scale with No Downtime — Part 1

The Netflix TechBlog

MAY 4, 2023

Migrating Critical Traffic At Scale with No Downtime — Part 1. By Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, and Devang Shah. Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This approach has a handful of benefits.

Traffic

Traffic Latency Tuning Systems

Migrating Critical Traffic At Scale with No Downtime — Part 2

The Netflix TechBlog

JUNE 13, 2023

Migrating Critical Traffic At Scale with No Downtime — Part 2. By Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, and Devang Shah. Picture yourself enthralled by the latest episode of your beloved Netflix series, delighting in an uninterrupted, high-definition streaming experience. This is where large-scale system migrations come into play.

Traffic

Traffic Metrics Systems Strategy

AI-powered DNS request tracking extends infrastructure observability for high quality network traffic

Dynatrace

OCTOBER 1, 2020

To extend Dynatrace diagnostic visibility into network traffic, we’ve added out-of-the-box DNS request tracking to our infrastructure monitoring capabilities. While our competitors only provide generic traffic monitoring without artificial intelligence, Dynatrace automatically analyzes DNS-related anomalies.

Traffic

Traffic Network Infrastructure Artificial Intelligence

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

Dynatrace

MARCH 5, 2025

You’ll be able to read more about our approach to cloud cost optimization in an upcoming blog post. The certification results are now publicly available. The calculations and methodology used are in line with the best available scientific approach, as well as with relevant reporting requirements. Public network traffic uses 1.0

Energy

Energy Analytics Traffic Cloud

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Dynatrace

JANUARY 21, 2025

Activate Davis AI to analyze charts within seconds. Davis AI can help you expand your dashboards and dive deeper into your available data to extract additional information. Have a look at our recent Davis CoPilot blog post for more information and practical use cases.

Traffic

Traffic Metrics Analytics Monitoring

Dynatrace Managed turnkey Premium High Availability for globally distributed data centers (Early Adopter)

Dynatrace

JUNE 25, 2020

Dynatrace Managed is intrinsically highly available as it stores three copies of all events, user sessions, and metrics across its cluster nodes. Our Premium High Availability comes with the following features: Active-active deployment model for optimum hardware utilization. Minimized cross-data center network traffic.

Availability

Availability Hardware Latency Traffic

Managing High Availability in PostgreSQL – Part III: Patroni

Scalegrid

AUGUST 22, 2019

In our previous blog posts, we discussed the capabilities and functioning of PostgreSQL Automatic Failover (PAF) by Cluster Labs and Replication Manager (repmgr) by 2ndQuadrant. Managing High Availability in PostgreSQL – Part I: PostgreSQL Automatic Failover. Managing High Availability in PostgreSQL – Part II: Replication Manager.

Availability

Availability Servers Network Testing

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

Part 3: System Strategies and Architecture. By: Varun Khaitan. With special thanks to my stunning colleagues: Mallika Rao, Esmir Mesic, Hugo Marques. This blog post is a continuation of Part 2, where we cleared the ambiguity around title launch observability at Netflix. We call this capability TimeTravel.

Traffic

Traffic Strategy Entertainment Innovation

Managing PostgreSQL® High Availability – Part I: PostgreSQL Automatic Failover

Scalegrid

SEPTEMBER 5, 2024

Managing High Availability (HA) in your PostgreSQL hosting is very important to ensuring your database deployment clusters maintain exceptional uptime and strong operational performance so your data is always available to your application. Effective management of failover and switchover operations is crucial for high availability.

Availability

Availability Servers Database Open Source

OneAgent for Linux on IBM Z (General Availability)

Dynatrace

NOVEMBER 20, 2019

Having released this functionality in an Early Adopter Release with OneAgent version 1.173 and Dynatrace version 1.174 back in August 2019, we’re now happy to announce the General Availability of OneAgent full-stack monitoring for Linux on the IBM Z platform, sometimes informally referred to as Z/Linux. Release details.

Availability

Availability Hardware Java Tuning

Ensuring the Successful Launch of Ads on Netflix

The Netflix TechBlog

JUNE 1, 2023

To do this, we devised a novel way to simulate the projected traffic weeks ahead of launch by building upon the traffic migration framework described here. New content or national events may drive brief spikes, but, by and large, traffic is usually smoothly increasing or decreasing.

Traffic

Traffic Best Practices Systems Testing

Migrating Netflix to GraphQL Safely

The Netflix TechBlog

JUNE 14, 2023

This blog post will share broadly-applicable techniques (beyond GraphQL) we used to perform this migration. The control group’s traffic utilized the legacy Falcor stack, while the experiment population leveraged the new GraphQL client and was directed to the GraphQL Shim. The Replay Tester tool samples raw traffic streams from Mantis.

Traffic

Traffic Latency Metrics Cache

MySQL High Availability Framework Explained – Part III: Failure Scenarios

Scalegrid

APRIL 16, 2019

In this three-part blog series, we introduced a High Availability (HA) Framework for MySQL hosting in Part I, and discussed the details of MySQL semisynchronous replication in Part II. Now in Part III, we review how the framework handles some of the important MySQL failure scenarios and recovers to ensure high availability.

Availability

Availability Network Azure AWS

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

In this multi-part blog series, we take you behind the scenes of our system that processes billions of impressions daily. This dual availability ensures immediate processing capabilities alongside comprehensive long-term data retention. Thus, all data in one region is processed by the Flink job deployed within that region.

Tuning

Tuning Latency Efficiency Storage

A Dynatrace champion’s guide to get ahead of digital marketing campaigns

Dynatrace

JULY 1, 2020

In my last blog, I provided an example of this happening, whereby the traffic spiked and quadrupled the usual incoming traffic.

Traffic

Traffic Analytics Metrics Servers

Automate CI/CD pipelines with Dynatrace: Part 2, Deploy stage

Dynatrace

NOVEMBER 28, 2023

In the previous installment of this blog series, we explored how to set up Dynatrace as a build-stage orchestrator to effectively address the challenges faced by Site Reliability Engineers (SREs). This can lead to a lack of insight into how the code will behave when exposed to heavy traffic. What’s next?

Traffic

Traffic Best Practices Strategy Engineering

Large scale deployments are easy and cost-effective with network zones (Early Adopter)

Dynatrace

JULY 2, 2020

Unnecessary traffic between such data centers can result in wasted resources, unpredictable downtimes, and lost business. By minimizing bandwidth and preventing unrelated traffic between data centers, you can maintain healthy network infrastructure and save on costs. Optimizing traffic routing.

Network

Network Traffic Infrastructure Tuning

MySQL High Availability Framework Explained – Part III: Failover Scenarios

High Scalability

APRIL 16, 2019

In this three-part blog series, we introduced a High Availability (HA) Framework for MySQL hosting in Part I, and discussed the details of MySQL semisynchronous replication in Part II. Now in Part III, we review how the framework handles some of the important MySQL failure scenarios and recovers to ensure high availability.

Availability

Availability Network Azure AWS

Get the insights you need for your F5 BIG-IP LTM

Dynatrace

JUNE 5, 2023

The F5 BIG-IP Local Traffic Manager (LTM) is an application delivery controller (ADC) that ensures the availability, security, and optimal performance of network traffic flows. Business-critical applications typically rely on F5 for availability and success. It serves as a crucial component between applications and users.

Traffic

Traffic Virtualization Metrics Monitoring

Architected for resiliency: How Dynatrace withstands data center outages

Dynatrace

JUNE 15, 2021

The path to “Architected for Resiliency” is long, but it clearly pays off in the long run, especially when outages occur, as I want to show you in this blog post. The subject line said: “Success Story: Major Issue in single AWS Frankfurt Availability Zone!” Fact #4: Multi-node, multi-availability zone deployment architecture.

AWS

AWS Traffic Architecture Azure

OneAgent for Linux on IBM Z now available in Early Adopter Release

Dynatrace

AUGUST 8, 2019

We’re happy to announce the Early Adopter Release of OneAgent full-stack monitoring for Linux on the IBM Z platform, sometimes informally referred to as Z/Linux (available with OneAgent version 1.173 and Dynatrace version 1.174). For details on available metrics, see our help page on host performance monitoring.

Availability

Availability Hardware Java Tuning

The new normal of digital experience delivery – lessons learned from monitoring mission-critical websites during COVID-19

Dynatrace

MAY 6, 2020

Over the last two months, we’ve monitored key sites and applications across industries that have been receiving surges in traffic, including government, health insurance, retail, banking, and media. Readers who share our privacy concerns, please note, all the data we monitor is publicly available. Monitoring with the

Website

Website Monitoring Retail Media

Rapid Event Notification System at Netflix

The Netflix TechBlog

FEBRUARY 18, 2022

In this blog post, we will give an overview of the Rapid Event Notification System at Netflix and share some of the learnings we gained along the way. We thus assigned a priority to each use case and sharded event traffic by routing to priority-specific queues and the corresponding event processing clusters.

Systems

Systems Traffic Architecture Mobile

Easy SLA and SLO reporting for all your API endpoints with public synthetic HTTP monitors

Dynatrace

JUNE 26, 2020

With today’s high expectations for the speed and availability of applications, you need a deep understanding of real user experiences to make the best business decisions. Dynatrace Synthetic Monitoring ensures that your application is available and performs well from anywhere in the world to meet your SLAs.

Monitoring

Monitoring Azure AWS Traffic

COVID-19 and Digital Services: An Action Plan for the Unexpected

Dynatrace

APRIL 22, 2020

While most government agencies and commercial enterprises have digital services in place, the current volume of usage — including traffic to critical employment, health and retail/eCommerce services — has reached levels that many organizations have never seen before or tested against. So how do you know what to prepare for?

Traffic

Traffic Ecommerce Retail Government

Helping your digital services run optimally for your customers and employees during COVID-19

Dynatrace

APRIL 16, 2020

Aside from the huge surge in internal application usage, businesses are also witnessing increased levels of user traffic to their applications. Facilitating an understanding of traffic patterns and potential traffic spikes helps maintain customer experience. One example of these surges was from an unemployment application.

Traffic

Traffic Government Database Network

Kubernetes vs Docker: What’s the difference?

Dynatrace

SEPTEMBER 29, 2021

This opens the door to auto-scalable applications, which effortlessly match the demands of rapidly growing and varying user traffic. Containers can be replicated or deleted on the fly to meet varying end-user traffic. In production, containers are easy to replicate. What is Docker?

Open Source

Open Source Traffic DevOps Cloud

What is a service mesh?

Dynatrace

MAY 21, 2021

This becomes even more challenging when the application receives heavy traffic, because a single microservice might become overwhelmed if it receives too many requests too quickly. The Envoy proxies also collect and report telemetry on all traffic among the services in the mesh.

Traffic

Traffic DevOps Infrastructure Network

Geek Reading - Week of June 5, 2013

DZone

OCTOBER 11, 2022

Making Google’s CalDAV and CardDAV APIs available for everyone (Google Developers Blog). Improving testing by using real traffic from production (Hacker News). SAP to acquire Hybris to jumpstart its presence in e-commerce (VentureBeat).

Java

Java Best Practices Google Analytics

General availability of OneAgent full-stack monitoring for AIX

Dynatrace

APRIL 16, 2019

We’re proud to announce the general availability of OneAgent full-stack monitoring for the AIX operating system. Monitoring IBM Power Systems isn’t a simple task; due to its specific architecture, there aren’t many tools available on the market. The ones that are available are old generation.

Availability

Availability Monitoring Metrics Operating System

Innovate. Collaborate. Deliver. Our digital hub is live

Dynatrace

APRIL 9, 2020

As the world socially distances, we are seeing significant increases in website traffic as people turn to their phones and devices to connect with loved ones, buy online, distance learn, work remotely, and continuously keep up with the news. it’s not increasing!).

Innovation

Innovation Traffic Website Monitoring

The Ultimate Guide to Database High Availability

Percona

JUNE 22, 2023

To make data count and to ensure cloud computing is unabated, companies and organizations must have highly available databases. This guide provides an overview of what high availability means, the components involved, how to measure high availability, and how to achieve it. How does high availability work?

Availability

Availability Database Open Source Hardware

9 key DevOps metrics for success

Dynatrace

SEPTEMBER 28, 2021

In a world where 99.999% availability is the standard, measuring MTTR is a crucial practice to ensure resiliency and stability. This metric helps determine the effectiveness of your monitoring and detection capabilities in support of system reliability and availability. App availability. Application usage and traffic.

DevOps

DevOps Metrics Traffic Efficiency

What is cloud migration?

Dynatrace

SEPTEMBER 30, 2021

Cloud migration is the process of transferring some or all your data, software, and operations to a cloud-based computing environment that offers unlimited scale and high availability. In case of a spike in traffic, you can automatically spin up more resources, often in a matter of seconds. Improved performance and availability.

Cloud

Cloud Traffic Best Practices Strategy

Unlock end-to-end observability insights with Dynatrace PurePath 4 seamless integration of OpenTracing for Java

Dynatrace

DECEMBER 9, 2020

Let’s consider the business challenges of an online shop that is powered by a microservice architecture where several instances of each microservice run, including the shopping cart service, to ensure the highest possible availability. With Dynatrace OneAgent you also benefit from support for traffic routing and traffic control.

Java

Java Traffic Architecture Strategy

Efficient SLO event integration powers successful AIOps

Dynatrace

APRIL 5, 2024

This blog post is for both novice and seasoned audiences alike. The first part of this blog post briefly explores the integration of SLO events with AI. When the SLO status converges to an optimal value of 100%, and there’s substantial traffic (calls/min), BurnRate becomes more relevant for anomaly detection.

Efficiency

Efficiency Traffic Tuning Metrics

Ready-to-Use High Availability Architectures for MySQL and PostgreSQL

Percona

JUNE 12, 2023

When it comes to access to their applications, users demand instant, reliable, and secure interactions — and that means databases must be highly available. With database high availability (HA), services are largely uninterrupted, and end users are largely satisfied. The obvious answer is this: To achieve high availability.

Architecture

Architecture Availability Open Source Healthcare

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

Its design prioritizes high availability and efficient data transfer with minimal overhead, making it a practical choice for handling real-time data pipelines and distributed event processing. It follows a push-based approach, ensuring messages are distributed to consumers as soon as they become available.

Latency

Latency Analytics Architecture Storage

Optimize your marketing campaign investment by leveraging BizDevOps

Dynatrace

JUNE 10, 2020

In this blog, I’m going to share what happened recently at a Dynatrace customer and we explore what they could have done differently by leveraging Dynatrace and BizDevOps practices. From the below screenshot you can see that the traffic picked up not only slightly but quadrupled! Did your big marketing campaign investment pay off?

Traffic

Traffic Analytics DevOps Infrastructure

Simplify troubleshooting with AI-powered insights into connection pool performance (Early Adopter)

Dynatrace

DECEMBER 9, 2020

In addition to being available as metrics in custom charts, you can view these metrics at the process group instance level in the Dynatrace web UI. Aggregated connection pool metrics are available on the process group overview page. A Davis detected problem identifies an increase in traffic as a possible root cause.

Traffic

Traffic Performance Database Metrics

Dynatrace Application Security detects and blocks attacks automatically in real-time

Dynatrace

FEBRUARY 10, 2022

WAFs protect the network perimeter and monitor, filter, or block HTTP traffic. Compared to intrusion detection systems (IDS/IPS), WAFs are focused on the application traffic. RASP solutions sit in or near applications and analyze application behavior and traffic. How to get started.

Traffic

Traffic Benchmarking Innovation Java

Dynatrace adds support for VPC Flow Logs to Kinesis Data Firehose

Dynatrace

SEPTEMBER 7, 2022

VPC Flow Logs is an Amazon service that enables IT pros to capture information about the IP traffic that traverses network interfaces in a virtual private cloud, or VPC. By default, each record captures a network internet protocol (IP), a destination, and the source of the traffic flow that occurs within your environment.

Traffic

Traffic AWS Network Cloud

Get quick alerts and avoid false positives with the new baseline setting

Dynatrace

MARCH 26, 2020

This means that Dynatrace alerts more quickly when an error spike occurs in a high-traffic service (compared to a low-traffic service where statistical confidence is lower). The configuration is available at the global level as well as the service level. To avoid false-positive alerts on your services, you can add more time.

Traffic

Traffic Monitoring Efficiency Strategy

Automated Change Impact Analysis with Site Reliability Guardian

Dynatrace

FEBRUARY 15, 2023

SREs use Service-Level Indicators (SLI) to see the complete picture of service availability, latency, performance, and capacity across various systems, especially revenue-critical systems. This is all available out-of-the-box with the default workflow template provided by Site Reliability Guardian.

DevOps

DevOps Latency Traffic Best Practices
