Processing and Traffic - Technology Performance Pulse

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

MAY 4, 2023

Migrating Critical Traffic At Scale with No Downtime — Part 1 Shyam Gala , Javier Fernandez-Ivern , Anup Rokkam Pratap , Devang Shah Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This approach has a handful of benefits.

Traffic

Traffic Latency Tuning Systems

Close Site Search Indexing via Kubernetes HAProxy Ingress

DZone

OCTOBER 22, 2024

In Kubernetes , Ingress resources are frequently used as traffic controllers, providing external access to services within the cluster. This blog post will walk you through the process of blocking your site's indexing on Kubernetes Ingress using robots.txt file, preventing search engine bots from crawling and indexing your content.

Traffic

Traffic Engineering Processing Development

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

JUNE 13, 2023

Migrating Critical Traffic At Scale with No Downtime — Part 2 Shyam Gala , Javier Fernandez-Ivern , Anup Rokkam Pratap , Devang Shah Picture yourself enthralled by the latest episode of your beloved Netflix series, delighting in an uninterrupted, high-definition streaming experience.

Traffic

Traffic Metrics Systems Strategy

Black Friday traffic exposes gaps in observability strategies

Dynatrace

SEPTEMBER 2, 2022

What’s the problem with Black Friday traffic? But that’s difficult when Black Friday traffic brings overwhelming and unpredictable peak loads to retailer websites and exposes the weakest points in a company’s infrastructure, threatening application performance and user experience. Why Black Friday traffic threatens customer experience.

Traffic

Traffic Strategy Retail Ecommerce

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Dynatrace

JANUARY 21, 2025

Ensuring smooth operations is no small feat, whether you’re in charge of application performance, IT infrastructure, or business processes. For example, if you’re monitoring network traffic and the average over the past 7 days is 500 Mbps, the threshold will adapt to this baseline.

Traffic

Traffic Metrics Analytics Monitoring

Best Practices for Designing Resilient APIs for Scalability and Reliability

DZone

JANUARY 8, 2025

API resilience is about creating systems that can recover gracefully from disruptions, such as network outages or sudden traffic spikes, ensuring they remain reliable and secure. However, it often introduces new challenges in the process. This has become critical since APIs serve as the backbone of todays interconnected systems.

Best Practices

Best Practices Design Scalability Architecture

Process more with less using smarter cluster overload prevention for Dynatrace Managed

Dynatrace

MAY 14, 2020

Turnkey cluster overload protection with adaptive traffic management and control. By vastly increasing the number of PurePaths that are processed by a Dynatrace Managed cluster, your initial sizing considerations for Dynatrace Managed nodes and clusters may however end up being inadequate for supporting such volume.

Processing

Processing Hardware Traffic Storage

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

Dynatrace

MARCH 5, 2025

Integration with existing systems and processes : Integration with existing IT infrastructure, observability solutions, and workflows often requires significant investment and customization. Actions resulting from the evaluation The certification process surfaced a few recommendations for improving the app.

Energy

Energy Analytics Traffic Cloud

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

As Netflix expanded globally and the volume of title launches skyrocketed, the operational challenges of maintaining this manual process became undeniable. Metadata and assets must be correctly configured, data must flow seamlessly, microservices must process titles without error, and algorithms must function as intended.

Traffic

Traffic Scalability Strategy Monitoring

Rebuilding Netflix Video Processing Pipeline with Microservices

The Netflix TechBlog

JANUARY 10, 2024

Future blogs will provide deeper dives into each service, sharing insights and lessons learned from this process. The Netflix video processing pipeline went live with the launch of our streaming service in 2007. The Netflix video processing pipeline went live with the launch of our streaming service in 2007.

Processing

Processing Media Latency Innovation

How Netflix Accurately Attributes eBPF Flow Logs

The Netflix TechBlog

APRIL 8, 2025

FlowCollector , a backend service, collects flow logs from FlowExporter instances across the fleet, attributes the IP addresses, and sends these attributed flows to Netflixs Data Mesh for subsequent stream and batch processing. 2xlarge instances, we can process 5 million flows per second across the entire Netflixfleet. With 30 c7i.2xlarge

AWS

AWS Traffic Network Programming

Breaking AWS Lambda: Chaos Engineering for Serverless Devs

DZone

MARCH 24, 2025

Our "serverless" order processing system built on AWS Lambda and API Gateway was humming along, handling 1,000 transactions/minute. A sudden spike in traffic caused Lambda timeouts, API Gateway threw 5xx errors, and customers started tweeting, Why cant I check out?! Then, disaster struck.

Lambda

Lambda Serverless AWS Engineering

Unlock the observability value of log data with processing at scale

Dynatrace

AUGUST 16, 2022

FortiGate traffic logs store data elements in key-value pairs while NGINX custom access logs store events in arrays. Advanced processing on your observability platform unlocks the full value of log data. Advanced processing on your observability platform unlocks the full value of log data.

Processing

Processing Metrics Monitoring Java

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

Scaling RabbitMQ ensures your system can handle growing traffic and maintain high performance. Optimizing RabbitMQ performance through strategies such as keeping queues short, enabling lazy queues, and monitoring health checks is essential for maintaining system efficiency and effectively managing high traffic loads.

Best Practices

Best Practices Traffic Strategy Efficiency

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

Accurately Reflecting Production Behavior A key part of our solution is insights into production behavior, which necessitates our requests to the endpoint result in traffic to the real service functions that mimics the same pathways the traffic would take if it came from the usualcallers. We call this capability TimeTravel.

Traffic

Traffic Strategy Entertainment Innovation

Ensuring the Successful Launch of Ads on Netflix

The Netflix TechBlog

JUNE 1, 2023

To do this, we devised a novel way to simulate the projected traffic weeks ahead of launch by building upon the traffic migration framework described here. New content or national events may drive brief spikes, but, by and large, traffic is usually smoothly increasing or decreasing.

Traffic

Traffic Best Practices Systems Testing

Rapid Event Notification System at Netflix

The Netflix TechBlog

FEBRUARY 18, 2022

Event Prioritization Considering the use cases were wide ranging both in terms of their sources and their importance, we built segmentation into the event processing. We thus assigned a priority to each use case and sharded event traffic by routing to priority-specific queues and the corresponding event processing clusters.

Systems

Systems Traffic Architecture Mobile

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

It requires a state-of-the-art system that can track and process these impressions while maintaining a detailed history of each profiles exposure. In this multi-part blog series, we take you behind the scenes of our system that processes billions of impressions daily.

Tuning

Tuning Latency Efficiency Storage

COVID-19 and Digital Services: An Action Plan for the Unexpected

Dynatrace

APRIL 22, 2020

While most government agencies and commercial enterprises have digital services in place, the current volume of usage — including traffic to critical employment, health and retail/eCommerce services — has reached levels that many organizations have never seen before or tested against. So how do you know what to prepare for?

Traffic

Traffic Ecommerce Retail Government

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. RabbitMQ follows a message broker model with advanced routing, while Kafkas event streaming architecture uses partitioned logs for distributed processing. What is Apache Kafka?

Latency

Latency Analytics Architecture Storage

Automate CI/CD pipelines with Dynatrace: Part 2, Deploy stage

Dynatrace

NOVEMBER 28, 2023

Even when the staging environment closely mirrors the production environment, achieving a complete replication of all potential scenarios, such as simulating extremely high traffic volumes to assess software performance, remains challenging. This can lead to a lack of insight into how the code will behave when exposed to heavy traffic.

Traffic

Traffic Best Practices Strategy Engineering

Six causes of major software outages–And how to avoid them

Dynatrace

AUGUST 8, 2024

It’s also critical to have a strategy in place to address these outages, including both documented remediation processes and an observability platform to help you proactively identify and resolve issues to minimize customer and business impact. These attacks can be orchestrated by hackers, cybercriminals, or even state actors.

Software

Software Software Infrastructure Network

What is a service mesh?

Dynatrace

MAY 21, 2021

This becomes even more challenging when the application receives heavy traffic, because a single microservice might become overwhelmed if it receives too many requests too quickly. Management processes make up the control plane, which coordinates the proxies’ behavior. Why do you need a service mesh?

Traffic

Traffic DevOps Infrastructure Network

A Comprehensive Guide to Database Sharding: Building Scalable Systems

DZone

OCTOBER 2, 2024

In this article, we’ll dive deep into the concept of database sharding, a critical technique for scaling databases to handle large volumes of data and high levels of traffic. This section will provide insights into the architecture and strategies to ensure efficient query processing in a sharded environment.

Database

Database Systems Scalability Traffic

9 key DevOps metrics for success

Dynatrace

SEPTEMBER 28, 2021

Your next challenge is ensuring your DevOps processes, pipelines, and tooling meet the intended goal. For example, by measuring deployment frequency daily or weekly, you can determine how efficiently your team is responding to process changes. Lead time for changes helps teams understand how effective their processes are.

DevOps

DevOps Metrics Traffic Efficiency

Unlock end-to-end observability insights with Dynatrace PurePath 4 seamless integration of OpenTracing for Java

Dynatrace

DECEMBER 9, 2020

With Dynatrace OneAgent you also benefit from support for traffic routing and traffic control. OneAgent implements network zones to create traffic routing rules and limit cross data-center traffic. Upgrade OpenTracing instrumentation with high-fidelity data provided by OneAgent.

Java

Java Traffic Architecture Strategy

What is cloud migration?

Dynatrace

SEPTEMBER 30, 2021

Cloud migration is the process of transferring some or all your data, software, and operations to a cloud-based computing environment that offers unlimited scale and high availability. A key step in digital transformation is migrating from traditional on-prem IT processes to adopting cloud services. What is cloud migration?

Cloud

Cloud Traffic Best Practices Strategy

Keeping Netflix Reliable Using Prioritized Load Shedding

The Netflix TechBlog

NOVEMBER 2, 2020

How viewers are able to watch their favorite show on Netflix while the infrastructure self-recovers from a system failure By Manuel Correa , Arthur Gonigberg , and Daniel West Getting stuck in traffic is one of the most frustrating experiences for drivers around the world. Logs and background requests are examples of this type of traffic.

Traffic

Traffic Metrics Infrastructure Architecture

Get the insights you need for your F5 BIG-IP LTM

Dynatrace

JUNE 5, 2023

The F5 BIG-IP Local Traffic Manager (LTM) is an application delivery controller (ADC) that ensures the availability, security, and optimal performance of network traffic flows. Detect and respond to security threats like DDoS attacks or web application attacks by monitoring application traffic and logs.

Traffic

Traffic Virtualization Metrics Monitoring

What are quality gates? How to use quality gates to deliver better software at speed and scale

Dynatrace

FEBRUARY 21, 2024

Open vulnerability on process group: The total number of currently high-profile vulnerabilities related to a process group. Vulnerability score: The highest vulnerability risk score for a process group. This way, the travel agency can easily streamline, organize, and consolidate their quality gates and metric evaluation process.

Speed

Speed Software Software Latency

Istio Explained: Unlocking the Power of Service Mesh in Microservices

DZone

MARCH 11, 2024

This article delves deep into the essence of Istio, illustrating its pivotal role in a Kubernetes (KIND) based environment, and guides you through a Helm-based installation process, ensuring a comprehensive understanding of Istio's capabilities and its impact on microservices architecture.

Open Source

Open Source Traffic Architecture Monitoring

Service level objectives: 5 SLOs to get started

Dynatrace

Dynatrace

AUGUST 25, 2023

These signals ( latency, traffic, errors, and saturation ) provide a solid means of proactively monitoring operative systems via SLOs and tracking business success. Generally, response times measure the total duration of receiving, processing, and completing a request. One template explicitly targets service performance monitoring.

Performance

Performance Latency Traffic Metrics

Digital transformation strategies: Success stories from three digital transformation journeys

Dynatrace

MAY 8, 2023

Digitizing internal processes can improve information flow and enhance collaboration among employees. However, digital transformation requires significant investment in technology infrastructure and processes. Previously, they had 12 tools with different traffic thresholds. Enhanced business operations.

Strategy

Strategy Retail DevOps Traffic

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

Close Site Search Indexing via Kubernetes HAProxy Ingress

Trending Sources

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

Black Friday traffic exposes gaps in observability strategies

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Best Practices for Designing Resilient APIs for Scalability and Reliability

Process more with less using smarter cluster overload prevention for Dynatrace Managed

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

Title Launch Observability at Netflix Scale

Rebuilding Netflix Video Processing Pipeline with Microservices

How Netflix Accurately Attributes eBPF Flow Logs

Breaking AWS Lambda: Chaos Engineering for Serverless Devs

Unlock the observability value of log data with processing at scale

Best Practices for Scaling RabbitMQ

Title Launch Observability at Netflix Scale

Ensuring the Successful Launch of Ads on Netflix

Rapid Event Notification System at Netflix

Introducing Impressions at Netflix

COVID-19 and Digital Services: An Action Plan for the Unexpected

RabbitMQ vs. Kafka: Key Differences

Automate CI/CD pipelines with Dynatrace: Part 2, Deploy stage

Six causes of major software outages–And how to avoid them

What is a service mesh?

A Comprehensive Guide to Database Sharding: Building Scalable Systems

9 key DevOps metrics for success

Unlock end-to-end observability insights with Dynatrace PurePath 4 seamless integration of OpenTracing for Java

What is cloud migration?

Keeping Netflix Reliable Using Prioritized Load Shedding

Get the insights you need for your F5 BIG-IP LTM

What are quality gates? How to use quality gates to deliver better software at speed and scale

Istio Explained: Unlocking the Power of Service Mesh in Microservices

Service level objectives: 5 SLOs to get started

Top PostgreSQL 17 New Features

The Power of Caching: Boosting API Performance and Scalability

Dynatrace adds support for VPC Flow Logs to Kinesis Data Firehose

What is vulnerability management? And why runtime vulnerability detection makes the difference

How Dynatrace boosts production resilience with Site Reliability Guardian

Measuring Network Performance in Mobile Safari

Apollo Router Performance Monitoring with OpenTelemetry and Splunk APM

Leverage automated and intelligent observability for OpenTelemetry for Go with Dynatrace PurePath 4

What is application security monitoring?

Simplify troubleshooting with AI-powered insights into connection pool performance (Early Adopter)

Maximize user experience with out-of-the-box service-performance SLOs

Digital transformation strategies: Success stories from three digital transformation journeys

Stay Connected