Migrating Critical Traffic At Scale with No Downtime — Part 1. By Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, and Devang Shah. Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This approach has a handful of benefits.
API resilience is about creating systems that can recover gracefully from disruptions, such as network outages or sudden traffic spikes, ensuring they remain reliable and secure. This has become critical because APIs serve as the backbone of today's interconnected systems. Here's a closer look at the major milestones in API architecture.
DevOps and security teams managing today’s multicloud architectures and cloud-native applications are facing an avalanche of data. This enables proactive changes such as resource autoscaling, traffic shifting, or preventative rollbacks of bad code deployment ahead of time.
Migrating Critical Traffic At Scale with No Downtime — Part 2. By Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, and Devang Shah. Picture yourself enthralled by the latest episode of your beloved Netflix series, delighting in an uninterrupted, high-definition streaming experience. This is where large-scale system migrations come into play.
With the rapid development of Internet technology, server-side architectures have become increasingly complex. Therefore, real online traffic is crucial for server-side testing. TCPCopy [1] is an open-source traffic replay tool that has been widely adopted by large enterprises.
We have developed a microservices platform that encounters sporadic system failures under heavy traffic events. System resilience is the key requirement for e-commerce platforms during scaling operations: services must stay operational and keep delivering excellent performance to users.
Part 3: System Strategies and Architecture. By Varun Khaitan, with special thanks to my stunning colleagues Mallika Rao, Esmir Mesic, and Hugo Marques. This blog post is a continuation of Part 2, where we cleared the ambiguity around title launch observability at Netflix. The response schema for the observability endpoint.
Scaling RabbitMQ ensures your system can handle growing traffic and maintain high performance. Optimizing RabbitMQ performance through strategies such as keeping queues short, enabling lazy queues, and monitoring health checks is essential for maintaining system efficiency and effectively managing high traffic loads.
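As a concrete illustration of the lazy-queue and short-queue advice above, here is a minimal sketch using the Python pika client; the broker URL, queue name, and prefetch value are illustrative assumptions, not settings from the article.

import pika

# Connect to a RabbitMQ broker (illustrative URL).
connection = pika.BlockingConnection(
    pika.URLParameters("amqp://guest:guest@localhost:5672/")
)
channel = connection.channel()

# Lazy queues keep messages on disk instead of in memory, which reduces
# memory pressure when queues grow under heavy traffic.
channel.queue_declare(
    queue="orders",
    durable=True,
    arguments={"x-queue-mode": "lazy"},
)

# Keeping queues short: cap how many unacknowledged messages a consumer
# holds so messages are processed and acknowledged promptly.
channel.basic_qos(prefetch_count=50)

connection.close()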
Motivation With the rapid growth in Netflix member base and the increasing complexity of our systems, our architecture has evolved into an asynchronous one that enables both online and offline computation. This helps limit the outgoing traffic footprint considerably.
This article outlines the key differences in architecture, performance, and use cases to help determine the best fit for your workload. RabbitMQ follows a message broker model with advanced routing, while Kafka's event streaming architecture uses partitioned logs for distributed processing. What is RabbitMQ? What is Apache Kafka?
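To make the contrast concrete, here is a rough sketch of publishing the same event in both models, assuming the Python pika and kafka-python clients; the broker addresses, exchange, topic, and keys are hypothetical.

import pika
from kafka import KafkaProducer

# RabbitMQ: the broker routes each message through an exchange to queues
# based on a routing key.
rmq = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
channel = rmq.channel()
channel.exchange_declare(exchange="orders", exchange_type="topic", durable=True)
channel.basic_publish(
    exchange="orders",
    routing_key="order.created.eu",
    body=b"order-123",
)
rmq.close()

# Kafka: the producer appends the event to a partition of the "orders" topic
# (chosen from the message key); consumers read the log at their own pace.
producer = KafkaProducer(bootstrap_servers="localhost:9092")
producer.send("orders", key=b"order-123", value=b"order-created")
producer.flush()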
This article provides an overview of Azure's load balancing options, encompassing Azure Load Balancer, Azure Application Gateway, Azure Front Door Service, and Azure Traffic Manager. Load balancing is a critical component in cloud architectures for various reasons. What Is Load Balancing?
Service meshes are becoming increasingly popular in cloud-native applications as they provide a way to manage network traffic between microservices. It offers several features, including: Prioritized load shedding: Drops traffic that is deemed less important to ensure that the most critical traffic is served.
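The prioritized load shedding idea can be sketched in a few lines: once the service nears its concurrency limit, it rejects less important requests first so that critical traffic keeps being served. This is a toy in-process illustration, not mesh configuration, and the thresholds are made up.

import threading

MAX_IN_FLIGHT = 100        # hard concurrency limit for the service
LOW_PRIORITY_CUTOFF = 80   # above this, shed non-critical requests

_in_flight = 0
_lock = threading.Lock()

def try_admit(priority: str) -> bool:
    """Return True if the request should be served, False if it is shed."""
    global _in_flight
    with _lock:
        if _in_flight >= MAX_IN_FLIGHT:
            return False   # shed everything once the hard limit is reached
        if priority != "critical" and _in_flight >= LOW_PRIORITY_CUTOFF:
            return False   # shed only the less important traffic
        _in_flight += 1
        return True

def release() -> None:
    """Call when a request finishes to free its concurrency slot."""
    global _in_flight
    with _lock:
        _in_flight -= 1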
Putting an external cache in front of the database is commonly used to compensate for subpar latency stemming from various factors, such as inefficient database internals, driver usage, infrastructure choices, traffic spikes, and so on. In fact, external caches can be one of the more problematic components of a distributed application architecture.
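For reference, the usual cache-aside pattern behind this setup looks roughly like the sketch below, assuming the Python redis client; the Redis address, key format, TTL, and the stubbed database call are illustrative.

import json
import redis

cache = redis.Redis(host="localhost", port=6379)

def fetch_user_from_db(user_id: int) -> dict:
    # Stand-in for the real (slow) database query.
    return {"id": user_id, "name": "example"}

def get_user(user_id: int, ttl_seconds: int = 300) -> dict:
    key = f"user:{user_id}"
    cached = cache.get(key)
    if cached is not None:
        return json.loads(cached)                    # cache hit
    user = fetch_user_from_db(user_id)               # cache miss: go to the database
    cache.setex(key, ttl_seconds, json.dumps(user))  # populate with a TTL
    return user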
With the advent of cloud computing, managing network traffic and ensuring optimal performance have become critical aspects of system architecture. Amazon Web Services (AWS), a leading cloud service provider, offers a suite of load balancers to manage network traffic effectively for applications running on its platform.
Service mesh emerged as a response to the growing popularity of cloud-native environments, microservices architecture, and Kubernetes. It has its roots in the three-tiered model of application architecture. Under a heavy load, the application could break if the traffic routing, load balancing, etc., were not optimized.
Cloud-native technologies and microservice architectures have shifted technical complexity from the source code of services to the interconnections between services. Heterogeneous cloud-native microservice architectures can lead to visibility gaps in distributed traces. Dynatrace news.
In this article, we’ll dive deep into the concept of database sharding, a critical technique for scaling databases to handle large volumes of data and high levels of traffic. This section will provide insights into the architecture and strategies to ensure efficient query processing in a sharded environment.
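The core routing idea behind sharding can be shown in a few lines: a stable hash of the shard key decides which shard holds a row, so queries by that key touch only one database. The shard count and connection strings below are illustrative, not a recommendation from the article.

import hashlib

SHARDS = [
    "postgres://db-shard-0.internal/app",
    "postgres://db-shard-1.internal/app",
    "postgres://db-shard-2.internal/app",
    "postgres://db-shard-3.internal/app",
]

def shard_for(shard_key: str) -> str:
    """Map a shard key (for example a customer ID) to one shard's connection string."""
    digest = hashlib.sha256(shard_key.encode()).hexdigest()
    return SHARDS[int(digest, 16) % len(SHARDS)]

# All queries for the same customer are routed to the same shard.
print(shard_for("customer:42"))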
Architecture Overview: The first pivotal step in managing impressions begins with the creation of a Source-of-Truth (SOT) dataset. Impression Source-of-Truth architecture. Ensuring High Quality Impressions: Maintaining the highest quality of impressions is a top priority.
The fact is, Reliability and Resiliency must be rooted in the architecture of a distributed system. The email walked through how our Dynatrace self-monitoring notified users of the outage but automatically remediated the problem thanks to our platform’s architecture. And that’s true for Dynatrace as well.
This article delves deep into the essence of Istio, illustrating its pivotal role in a Kubernetes (KIND) based environment, and guides you through a Helm-based installation process, ensuring a comprehensive understanding of Istio's capabilities and its impact on microservices architecture.
How viewers are able to watch their favorite show on Netflix while the infrastructure self-recovers from a system failure. By Manuel Correa, Arthur Gonigberg, and Daniel West. Getting stuck in traffic is one of the most frustrating experiences for drivers around the world. Logs and background requests are examples of this type of traffic.
In the dynamic world of microservices architecture, efficient service communication is the linchpin that keeps the system running smoothly. It comprises a suite of capabilities, such as managing traffic, enabling service discovery, enhancing security, ensuring observability, and fortifying resilience.
Network traffic growth is the main reason for increasing spending, largely because of the adoption of hybrid and multi-cloud architectures. What are the issues with traffic losses and connectivity drops? “Without the network, nothing will happen,” Ziemianowicz said.
This becomes even more challenging when the application receives heavy traffic, because a single microservice might become overwhelmed if it receives too many requests too quickly. A service mesh is a dedicated infrastructure layer built into an application that controls service-to-service communication in a microservices architecture.
At its heart it uses Istio (for traffic control) and Knative (for event-driven tool orchestration) and stores all configuration in Git – following the GitOps approach. Beyond basic metrics: Detecting Architectural Regressions. Use this to detect any architectural regressions introduced through code or config changes.
The system might work efficiently with a specific number of concurrent users, yet become dysfunctional under the extra load of peak traffic. It is part of the wider performance engineering picture, concentrating on performance flaws in the architecture and design of any software.
As organizations plan, migrate, transform, and operate their workloads on AWS, it’s vital that they follow a consistent approach to evaluating both the on-premises architecture and the upcoming design for cloud-based architecture. Fully conceptualizing capacity requirements. How to get started.
Improving testing by using real traffic from production (Hacker News). Simpler UI Testing with CasperJS (Architects Zone – Architectural Design Patterns & Best Practices). Using MongoDB as a cache store (Architects Zone – Architectural Design Patterns & Best Practices). History of Lisp (Hacker News).
So why not use a proven architecture instead of starting from scratch on your own? This blog provides links to such architectures — for MySQL and PostgreSQL software. You can use these Percona architectures to build highly available PostgreSQL or MySQL environments or have our experts do the heavy lifting for you.
When the SLO status converges to an optimal value of 100%, and there’s substantial traffic (calls/min), BurnRate becomes more relevant for anomaly detection. SLOs must be evaluated at 100%, even when there is currently no traffic. What characterizes a weak SLO? Use the default transformation.
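As a rough illustration of the burn-rate idea mentioned above: the burn rate compares the observed error rate with the error budget implied by the SLO target, and a value well above 1.0 under substantial traffic is a strong anomaly signal. The sketch below is a simplified calculation with made-up numbers, not Dynatrace's implementation.

def burn_rate(failed: int, total: int, slo_target: float = 0.999) -> float:
    """Ratio of the observed error rate to the error budget of the SLO."""
    if total == 0:
        return 0.0                      # no traffic: nothing is burning
    error_rate = failed / total
    error_budget = 1.0 - slo_target     # e.g. 0.1% for a 99.9% target
    return error_rate / error_budget

# 50 failures out of 10,000 calls against a 99.9% SLO burns budget 5x too fast.
print(burn_rate(failed=50, total=10_000))   # -> 5.0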
When applications are deployed as many separate microservices managed by Kubernetes, these environments can become complicated, especially if the organization has a multi-cloud or hybrid-cloud architecture, or is using elements of a legacy-cloud environment.
Finally, adding additional components on the edge to filter and transform syslog messages (for example, Dynatrace OpenTelemetry distribution ) isn’t always possible due to architectural reasons or because it adds unnecessary complexity and cost of ownership when scaling your business.
For retail organizations, peak traffic can be a mixed blessing. While high-volume traffic often boosts sales, it can also compromise uptime. They also need a way to track all the services running on their distributed architectures, from multicloud environments to the edge. What is always-on infrastructure?
In case of a spike in traffic, you can automatically spin up more resources, often in a matter of seconds. Likewise, you can scale down when your application experiences decreased traffic. For example, as traffic increases, costs will too. Analyze your resource consumption and traffic patterns. Inconsistent performance.
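A simplified view of that scale-up/scale-down decision, as a sketch: choose a replica count proportional to current traffic and clamp it between a minimum and maximum. Real autoscalers (for example the Kubernetes Horizontal Pod Autoscaler) apply similar target-based math with smoothing; the per-replica capacity and bounds here are invented for illustration.

import math

def desired_replicas(requests_per_second: float,
                     target_rps_per_replica: float = 200.0,
                     min_replicas: int = 2,
                     max_replicas: int = 50) -> int:
    """Pick a replica count for the current traffic, within fixed bounds."""
    needed = math.ceil(requests_per_second / target_rps_per_replica)
    return max(min_replicas, min(max_replicas, needed))

print(desired_replicas(1500.0))   # traffic spike  -> scale up to 8 replicas
print(desired_replicas(100.0))    # quiet period   -> scale down to the minimum of 2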
Example 1: Architecture boundaries. First, they took a big step back and looked at their end-to-end architecture (Figure 2). SLO dashboard defined by architectural boundary. My web requests are all HTTP 2XX success, so why are my users getting errors? The dashboards are green, so why are users complaining? So, what did they do?
With Dynatrace OneAgent you also benefit from support for traffic routing and traffic control. OneAgent implements network zones to create traffic routing rules and limit cross-data-center traffic. Enrich OpenTelemetry instrumentation with high-fidelity data provided by OneAgent.
Today we’re proud to announce the new Dynatrace Operator, designed from the ground up to handle the lifecycle of OneAgent, Kubernetes API monitoring, OneAgent traffic routing, and all future containerized componentry such as the forthcoming extension framework. Dynatrace Operator for OneAgent, API monitoring, routing, and more.
For example, an organization might use security analytics tools to monitor user behavior and network traffic. Security analytics must also contend with the multicomponent architecture of modern IT infrastructure. While bigger data pools mean more access to potential insights, they come with the challenge of visibility.
Dynatrace's AI engine, Davis, automatically identified high traffic surges on the county website as the fire took hold. Dynatrace was able to tell the county that it had high traffic on its site due to an influx of residents specifically seeking information on the Woolsey Fire. High Traffic Notification.
Istio is one of the most popular service meshes. It allows you to manage complex microservice architectures based on configuration—there's no need to change any application code. Istio manages this with the help of Envoy, a lightweight, remotely configurable proxy server that can dynamically route traffic through a service mesh.
Consumers expect personalized, proactive, and convenient service, which traditional application architectures often struggle to provide. Best Buy is designing its journey to cut through the noise of its multicloud and multi-tool environments to immediately pinpoint the root causes of issues during peak traffic loads.
In many ways, the shift to cloud computing and the adoption of cloud-native architectures have enabled organizations to realize greater resiliency alongside scalability. But in a cloud-native world, resiliency must expand to include the ability for organizations to recover quickly from failures and ensure business continuity.
Reducing performance and architectural issues in their backend system gave them a 99% performance improvement! Singapore event last week, one of my colleagues showed a Dynatrace Service Flow for one of our customers, which consisted of 44 different layers of architecture that a single request had to travel through.