Remove Analysis Remove Latency Remove Traffic
article thumbnail

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

Migrating Critical Traffic At Scale with No Downtime — Part 1 Shyam Gala , Javier Fernandez-Ivern , Anup Rokkam Pratap , Devang Shah Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This approach has a handful of benefits.

Traffic 344
article thumbnail

Investigation of a Workbench UI Latency Issue

The Netflix TechBlog

To investigate this issue, we needed a quantitative analysis of the slowness. har file recording all communications from the browser and loaded it into a Notebook for analysis. Using this approach, we observed latencies ranging from 1 to 10 seconds, averaging 7.4 j”) for 15 seconds while running the user’s notebook.

Latency 211
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

Migrating Critical Traffic At Scale with No Downtime — Part 2 Shyam Gala , Javier Fernandez-Ivern , Anup Rokkam Pratap , Devang Shah Picture yourself enthralled by the latest episode of your beloved Netflix series, delighting in an uninterrupted, high-definition streaming experience. This is where large-scale system migrations come into play.

Traffic 282
article thumbnail

Automated Change Impact Analysis with Site Reliability Guardian

Dynatrace

SREs use Service-Level Indicators (SLI) to see the complete picture of service availability, latency, performance, and capacity across various systems, especially revenue-critical systems. The post Automated Change Impact Analysis with Site Reliability Guardian appeared first on Dynatrace news.

DevOps 230
article thumbnail

Noisy Neighbor Detection with eBPF

The Netflix TechBlog

Traditional performance analysis tools such as perf can introduce significant overhead, risking further performance degradation. Continuous instrumentation is critical to catching such matters as they emerge, and eBPF, with its hooks into the Linux scheduler with minimal overhead, enabled us to monitor run queue latency efficiently.

Latency 245
article thumbnail

What are quality gates? How to use quality gates to deliver better software at speed and scale

Dynatrace

Before a new version of the application is deployed, the software is subject to a series of load tests that evaluate capacity and performance under a series of simulated traffic and application demands. These metrics are latency, traffic, errors, and saturation, all of which must be key considerations when curating user experience.

Speed 220
article thumbnail

Seeing through hardware counters: a journey to threefold performance increase

The Netflix TechBlog

A quick canary test was free of errors and showed lower latency, which is expected given that our standard canary setup routes an equal amount of traffic to both the baseline running on 4xl and the canary on 12xl. What’s worse, average latency degraded by more than 50%, with both CPU and latency patterns becoming more “choppy.”

Hardware 363