Migrating Critical Traffic At Scale with No Downtime - Part 2

The Netflix TechBlog

Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, Devang Shah

Picture yourself enthralled by the latest episode of your beloved Netflix series, delighting in an uninterrupted, high-definition streaming experience. This is where large-scale system migrations come into play.


RabbitMQ vs. Kafka: Key Differences

Scalegrid

Kafka's partitioned log architecture supports both queuing and publish-subscribe models, allowing it to handle large-scale event processing with minimal latency. Apache Kafka uses a custom binary protocol over TCP for high throughput and low latency. However, performance can decline under high traffic conditions.


Maximize user experience with out-of-the-box service-performance SLOs

Dynatrace

These signals (latency, traffic, errors, and saturation) provide a solid means of proactively monitoring operative systems via SLOs and tracking business success. Performance typically addresses response times or latency aspects and contributes to the four golden signals. This is what Dynatrace captures as response time.


Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

Rajiv Shringi, Vinay Chella, Kaidan Fullerton, Oleksii Tkachuk, Joey Lynch

Introduction

As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming, the ability to ingest and store vast amounts of temporal data, often reaching petabytes, with millisecond access latency has become increasingly vital.


Edgar: Solving Mysteries Faster with Observability

The Netflix TechBlog

Edgar captures 100% of interesting traces, as opposed to sampling a small fixed percentage of traffic. As a request flows between services, each distinct unit of work is documented as a span. Telltale provides Edgar with latency benchmarks that indicate if the individual trace’s latency is abnormal for this given service.


Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

For each route we migrated, we wanted to make sure we were not introducing any regressions: either in the form of missing (or worse, wrong) data, or by increasing the latency of each endpoint. You can find a lot more details about how this works in the Spinnaker canaries documentation.

Replay Testing

Enter replay testing.


Achieving observability in async workflows

The Netflix TechBlog

Prodicle Distribution

Prodicle Distribution allows a production office coordinator to send secure, watermarked documents, such as scripts, to crew members as attachments or links, and track delivery. One distribution job might result in several thousand watermarked documents and links being created. Things got hairy.
