Entertainment and Latency - Technology Performance Pulse

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

Yet, many are confined to a brief temporal window due to constraints in serving latency or training costs. In recommendation systems, context windows during inference are often limited to hundreds of events, not due to model capability but because these services typically require millisecond-level latency.

Tuning

Tuning Efficiency Latency Strategy

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges. Wednesday – December

AWS

AWS Entertainment Open Source Benchmarking

Migrating Critical Traffic At Scale with No Downtime – Part 2

The Netflix TechBlog

JUNE 13, 2023

Behind these perfect moments of entertainment is a complex mechanism, with numerous gears and cogs working in harmony. By collecting and analyzing key performance metrics of the service over time, we can assess the impact of the new changes and determine if they meet the availability, latency, and performance requirements.

Traffic

Traffic Metrics Systems Strategy

Telltale: Netflix Application Monitoring Simplified

The Netflix TechBlog

AUGUST 13, 2020

For example, a latency increase is less critical than error rate increase and some error codes are less critical than others. A healthy Netflix service enables us to entertain the world. Client metrics and QoE changes. Alerts triggered by our alerting platform. Telltale is application monitoring simplified.

Monitoring

Monitoring Tuning Traffic Metrics

Growth Engineering at Netflix – Automated Imagery Generation

The Netflix TechBlog

FEBRUARY 9, 2021

entertainment – and Server-generated assets, since client-side generation would require the retrieval of many individual images, which would increase latency and time-to-render. To reduce latency, assets should be generated in an offline fashion and not in real time. the background image shown above).

Engineering

Engineering Storage Latency Entertainment

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

MARCH 7, 2024

Example use case: Content Knowledge Graph. Our knowledge graph of the entertainment world encodes relationships between titles, actors and other attributes of a film or series, supporting all aspects of business at Netflix. In other cases, it is more convenient to share the results via a low-latency API.

Systems

Systems Media Cache Open Source

Ciao Milano! – An AWS Region is coming to Italy!

All Things Distributed

NOVEMBER 13, 2018

We needed to serve our growing base of startup, government, and enterprise customers across many vertical industries, including automotive, financial services, media and entertainment, high technology, education, and energy. In 2012, Amazon opened its first Italian office and its first Italian point of presence (PoP) based in Milan.

AWS

AWS Energy Automotive Traffic

Snap: a microkernel approach to host networking

The Morning Paper

NOVEMBER 10, 2019

You need a lot of software engineers and the willingness to rewrite a lot of software to entertain that idea. Here are the bombshell paragraphs: Our datacenter applications seek ever more CPU-efficient and lower-latency communication, which Pony Express delivers. The desire for CPU efficiency and lower latencies is easy to understand.

Network

Network Transportation Latency Entertainment

Expanding the Cloud - Cluster Compute Instances for Amazon EC2.

All Things Distributed

JULY 13, 2010

Other industries using Amazon EC2 for HPC-style workloads include pharmaceuticals, oil exploration, industrial and automotive design, media and entertainment, and more. When instances are placed in a cluster they have access to low latency, non-blocking 10 Gbps networking when communicating with the other instances in the cluster.

Cloud

Cloud AWS Automotive Latency

Growth Engineering at Netflix – Creating a Scalable Offers Platform

The Netflix TechBlog

FEBRUARY 9, 2021

Lower latency as a result of fewer service calls, which means fewer errors for our visitors. How, when, and where people want to be entertained continues to evolve. Configuration instead of code for updating SKU data, which improves innovation velocity. The world is constantly changing. Device capabilities continue to improve.

Engineering

Engineering Scalability Architecture Innovation

Understanding What Kubernetes Is Used For: The Key to Cloud-Native Efficiency

Percona

NOVEMBER 9, 2023

Entertainment: To guarantee that viewers have continuous access to films and TV series, film studios and television networks rely on Kubernetes for content delivery and streaming. Telecommunications: By guaranteeing low-latency communication, Kubernetes assists the telecom sector in quickly deploying 5G and edge computing applications.

Efficiency

Efficiency Cloud Healthcare Open Source

Why Traditional Monitoring Isn’t Enough for Modern Web Applications

Dotcom-Monitor

MAY 12, 2020

Users who rely on websites for their fundamental needs or entertainment will not tolerate even a few seconds of delay. Network latency. Network latency can be affected due to. Proactive detection and diagnosis of web application and page performance issues are necessary. Connection time.

Monitoring

Monitoring Entertainment Hardware Traffic

What is a Real-Time Data Platform?

VoltDB

AUGUST 8, 2024

One common problem for real-time data platforms is latency, particularly at scale. High latency often slows down time to insights, which reduces the organization’s ability to use timely information for decision-making.

IoT

IoT Latency Traffic Logistics

Page Simulator

The Netflix TechBlog

NOVEMBER 12, 2019

If you are interested in helping us solve these types of problems and helping entertain the world, please take a look at some of our open positions on the Netflix jobs page. This, in turn, has resulted in improvements of the personalized homepages for our members.

Metrics

Metrics Testing Government Systems

Working at Netflix 2017

Brendan Gregg

MAY 16, 2017

A latency outlier issue that happened every 15 minutes. You can focus on engineering and getting stuff done, with awesome staff who will help you. Mission: I spoke about this in my 2015 post, but it's worth repeating: our mission is to improve how entertainment is consumed worldwide, by building a great product that people choose to buy.

Java

Java Entertainment Engineering Scalability

I Actually Chatted with ChatGPT

O'Reilly

JANUARY 16, 2024

Unpredictable wait times: Wait times (latency) for ChatGPT’s responses are unpredictable, and there aren’t audio cues to help me establish an expectation for how long I need to wait before it responds. Alternatively, cars are now starting to directly embed AI into entertainment systems (e.g.,

Internet

Internet Internet Entertainment Design

Progress Delayed Is Progress Denied

Alex Russell

APRIL 29, 2021

Combined with (delayed) advanced graphics APIs and threading support, WebXR enables critical immersive, low-friction commerce and entertainment on the web. For heavily latency-sensitive use-cases like WebXR, this is a critical component in delivering a good experience. Offscreen Canvas. TextEncoderStream & TextDecoderStream.

Media

Media Games Education Engineering

Solaris to Linux Migration 2017

Brendan Gregg

SEPTEMBER 5, 2017

Here's some output from my zfsdist tool, in bcc/BPF, which measures ZFS latency as a histogram on Linux: # zfsdist. Tracing ZFS operation latency. I also wrote and published an entertaining SMF manifest that played music. Many new tools can now be written, and the main toolkit we're working on is [bcc]. Hit Ctrl-C to end. ^C

Virtualization

Virtualization AWS Engineering Hardware

London Calling! An AWS Region is coming to the UK!

All Things Distributed

NOVEMBER 5, 2015

This region will provide even lower latency and strong data sovereignty to local users. Media and Entertainment – BBC, Channel 4, ITV, News UK, The FT, Trinity Mirror, The Guardian. The AWS UK region will be our third in the European Union (EU), and we're shooting to have it ready by the end of 2016 (or early 2017).

AWS

AWS Retail Entertainment IoT

Expanding the Cloud – An AWS Region is coming to Hong Kong

All Things Distributed

JUNE 20, 2017

This enables customers to serve content to their end users with low latency, giving them the best application experience. In 2008, AWS opened a point of presence (PoP) in Hong Kong to enable customers to serve content to their end users with low latency. Since then, AWS has added two more PoPs in Hong Kong, the latest in 2016.

AWS

AWS Logistics Cloud Social Media

Part 3: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

JANUARY 6, 2025

Align on Performance Expectations: A major challenge during development was managing API latency. To learn more, follow the Netflix Research Site, and if you are also interested in entertaining the world, have a look at our open roles! Much of this could have been mitigated by aligning on performance expectations from the outset.

Analytics

Analytics Engineering Cache Entertainment
