Design, Entertainment and Latency - Technology Performance Pulse

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

Yet, many are confined to a brief temporal window due to constraints in serving latency or training costs. These insights have shaped the design of our foundation model, enabling a transition from maintaining numerous small, specialized models to building a scalable, efficient system. At Netflix, our mission is to entertain the world.

Tuning

Tuning Efficiency Latency Strategy

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. In this talk, we share how Netflix deploys systems to meet its demands, Ceph’s design for high availability, and results from our benchmarking.

AWS

AWS Entertainment Open Source Benchmarking

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

JUNE 13, 2023

Behind these perfect moments of entertainment is a complex mechanism, with numerous gears and cogs working in harmony. By collecting and analyzing key performance metrics of the service over time, we can assess the impact of the new changes and determine if they meet the availability, latency, and performance requirements.

Traffic

Traffic Metrics Systems Strategy

Growth Engineering at Netflix?—?Automated Imagery Generation

The Netflix TechBlog

FEBRUARY 9, 2021

entertainment?—?and Before designing a solution it’s important to understand the main product requirements for such a feature: The content needs to be new, relevant, and regional (not all countries have the same catalogue). To reduce latency, assets should be generated in an offline fashion and not in real time.

Engineering

Engineering Storage Latency Entertainment

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

MARCH 7, 2024

Since its inception , Metaflow has been designed to provide a human-friendly API for building data and ML (and today AI) applications and deploying them in our production infrastructure frictionlessly. In other cases, it is more convenient to share the results via a low-latency API.

Systems

Systems Media Cache Open Source

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. In this talk, we share how Netflix deploys systems to meet its demands, Ceph’s design for high availability, and results from our benchmarking.

AWS

AWS Entertainment Open Source Benchmarking

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. In this talk, we share how Netflix deploys systems to meet its demands, Ceph’s design for high availability, and results from our benchmarking.

AWS

AWS Entertainment Open Source Benchmarking

Snap: a microkernel approach to host networking

The Morning Paper

NOVEMBER 10, 2019

It’s been clear for a while that software designed explicitly for the data center environment will increasingly want/need to make different design trade-offs to e.g. general-purpose systems software that you might install on your own machines. The desire for CPU efficiency and lower latencies is easy to understand. Enter Google!

Network

Network Transportation Latency Entertainment

Growth Engineering at Netflix- Creating a Scalable Offers Platform

The Netflix TechBlog

FEBRUARY 9, 2021

In particular, it’s our job to design and build the systems and protocols that enable customers from all over the world to sign up for Netflix with the plan features and incentives that best suit their needs. This was a perfectly sufficient design for many years. How, when, and where people want to be entertained continues to evolve.

Engineering

Engineering Scalability Architecture Innovation

Expanding the Cloud - Cluster Compute Instances for Amazon EC2.

All Things Distributed

JULY 13, 2010

Cluster Computer Instances for Amazon EC2 are a new instance type specifically designed for High Performance Computing applications. Other industries using Amazon EC2 for HPC-style workloads include pharmaceuticals, oil exploration, industrial and automotive design, media and entertainment, and more. Recent Entries.

Cloud

Cloud AWS Automotive Latency

I Actually Chatted with ChatGPT

O'Reilly

JANUARY 16, 2024

I’m personally interested in this topic since I am a professor who researches human-computer interaction, user experience design, and cognitive science , so AI voice interfaces are fascinating to me. I’ve recently been brainstorming ideas for how to design such a system and how to deal with the practical challenges of scaling and maintenance.

Internet

Internet Internet Entertainment Design

What is a Real-Time Data Platform?

VoltDB

AUGUST 8, 2024

Real-time data platform defined A real-time data platform is designed to ingest, process, analyze, and act upon data instantaneously — right when it’s generated or received. Processing such high data volumes requires robust infrastructure and scalable architecture designed for high performance and high availability.

IoT

IoT Latency Traffic Logistics

Working at Netflix 2017

Brendan Gregg

MAY 16, 2017

We're on the EC2 cloud, which has great scalability, and our own cloud architecture of microservices is also designed for scalability. A latency outlier issue that happened every 15 minutes. That'd make a great story, but it didn't happen. But there was no single crisis point. Java core dump analysis for a crashing JVM. -

Java

Java Entertainment Engineering Scalability

Progress Delayed Is Progress Denied

Alex Russell

APRIL 29, 2021

Combined with (delayed) advanced graphics APIs and threading support, WebXR enables critical immersive, low-friction commerce and entertainment on the web. Apple's policy against browser engine choice adds years of delays beyond the (expected) delay of design iteration, specification authoring, and browser feature development.

Media

Media Games Education Engineering

Part 3: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

JANUARY 6, 2025

Dashboard DesignTips Rina Chang , SusieLu What is design, and why does it matter? Often people think design is about how things look, but design is actually about how things work. Everything is designed, because were all making choices about how things work, but not everything is designed well.

Analytics

Analytics Engineering Cache Entertainment

Technology Performance Pulse

Foundation Model for Personalized Recommendation

Netflix at AWS re:Invent 2019

Trending Sources

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

Growth Engineering at Netflix?—?Automated Imagery Generation

Supporting Diverse ML Systems at Netflix

Netflix at AWS re:Invent 2019

Netflix at AWS re:Invent 2019

Snap: a microkernel approach to host networking

Growth Engineering at Netflix- Creating a Scalable Offers Platform

Expanding the Cloud - Cluster Compute Instances for Amazon EC2.

I Actually Chatted with ChatGPT

What is a Real-Time Data Platform?

Working at Netflix 2017

Progress Delayed Is Progress Denied

Part 3: A Survey of Analytics Engineering Work at Netflix

Stay Connected