Efficiency, Entertainment and Latency - Technology Performance Pulse

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

Yet, many are confined to a brief temporal window due to constraints in serving latency or training costs. These insights have shaped the design of our foundation model, enabling a transition from maintaining numerous small, specialized models to building a scalable, efficient system. At Netflix, our mission is to entertain the world.

Tuning

Tuning Efficiency Latency Strategy

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. This talk explores the journey, learnings, and improvements to performance analysis, efficiency, reliability, and security. Wednesday?—?December

AWS

AWS Entertainment Open Source Benchmarking

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

JUNE 13, 2023

Behind these perfect moments of entertainment is a complex mechanism, with numerous gears and cogs working in harmony. By collecting and analyzing key performance metrics of the service over time, we can assess the impact of the new changes and determine if they meet the availability, latency, and performance requirements.

Traffic

Traffic Metrics Systems Strategy

Understanding What Kubernetes Is Used For: The Key to Cloud-Native Efficiency

Percona

NOVEMBER 9, 2023

Kubernetes can be complex, which is why we offer comprehensive training that equips you and your team with the expertise and skills to manage database configurations, implement industry best practices, and carry out efficient backup and recovery procedures. Just consider the sheer number of people who stream Netflix every night!

Efficiency

Efficiency Cloud Healthcare Open Source

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

MARCH 7, 2024

Example use case: Content Knowledge Graph Our knowledge graph of the entertainment world encodes relationships between titles, actors and other attributes of a film or series, supporting all aspects of business at Netflix. In other cases, it is more convenient to share the results via a low-latency API.

Systems

Systems Media Cache Open Source

Snap: a microkernel approach to host networking

The Morning Paper

NOVEMBER 10, 2019

You need a lot of software engineers and the willingness to rewrite a lot of software to entertain that idea. Here are the bombshell paragraphs: Our datacenter applications seek ever more CPU-efficient and lower-latency communication, which Pony Express delivers. Enter Google! Emphasis mine). cores, vs 22Gbps using 1.2

Network

Network Transportation Latency Entertainment

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges. Wednesday?—?December

AWS

AWS Entertainment Open Source Benchmarking

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges. Wednesday?—?December

AWS

AWS Entertainment Open Source Benchmarking

What is a Real-Time Data Platform?

VoltDB

AUGUST 8, 2024

Improved operational efficiency Real-time data platforms enhance operational efficiency by providing timely insights and automating processes. As an added bonus, as operational efficiency improves, margins increase and money is spent more effectively.

IoT

IoT Latency Traffic Logistics

Progress Delayed Is Progress Denied

Alex Russell

APRIL 29, 2021

Combined with (delayed) advanced graphics APIs and threading support, WebXR enables critical immersive, low-friction commerce and entertainment on the web. Efficiently enables new styles of drawing content on the web , removing many hard tradeoffs between visual richness , accessibility, and performance. Form-associated Web Components.

Media

Media Games Education Engineering

Solaris to Linux Migration 2017

Brendan Gregg

SEPTEMBER 5, 2017

. - **eBPF**: tracing features completed in 2016, this provides efficient programmatic tracing to existing kernel frameworks. Here's some output from my zfsdist tool, in bcc/BPF, which measures ZFS latency as a histogram on Linux: # zfsdist. Tracing ZFS operation latency. Hit Ctrl-C to end. ^C

Virtualization

Virtualization AWS Engineering Hardware

Part 3: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

JANUARY 6, 2025

Check out Part 1 , which detailed how were empowering Netflix to efficiently produce and effectively deliver high quality, actionable analytic insights across the company and Part 2 , which stepped through a few exciting business applications for Analytics Engineering. Need to catch up?

Analytics

Analytics Engineering Cache Entertainment

Technology Performance Pulse

Foundation Model for Personalized Recommendation

Netflix at AWS re:Invent 2019

Trending Sources

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

Understanding What Kubernetes Is Used For: The Key to Cloud-Native Efficiency

Supporting Diverse ML Systems at Netflix

Snap: a microkernel approach to host networking

Netflix at AWS re:Invent 2019

Netflix at AWS re:Invent 2019

What is a Real-Time Data Platform?

Progress Delayed Is Progress Denied

Solaris to Linux Migration 2017

Part 3: A Survey of Analytics Engineering Work at Netflix

Stay Connected