Remove Entertainment Remove Infrastructure Remove Latency
article thumbnail

Foundation Model for Personalized Recommendation

The Netflix TechBlog

Yet, many are confined to a brief temporal window due to constraints in serving latency or training costs. In recommendation systems, context windows during inference are often limited to hundreds of eventsnot due to model capability but because these services typically require millisecond-level latency.

Tuning 163
article thumbnail

Netflix at AWS re:Invent 2019

The Netflix TechBlog

Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target. Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges.

AWS 38
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

London Calling! An AWS Region is coming to the UK!

All Things Distributed

This region will provide even lower latency and strong data sovereignty to local users. We are committed to meeting our customers’ increasing needs for capacity and for powerful AWS services that eliminate the heavy lifting of the underlying IT infrastructure -- allowing them to focus more of their precious resources on their core business.

AWS 167
article thumbnail

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

Berg , Romain Cledat , Kayla Seeley , Shashank Srikanth , Chaoying Wang , Darin Yu Netflix uses data science and machine learning across all facets of the company, powering a wide range of business applications from our internal infrastructure and content demand modeling to media understanding.

Systems 235
article thumbnail

Ciao Milano! – An AWS Region is coming to Italy!

All Things Distributed

Currently we have 57 Availability Zones across 19 technology infrastructure Regions. We needed to serve our growing base of startup, government, and enterprise customers across many vertical industries, including automotive, financial services, media and entertainment, high technology, education, and energy.

AWS 164
article thumbnail

Expanding the Cloud – An AWS Region is coming to Hong Kong

All Things Distributed

This enables customers to serve content to their end users with low latency, giving them the best application experience. In 2008, AWS opened a point of presence (PoP) in Hong Kong to enable customers to serve content to their end users with low latency. Since then, AWS has added two more PoPs in Hong Kong, the latest in 2016.

AWS 125
article thumbnail

Telltale: Netflix Application Monitoring Simplified

The Netflix TechBlog

Infrastructure change events. For example, a latency increase is less critical than error rate increase and some error codes are less critical than others. A healthy Netflix service enables us to entertain the world. Mantis real-time streaming data. Canary launches and deployments. The health of upstream and downstream services.