2019 and Latency - Technology Performance Pulse

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

In 2019, Netflix moved thousands of container hosts to bare metal. Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges. It launches more than four million containers per week across thousands of underlying hosts.

AWS

AWS Entertainment Open Source Benchmarking

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

Yet, many are confined to a brief temporal window due to constraints in serving latency or training costs. In recommendation systems, context windows during inference are often limited to hundreds of eventsnot due to model capability but because these services typically require millisecond-level latency. Zhai et al.,

Tuning

Tuning Efficiency Latency Strategy

Optimize Citrix platform performance and user experience with Dynatrace (GA)

Dynatrace

JANUARY 15, 2020

Having released this functionality in an Preview Release back in September 2019, we’re now happy to announce the General Availability of our Citrix monitoring extension. Citrix platform performance—optimize your Citrix landscape with insights into user load and screen latency per server. Dynatrace news. Citrix VDA. SAP server.

Latency

Latency Performance Virtualization Infrastructure

Stuff The Internet Says On Scalability For March 1st, 2019

High Scalability

MARCH 1, 2019

It was made possible by using a low latency of 0.1 seconds, the lower the latency, the more responsive the robot. They'll learn a lot and love you forever. AWSonAir : @McDonalds uses Amazon ECS to scale to support 20,000 orders per second. antoniogm : Know why the European startup scene sucks?

Internet

Internet Internet Scalability Blockchain

Stuff The Internet Says On Scalability For May 10th, 2019

High Scalability

MAY 10, 2019

Quotable Stuff: @mjpt777 : APIs to IO need to be asynchronous and support batching otherwise the latency of calls dominate throughput and latency profile under burst conditions. . $84.4 : average yearly Facebook ad revenue per user in North America.

Internet

Internet Internet Scalability Energy

The Netflix Cosmos Platform

The Netflix TechBlog

MARCH 1, 2021

It supports both high throughput services that consume hundreds of thousands of CPUs at a time, and latency-sensitive workloads where humans are waiting for the results of a computation. The subsystems all communicate with each other asynchronously via Timestone, a high-scale, low-latency priority queuing system. Warm capacity.

Serverless

Serverless Media Latency Social Media

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

MAY 31, 2023

Uptime Institute’s 2022 Outage Analysis report found that over 60% of system outages resulted in at least $100,000 in total losses, up from 39% in 2019. At the lowest level, SLIs provide a view of service availability, latency, performance, and capacity across systems. More than one in seven outages cost more than $1 million.

Best Practices

Best Practices DevOps Latency Metrics

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges. We showcase our case studies, open-source tools in benchmarking, and how we ensure that AWS cloud services are serving our needs without compromising on tail latencies.

AWS

AWS Entertainment Open Source Benchmarking

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

Netflix runs dozens of stateful services on AWS under strict sub-millisecond tail-latency requirements, which brings unique challenges. We showcase our case studies, open-source tools in benchmarking, and how we ensure that AWS cloud services are serving our needs without compromising on tail latencies.

AWS

AWS Entertainment Open Source Benchmarking

Stuff The Internet Says On Scalability For January 25th, 2019

High Scalability

JANUARY 24, 2019

TServerless : We sat with a solution architect, apparently they are aware of the latency issue and suggested to ditch api gw and build our own solution. For those who sought to control nature through programmable machines, it responds by allowing us to build machines whose nature is that they can no longer be controlled by programs.

Internet

Internet Internet Scalability Games

ScyllaDB Trends – How Users Deploy The Real-Time Big Data Database

Scalegrid

NOVEMBER 25, 2019

ScyllaDB offers significantly lower latency which allows you to process a high volume of data with minimal delay. percentile latency is up to 11X better than Cassandra on AWS EC2 bare metal. This number is more inline with our recent 2019 Open Source Database Trends Report where 56.9% Databases Most Commonly Used with ScyllaDB.

Big Data

Big Data Database Open Source Azure

Stuff The Internet Says On Scalability For March 22nd, 2019

High Scalability

MARCH 22, 2019

µs of replication latency on lossy Ethernet, which is faster than or comparable to specialized replication systems that use programmable switches, FPGAs, or RDMA.". We achieve 5.5 matthewstoller : I just looked at Netflix’s 10K.

Internet

Internet Internet Scalability Wireless

Keeping Netflix Reliable Using Prioritized Load Shedding

The Netflix TechBlog

NOVEMBER 2, 2020

Those two metrics are approximate indicators of failures and latency. Netflix experienced a similar issue with the same potential impact as the outage seen in 2019. Service throttling Zuul can sense when a back-end service is in trouble by monitoring the error rates and concurrent requests to that service.

Traffic

Traffic Metrics Infrastructure Architecture

So many bad takes?—?What is there to learn from the Prime Video microservices to monolith story

Adrian Cockcroft

MAY 6, 2023

I don’t advocate “Serverless Only”, and I recommended that if you need sustained high traffic, low latency and higher efficiency, then you should re-implement your rapid prototype as a continuously running autoscaled container, as part of a larger serverless event driven architecture, which is what they did.

Serverless

Serverless Lambda Best Practices Traffic

Open Observability – Part 1: Distributed tracing and observability

Dynatrace

JUNE 25, 2021

Already in the 2000s, service-oriented architectures (SOA) became popular, and operations teams discovered the need to understand how transactions traverse through all tiers and how these tiers contributed to the execution time and latency. In 2019, the OpenCensus and OpenTracing projects merged into what we now know as OpenTelemetry.

Open Source

Open Source Monitoring Google Systems

USENIX LISA2021 Computing Performance: On the Horizon

Brendan Gregg

JULY 4, 2021

## References I've reproduced the talk references below, so you can click on links: - [Gregg 08] Brendan Gregg, “ZFS L2ARC,” [link] Jul 2008 - [Gregg 10] Brendan Gregg, “Visualizations for Performance Analysis (and More),” [link] 2010 - [Greenberg 11] Marc Greenberg, “DDR4: Double the speed, double the latency? Ford, et al., “TCP

Performance

Performance Latency Hardware Storage

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

If we had an ID for each streaming session then distributed tracing could easily reconstruct session failure by providing service topology, retry and error tags, and latency measurements for all service calls. Using simple lookup indices in Cassandra gives us the ability to maintain acceptable read latencies while doing heavy writes.

Infrastructure

Infrastructure Transportation Storage Open Source

Achieving observability in async workflows

The Netflix TechBlog

MAY 14, 2021

We are expected to process 1,000 watermarks for a single distribution in a minute, with non-linear latency growth as the number of watermarks increases. The goal is to process these documents as fast as possible and reliably deliver them to recipients while offering strong observability to both our users and internal teams.

Traffic

Traffic Java Latency Google

O’Reilly serverless survey 2019: Concerns, what works, and what to expect

O'Reilly

NOVEMBER 12, 2019

latency, startup, mocking, etc.) The results in Figure 12 reflect what we know of the cloud market and mirror what we found in our cloud native survey from earlier in 2019. 1] The serverless adoption survey ran in June 2019. “Integration/testing is harder” ranked as the third biggest worry, noted by 30% of respondents.

Serverless

Serverless Architecture FinTech Infrastructure

Memory-Optimized TempDB Metadata in SQL Server 2019

SQL Shack

JULY 10, 2019

TempDB is one of the biggest sources of latency in […]. By removing disk-based storage and the challenge of copying data in and out of memory, query speeds in SQL Server can be improved by orders of magnitude.

Servers

Servers Latency Storage Speed

Understanding operational 5G: a first measurement study on its coverage, performance and energy consumption

The Morning Paper

OCTOBER 4, 2020

We are standing on the eve of the 5G era… 5G, as a monumental shift in cellular communication technology, holds tremendous potential for spurring innovations across many vertical industries, with its promised multi-Gbps speed, sub-10 ms low latency, and massive connectivity. Throughput and latency. km university campus.

Energy

Energy Latency Performance Network

Four tips to maximise your time at DevOps Enterprise Summit 2019, London

Tasktop

JUNE 18, 2019

In one week’s time, thousands of IT and business professionals will descend on London for the latest iteration of DevOps Enterprise Summit London 2019 (June 25-27 – InterContinental O2, London, UK). Here are four tips to get the most out of DOES London 2019: Tip #1 – Develop a plan of attack. The countdown is on.

DevOps

DevOps Network Software Software

Stuff The Internet Says On Scalability For December 21st, 2018

High Scalability

DECEMBER 21, 2018

swardley: X : What's going to happen in cloud in 2019? Tim Bray : How to talk about [Serverless Latency] · To start with, don’t just say “I need 120ms.” And if you know someone with hearing problems they might find Live CC useful. 202,157 flights tracked! Me : Nothing special. 3) Serverless will rocket.

Internet

Internet Internet Scalability Serverless

Connecting MongoDB to Ruby with Self-Signed Certificates for SSL

Scalegrid

JUNE 18, 2019

The easiest way to induce failover is to run the rs.stepDown() command: RS-example-0:PRIMARY> rs.stepDown() 2019-04-18T19:44:42.257+0530 E QUERY [thread1] Error: error doing query: failed: network error while attempting to run command 'replSetStepDown' on host 'SG-example-1.servers.mongodirector.com:27017' 27017 (sg-example-17026.servers.mongodirector.com:27017,

C++

C++ Servers Database Testing

Virtual consensus in Delos

The Morning Paper

NOVEMBER 8, 2020

The initial version of Delos went into production after eight months using a ZooKeeper-backed Loglet implementation, and then four months later it was swapped out for a new custom-built NativeLoglet that gave a 10x improvement in end-to-end latency. For Facebook’s Delos, reconfiguration latencies of 10s of ms are ok.

Virtualization

Virtualization Latency Storage Systems

DBLog: A Generic Change-Data-Capture Framework

The Netflix TechBlog

DECEMBER 17, 2019

Passive instances across regions are also possible, though it is recommended to operate in the same region as the database host in order to keep the change capture latencies low. Production usage DBLog is the foundation of the MySQL and PostgreSQL Connectors at Netflix, which are used in Delta. Beresford, and Boerge Svingen.

Database

Database Traffic Transportation Open Source

DBLog: A Generic Change-Data-Capture Framework

The Netflix TechBlog

DECEMBER 17, 2019

Passive instances across regions are also possible, though it is recommended to operate in the same region as the database host in order to keep the change capture latencies low. Production usage DBLog is the foundation of the MySQL and PostgreSQL Connectors at Netflix, which are used in Delta. Beresford, and Boerge Svingen.

Database

Database Traffic Transportation Open Source

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Brendan Gregg

FEBRUARY 28, 2023

My personal opinion is that I don't see a widespread need for more capacity given horizontal scaling and servers that can already exceed 1 Tbyte of DRAM; bandwidth is also helpful, but I'd be concerned about the increased latency for adding a hop to more memory. Ford, et al., “TCP

Performance

Performance Latency Cache Virtualization

Software Testing Trends 2021 – What can we expect?

Testsigma

FEBRUARY 12, 2021

Are you aware that the scale of the app testing industry in 2019 was over USD$ 40 billion? In 2019, we had previously projected the demand for IoT research at $781.96billion. 38% of organisations were expected to introduce machine-learning initiatives in 2019, according to the Capgemini World Efficiency survey. billion in 2016.

Artificial Intelligence

Artificial Intelligence Software Software IoT

As-Salaam-Alaikum: The cloud arrives in the Middle East!

All Things Distributed

SEPTEMBER 25, 2017

The Region will be in the heart of Gulf Cooperation Council (GCC) countries, and we're aiming to have it ready by early 2019. This Region will consist of three Availability Zones at launch, and it will provide even lower latency to users across the Middle East. This news marks the 22nd AWS Region we have announced globally.

Cloud

Cloud Education Energy Government

The Performance Inequality Gap, 2023

Alex Russell

DECEMBER 18, 2022

Without beating around the bush, our ASP 2019 device was an Android that cost between $300-$350, new and unlocked. We've been tracking the mobile device landscape more carefully over the years and, as with desktop, ASP s today are tomorrow's performance destiny. But this also gives rise to the critique: OK, but does it work?

Performance

Performance Mobile Network Latency

KeyCDN Launches Image Processing

KeyCDN

MAY 14, 2019

This allows for global processing, which means no matter where your users are located they will receive processed images with low latency. Our image processing is advantageous because it combines high performance image transformation and optimization with our global CDN. This is achieved by defining the applicable image processing parameters.

Processing

Processing Cache Network Latency

The Performance Inequality Gap, 2021

Alex Russell

MARCH 6, 2021

India has been the epicentre of smartphone growth in recent years, owing to the sheer size of its market and an accelerating shift away from feature phones which made up the majority of Indian mobile devices until as late as 2019. So what did $150USD fetch in 2019? The smooth, dulcet tones of 2019's Moto E6. " package.

Performance

Performance Network Mobile Metrics

Automating chaos experiments in production

The Morning Paper

JULY 4, 2019

Two failure modes we focus on are a service becoming slower (increase in response latency) or a service failing outright (returning errors). The criticality score is combined with a safety score and experiment weight (failure experiments, then latency, than failure inducing latency) to produce the final prioritization score.

Latency

Latency Engineering Metrics Traffic

Expanding the AWS Cloud – Introducing the AWS Europe (Stockholm) Region

All Things Distributed

DECEMBER 12, 2018

They can run applications in Sweden, serve end users across the Nordics with lower latency, and leverage advanced technologies such as containers, serverless computing, and more. For VR, this is a journey that is already one-third complete and expected to be finished by the end of 2019.

AWS

AWS Cloud Games Serverless

The Speed of Time

Brendan Gregg

SEPTEMBER 25, 2021

A Cassandra database cluster had switched to Ubuntu and noticed write latency increased by over 30%. The change was obvious in the production graphs, showing a drop in write latencies: Once tested more broadly, it showed the write latencies dropped by 43%, delivering slightly better performance than on CentOS.

Speed

Speed Java AWS Virtualization

Optimizing Performance With Resource Hints

Smashing Magazine

APRIL 17, 2019

2019-04-17T12:30:16+02:00. 2019-04-29T18:34:58+00:00. This typically happens once per server and takes up valuable time — especially if the server is very distant from the browser and network latency is high. Optimizing Performance With Resource Hints. Optimizing Performance With Resource Hints. Drew McLellan.

Performance

Performance Servers Games Cache

USENIX LISA2021 Computing Performance: On the Horizon

Brendan Gregg

JULY 4, 2021

## References I've reproduced the talk references below, so you can click on links: - [Gregg 08] Brendan Gregg, “ZFS L2ARC,” [link] Jul 2008 - [Gregg 10] Brendan Gregg, “Visualizations for Performance Analysis (and More),” [link] 2010 - [Greenberg 11] Marc Greenberg, “DDR4: Double the speed, double the latency? Ford, et al., “TCP

Performance

Performance Latency Hardware Storage

Reverb: speculative debugging for web applications

The Morning Paper

JANUARY 26, 2020

This week we’ll be looking at a selection of papers from the 2019 edition of the ACM Symposium of Cloud Computing ( SoCC ). Reverb: speculative debugging for web applications , Netravali & Mickens, SOCC’19. candidate bug-fixes) during replay.

Programming

Programming Servers Network Latency

New Year’s Updates

John McCalpin

JANUARY 9, 2019

As part of my attempt to become organized in 2019, I found several draft blog entries that had never been completed and made public.

Latency

Latency Architecture Servers Performance

New Year’s Updates

John McCalpin

JANUARY 9, 2019

As part of my attempt to become organized in 2019, I found several draft blog entries that had never been completed and made public.

Latency

Latency Architecture Servers Performance

The convoy phenomenon

The Morning Paper

JUNE 30, 2019

Today we’re jumping from HotOS topics of 2019, to hot topics of 1977! In such a situation I’d expect to see unusually high latencies, but normal throughput). The convoy phenomenon Blasgen et al., IBM Research Report 1977 (revised 1979). What is a convoy and why do they form?

Traffic

Traffic Latency Programming Scalability

KeyCDN Launches POP in Romania

KeyCDN

JANUARY 31, 2019

The next closest active POP location to Bucharest was Istanbul which was still almost 900km away; this distance adds up in terms of latency. Before the implementation of our Bucharest POP, Romanian users were delivered content from our surrounding KeyCDN POPs.

Internet

Internet Internet Ecommerce Speed

A Look at JAMstack’s Speed, By the Numbers

CSS - Tricks

NOVEMBER 1, 2019

The FCP distribution for the 10th, 50th and 90th percentile values as reported on August 1, 2019. TTI distribution for the 10th, 50th and 90th percentile values as reported on August 1, 2019. TTFB mobile speed distribution (CrUX, July 2019). FCP mobile speed distribution (CrUX, July 2019). First Contentful Paint.

Speed

Speed Mobile Metrics Scalability

Netflix at AWS re:Invent 2019

Foundation Model for Personalized Recommendation

Trending Sources

Optimize Citrix platform performance and user experience with Dynatrace (GA)

Stuff The Internet Says On Scalability For March 1st, 2019

Stuff The Internet Says On Scalability For May 10th, 2019

The Netflix Cosmos Platform

Site reliability done right: 5 SRE best practices that deliver on business objectives

Netflix at AWS re:Invent 2019

Netflix at AWS re:Invent 2019

Stuff The Internet Says On Scalability For January 25th, 2019

ScyllaDB Trends – How Users Deploy The Real-Time Big Data Database

Stuff The Internet Says On Scalability For March 22nd, 2019

Keeping Netflix Reliable Using Prioritized Load Shedding

So many bad takes?—?What is there to learn from the Prime Video microservices to monolith story

Open Observability – Part 1: Distributed tracing and observability

USENIX LISA2021 Computing Performance: On the Horizon

Building Netflix’s Distributed Tracing Infrastructure

Achieving observability in async workflows

O’Reilly serverless survey 2019: Concerns, what works, and what to expect

Memory-Optimized TempDB Metadata in SQL Server 2019

Understanding operational 5G: a first measurement study on its coverage, performance and energy consumption

Four tips to maximise your time at DevOps Enterprise Summit 2019, London

Stuff The Internet Says On Scalability For December 21st, 2018

Connecting MongoDB to Ruby with Self-Signed Certificates for SSL

Virtual consensus in Delos

DBLog: A Generic Change-Data-Capture Framework

DBLog: A Generic Change-Data-Capture Framework

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Software Testing Trends 2021 – What can we expect?

As-Salaam-Alaikum: The cloud arrives in the Middle East!

The Performance Inequality Gap, 2023

KeyCDN Launches Image Processing

The Performance Inequality Gap, 2021

Automating chaos experiments in production

Expanding the AWS Cloud – Introducing the AWS Europe (Stockholm) Region

The Speed of Time

Optimizing Performance With Resource Hints

USENIX LISA2021 Computing Performance: On the Horizon

Reverb: speculative debugging for web applications

New Year’s Updates

New Year’s Updates

The convoy phenomenon

KeyCDN Launches POP in Romania

A Look at JAMstack’s Speed, By the Numbers

Stay Connected