Timestone: Netflix’s High-Throughput, Low-Latency Priority Queueing System with Built-in Support for Non-Parallelizable Workloads, by Kostas Christidis. Timestone is a high-throughput, low-latency priority queueing system we built in-house to support the needs of Cosmos, our media encoding platform.
Critical assets are far too valuable to leave on someone else’s servers. Risk: service shutdowns. This is exactly what RawGit did in October 2018, yet (at the time of writing) a crude GitHub code search still yielded over a million references to the now-sunset service, and almost 20,000 live sites are still linking to it!
It supports both high-throughput services that consume hundreds of thousands of CPUs at a time and latency-sensitive workloads where humans are waiting for the results of a computation. The subsystems all communicate with each other asynchronously via Timestone, a high-scale, low-latency priority queueing system.
Tim Bray: How to talk about [Serverless Latency]. To start with, don’t just say “I need 120ms.” And in most mainstream applications, you should be able to get there with serverless.
I summarized these topics and more as a plenary conference talk, including my own predictions (as a senior performance engineer) for the future of computing performance, with a focus on back-end servers.
Screenshot: tracing read latency for PID 181:

```
# bpftrace -e 'kprobe:vfs_read /pid == 181/ { @start[tid] = nsecs; } kretprobe:vfs_read /@start[tid]/ { @ns = hist(nsecs - @start[tid]); delete(@start[tid]); }'
```

Here's a table of key differences between DTrace and bpftrace as of August 2018. It's shaping up to be a DTrace version 2.0: I wrote seeksize.d
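The same one-liner, unrolled into a script with comments for readability (behavior unchanged):

```
// Histogram of vfs_read() latency for one process (PID 181 here).
kprobe:vfs_read /pid == 181/
{
	@start[tid] = nsecs;              // record entry timestamp, keyed by thread ID
}

kretprobe:vfs_read /@start[tid]/
{
	@ns = hist(nsecs - @start[tid]);  // bucket elapsed nanoseconds into a histogram
	delete(@start[tid]);              // free the map entry
}
```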
With Tel Aviv being the technology capital of Israel, it's the ideal edge server location. The image below shows a significant drop in latency once we launched the new point of presence in Israel. In fact, latency has been reduced by almost 50%! Performance report Brisbane - Australia Brisbane is our 4th POP in Australia.
ENEL is using AWS to transform its entire business: closing all of its data centers by 2018, migrating workloads from over 6,000 on-premises servers onto AWS in nine months, and using AWS IoT services to better manage and understand energy consumption. ENEL is one of the leading energy operators in the world.
My personal opinion is that I don't see a widespread need for more capacity, given horizontal scaling and servers that can already exceed 1 Tbyte of DRAM; bandwidth is also helpful, but I'd be concerned about the increased latency of adding a hop to more memory.
The new AWS EU (Stockholm) Region will have three Availability Zones and will be ready for customers to use in 2018. This enables customers to serve content to their end users with low latency, giving them the best application experience. Over the past decade, we have seen tremendous growth at AWS.
As mentioned in our earlier blog post , Intel and Netflix have been collaborating on the SVT-AV1 encoder and decoder framework since August 2018. SVT-AV1 uses parallelization at several stages of the encoding process, which allows it to adapt to the number of available cores, including the newest servers with significant core count.
Server-Timing headers are a key tool in understanding what's happening within that black box of Time to First Byte (TTFB). Historically, when looking at page speed, we've had the tendency to ignore TTFB when trying to optimize the user experience. I mean, why wouldn't we? Cue Server-Timing headers.
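For concreteness, here is a minimal sketch of emitting a Server-Timing header using Python's standard http.server; the metric name "app" and the measured work are illustrative stand-ins, not from the article:

```python
import time
from http.server import BaseHTTPRequestHandler, HTTPServer

class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        t0 = time.perf_counter()
        body = b"hello"  # stand-in for real work: db query, template render, ...
        app_ms = (time.perf_counter() - t0) * 1000
        self.send_response(200)
        # Each entry is name;desc="...";dur=milliseconds. Browsers surface
        # these values in DevTools alongside TTFB.
        self.send_header("Server-Timing", f'app;desc="App logic";dur={app_ms:.1f}')
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

HTTPServer(("", 8000), Handler).serve_forever()
```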
The truth is that the two tools were fairly distinct until PSI was updated in 2018 to use Lighthouse reporting. DevTools throttling is easier to set up, but doesn’t accurately reflect how server connections work on the network. TTFB identifies how fast or slow a web server is to respond to requests.
Passive instances across regions are also possible, though it is recommended to operate in the same region as the database host in order to keep the change-capture latencies low. Delta has been used in production since 2018 for datastore synchronization and event processing use cases in Netflix studio applications.
Google’s industry benchmarks from 2018 also provide a striking breakdown of how each second of loading affects bounce rates (source: Google/SOASTA Research, 2018). I’m going to update my referenced URL to the new site to help decrease latency that adds drag to the initial page load. Improvement #2: The Critical Render Path.
How many buffers are needed to track pending requests as a function of needed bandwidth and expected latency? Can one both minimize latency and maximize throughput for unscheduled work? The M/M/1 queue will show us a required trade-off among (a) allowing unscheduled task arrivals, (b) minimizing latency, and (c) maximizing throughput.
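The shape of that trade-off falls out of the standard M/M/1 result: mean time in system is W = 1/(μ − λ), which grows without bound as the arrival rate λ approaches the service rate μ. A quick sketch with illustrative rates (not taken from the article):

```python
# Mean latency in an M/M/1 queue: W = 1 / (mu - lam), valid only for lam < mu.
def mm1_mean_latency(lam: float, mu: float) -> float:
    if lam >= mu:
        raise ValueError("unstable queue: arrival rate must stay below service rate")
    return 1.0 / (mu - lam)

mu = 1000.0  # service rate, requests/second (illustrative)
for rho in (0.5, 0.9, 0.99):
    lam = rho * mu  # push utilization toward capacity...
    print(f"rho={rho:.2f}  W={mm1_mean_latency(lam, mu) * 1e3:.1f} ms")
    # ...and mean latency climbs from 2 ms to 10 ms to 100 ms.
```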
This week I updated three of those posts — two really old ones (primarily of interest to computer architecture historians) and one from 2018: July 2012: Local and Remote Memory Latency on AMD Processors in 2-socket and 4-socket servers; December 2013: Notes on Memory Bandwidth on the Xeon Phi (Knights Corner) Coprocessor; January 2018: A Peculiar…
With Helsinki being the capital and most populous municipality of Finland, it makes for a great edge server location. Although both countries are relatively close to one another, they are separated by a distance of approximately 500km, which adds up in terms of latency.
Azure SQL Database Managed Instance became generally available in late 2018. Organizations are taking advantage of having managed backups, lots of built-in security features, an uptime SLA of 99.99%, and an always up-to-date environment where they are no longer responsible for patching SQL Server or the operating system.
Next, we’ll look at how to set up servers and clients (that’s the hard part unless you’re using a content delivery network (CDN)). This difference by itself doesn’t do all that much (it mainly reduces the overhead on the server-side), but it leads to most of the following points. Server Sharding and Connection Coalescing.
If performance is a problem, we can do progressive enhancement through Server-Side Rendering. I can count on one hand the number of teams I’ve worked with who have goals that allow them to block launches for latency regressions, including Google products. — Alex Russell (@slightlylate) August 27, 2018.
The main reason is that it decreases latency for users by serving your images from the POP physically closest to them. Another is to ensure that the settings on your server and/or CDN are set up correctly. It also allows for additional control over the caching of your images, as well as hotlink protection.
Historically, MySQL has been positioned for supporting web-based applications; this is in contrast to enterprise database workloads that have been served by commercial databases such as Oracle, Db2, and SQL Server.
Late-loading JavaScript can cause “server-side rendered” pages to fail in infuriating ways. The server sends it as a stream of bytes and when the browser encounters each of the sub-resources referenced in the document, it requests them. For a timely introduction, I recommend Kevin Schaaf’s recent talk.
According to Monetate, the following conversion rate results were gathered from Q3 2017 through Q3 2018. The global conversion rate for Q3 2018 was 2.42%, down from 2.78% in Q2. With a CDN, you can offload your static assets such as product images, videos, GIFs, CSS files, and much more to the CDN’s edge servers.
This limitation is at the database level rather than the hardware level; nevertheless, with up-to-date hardware (from mid-2018), PostgreSQL on a 2-socket system can be expected to deliver more than 2M PostgreSQL TPM and 1M NOPM with the HammerDB TPC-C test.
As data centers and volumes of servers have grown, so has the overall amount of electricity consumed around the world. Electricity used by servers doubled between 2000 and 2005 (and has continued growing ever since) from 12 billion to 23 billion kilowatt hours. Server Power Consumption (Source: Intel Labs 2008).
If you are new to running Oracle, SQL Server, MySQL, and PostgreSQL TPC-C workloads with HammerDB and have needed to investigate I/O performance, the chances are that you have experienced waits on writing to the Redo, Transaction Log, or WAL, depending on the database you are testing. One SQL Server lever for these log-write waits is DELAYED_DURABILITY, sketched below.
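A hedged sketch of enabling that option (the connection string and database name are placeholders, not from the original post; requires the pyodbc package and a SQL Server ODBC driver):

```python
import pyodbc  # assumes an ODBC driver for SQL Server is installed

# Illustrative connection string -- substitute your own server, database,
# and credentials.
conn = pyodbc.connect(
    "DRIVER={ODBC Driver 18 for SQL Server};SERVER=localhost;"
    "DATABASE=tpcc;UID=sa;PWD=...;TrustServerCertificate=yes",
    autocommit=True,  # ALTER DATABASE cannot run inside a user transaction
)
# FORCED makes every commit delayed-durable: log records are flushed to disk
# in batches rather than on each commit, trading a small durability window
# for lower log-write waits. The other settings are ALLOWED and DISABLED.
conn.execute("ALTER DATABASE tpcc SET DELAYED_DURABILITY = FORCED")
```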
This post was originally published in July 2018 and was updated in July 2023. It efficiently manages read and write operations, optimizes data access, and minimizes contention, resulting in high throughput and low latency to ensure that applications perform at their best. What are the differences between Aurora and RDS?
72: signals sensed from a distant galaxy using AI; 12M: reddit posts per month; 10 trillion: test inputs per day that Google generated with 100s of servers for several months using OSS-Fuzz; 200%: growth in Cloud Native technologies used in production; $13 trillion: potential economic impact of AI by 2030.
During our testing using the storage-optimized EC2 instances (i3.2xlarge), we noticed that we were able to perform over 200K IOPS of 1K-byte items, thus meeting our throughput goals, with latency rarely exceeding 1 millisecond.
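For scale, a quick back-of-the-envelope check of what that IOPS figure implies in bandwidth terms (illustrative arithmetic, not from the original post):

```python
iops = 200_000     # observed 1 KB operations per second
item_bytes = 1024  # 1 KB per item
bytes_per_sec = iops * item_bytes
print(f"{bytes_per_sec / 1e6:.0f} MB/s")     # ~205 MB/s sustained
print(f"{bytes_per_sec / 2**20:.0f} MiB/s")  # ~195 MiB/s
```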
They understood that most websites lack tight latency budgeting, dedicated performance teams, hawkish management reviews, ship gates to prevent regressions, and end-to-end measurements of critical user journeys. "Server-Side Rendering", a.k.a. "SSR" [an intro to "isomorphic JavaScript", a.k.a. "Server-Side…
India became a 4G-centric market sometime in 2018. Sadly, data on latency is harder to get, even from Google's perch, so progress there is somewhat more difficult to judge. If there's a bright spot in our construction of a 2021 baseline for performance, this is it. 5G looks set to continue a bumpy rollout for the next half-decade.
In the end, it’s not the load events or server response times that define the experience, but the perception of how snappy the interface feels. It used to provide an insight into how quickly the server outputs any data. This knowledge will give you the best optimization target for ongoing efforts. What does it mean?
Is it worth exploring tree-shaking, scope hoisting, code-splitting, and all the fancy loading patterns with Intersection Observer, server push, client hints, HTTP/2, service workers and — oh my — edge workers? Long FMP usually indicates JavaScript blocking the main thread, but could be related to back-end/server issues as well.