In response to this trend, open source communities birthed new companies like WSO2 (of course, industry giants like Google, IBM, Software AG, and Tibco are also competing for a piece of the API management cake). Typical symptoms include high latency or a lack of responses; such increases are clearly correlated with increased response latencies.
Now that we’ve compared throughput performance, let’s take a look at ScaleGrid vs. DigitalOcean latency for MySQL. On average, ScaleGrid achieves almost 30% lower latency than DigitalOcean for the same deployment configurations, across both the read-intensive and balanced-workload latency benchmarks.
Reduced tail latencies: in both our gRPC and DGS Framework services, GC pauses are a significant source of tail latencies. For a given CPU utilization target, ZGC improves both average and P99 latencies with equal or better CPU utilization when compared to G1. There is no single best garbage collector.
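To make the average-versus-P99 comparison concrete, here is a minimal sketch using synthetic latency samples (not the article's data): one run adds a rare long pause to mimic occasional GC stalls, which barely moves the average but inflates the tail.

```python
import math
import random

def percentile(samples, p):
    """Nearest-rank percentile: the value at rank ceil(p/100 * n)."""
    ordered = sorted(samples)
    rank = math.ceil(p / 100 * len(ordered))
    return ordered[rank - 1]

# Hypothetical latency samples (ms): same baseline, but the "G1" run
# adds a ~40ms pause to 1% of requests to mimic occasional GC stalls.
random.seed(7)
g1  = [random.gauss(8, 2) + (40 if random.random() < 0.01 else 0) for _ in range(10_000)]
zgc = [random.gauss(8, 2) for _ in range(10_000)]

for name, s in (("G1", g1), ("ZGC", zgc)):
    print(f"{name:>3}: avg={sum(s)/len(s):5.1f} ms  p99={percentile(s, 99):5.1f} ms")
```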
Step 5 – xMatters triggers a runbook in Ansible to fix the disk latency. In this case, a team member determines that the best course of action is to delete temporary files from disk to free up processing power and return the disk to proper health; as a last step, xMatters triggers the Ansible runbook to push the disk latency fix.
Over the course of this post, we will talk about our approach to this migration, the strategies that we employed, and the tools we built to support this. Being able to canary a new route let us verify that latency and error rates were within acceptable limits. Background: the Netflix Android app uses the Falcor data model and query protocol.
Of course, this total of 100% only measures compressible responses that actually were compressed—there are still many millions of resources that could or should have been compressed but were not. TCP, Packets, and Round Trips. The point I’m driving at here is that it’s not just about file-size, it’s about TCP, packets, and round trips.
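The round-trip point can be made concrete with a back-of-the-envelope slow-start model. This is a sketch under simplifying assumptions (an initial congestion window of 10 segments, a 1460-byte MSS, the window doubling each round trip, no loss or header overhead), not a faithful TCP implementation:

```python
def round_trips(size_bytes, mss=1460, initcwnd=10):
    """Rough slow-start model: the congestion window (in segments)
    doubles each round trip until the payload has been delivered."""
    cwnd, sent, rtts = initcwnd, 0, 0
    while sent < size_bytes:
        sent += cwnd * mss
        cwnd *= 2
        rtts += 1
    return rtts

for kb in (14, 50, 200, 800):
    print(f"{kb:>4} KB -> ~{round_trips(kb * 1024)} round trip(s)")
```

Under these assumptions, shaving a response from 50 KB to 14 KB saves two whole round trips, which often matters more than the raw byte count.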
Of course, we believe in the transformative potential of NN throughout video applications, beyond video downscaling. Our filter can run on both CPU and GPU; on a CPU, we leveraged oneDNN to further reduce latency. Millions of devices that support Netflix streaming automatically benefit from this solution.
Do you want to optimize your Citrix landscape with insights into user load and screen latency per server? The long-term outlook, of course, is to sunset the legacy Extension framework at some point in the future. Of course, for users who are comfortable with coding and are already using Python extensions, coded 2.0
Of course, we have opinions on all of these, but we think those aren’t the most useful questions to ask right now. We’ve taught this SDLC in a live course with engineers from companies like Netflix, Meta, and the US Air Force and recently distilled it into a free 10-email course to help teams apply it in practice.
With a bit of column rearranging we get this impressive result: what this says is that, over the course of two right-mouse clicks, RuntimeBroker.exe, thread 10,252, issued 229,604 ReadFile calls, reading a total of 15,686,586 bytes. Now that we suspect file I/O, it’s necessary to go to Graph Explorer -> Storage -> File I/O.
Of course, writes were much less common than reads, so I added a caching layer for reads, and that did the trick. So in addition to all the optimization work we did for Google Docs, I got to spend a lot of time and energy working on the measurement problem: how can we get end-to-end latency numbers?
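A minimal sketch of that read-caching idea (the `loader` callback, TTL policy, and class name are assumptions for illustration; the post doesn't describe its implementation):

```python
import time

class ReadThroughCache:
    """Minimal read-through cache with a TTL. `loader` is any function
    that fetches a value from the backing store (e.g., the database)."""
    def __init__(self, loader, ttl_seconds=60):
        self.loader = loader
        self.ttl = ttl_seconds
        self.entries = {}  # key -> (value, expires_at)

    def get(self, key):
        hit = self.entries.get(key)
        if hit and hit[1] > time.monotonic():
            return hit[0]                      # fast path: no backend read
        value = self.loader(key)               # slow path: read through
        self.entries[key] = (value, time.monotonic() + self.ttl)
        return value

    def invalidate(self, key):
        """Call on writes so readers don't serve stale data for a full TTL."""
        self.entries.pop(key, None)
```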
The short answers are, of course, ‘all the time’ and ‘everyone’, but this mutual disownership is a common reason why performance often gets overlooked. Of course, it is impossible to fix (or even find) every performance issue during the development phase. Unfortunately, most issues do not get captured at this point. Who: Engineers.
Balancing Low Latency, High Availability and Cloud Choice: cloud hosting is no longer just an option — it’s now, in many cases, the default choice. Long-tail latency spikes would break a lot of, if not most, time-sensitive use cases, like IoT device control. But the cloud computing market, having grown to a whopping $483.9
Although this response has a 0B filesize, we will always take the latency hit on every single page view (and this response is basically 100% latency). com, which introduces yet more latency for the connection setup. On mobile connections, this can amount to whole seconds of delays, all sat on the Critical Path.
Already in the 2000s, service-oriented architectures (SOA) became popular, and operations teams discovered the need to understand how transactions traverse through all tiers and how these tiers contributed to the execution time and latency. Of course, Dynatrace supports W3C Trace Context as well.
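As a concrete illustration of trace context propagation, here is a sketch that builds a W3C Trace Context traceparent header (format: version, 16-byte trace ID, 8-byte parent span ID, and flags, all hex-encoded). The helper name is mine, not Dynatrace's:

```python
import secrets

def make_traceparent(sampled=True):
    """Build a W3C Trace Context 'traceparent' header:
    version(00)-trace-id(16 bytes hex)-parent-id(8 bytes hex)-flags."""
    trace_id = secrets.token_hex(16)   # must not be all zeros
    parent_id = secrets.token_hex(8)   # id of the current span
    flags = "01" if sampled else "00"
    return f"00-{trace_id}-{parent_id}-{flags}"

print(make_traceparent())
# e.g. 00-4bf92f3577b34da6a3ce929d0e0e4736-00f067aa0ba902b7-01
```

Every tier that forwards this header lets the tracing backend stitch a transaction together across services.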
It enables a Production Office Coordinator to keep a Production’s cast, crew, and vendors organized and up to date with the latest information throughout the course of a title’s filming. We are expected to process 1,000 watermarks for a single distribution in a minute, with non-linear latency growth as the number of watermarks increases.
The other sections on that page (such as Disk analysis) provide further information and charts on topics such as available disk space, latency, dropped network packets, refused connections, and more. This allows us to quickly tell whether the network link may be saturated or the processor is running at its limit.
I generally give one or two courses a year on C++ and related technologies. See the course link for details and a syllabus of topics that will be covered. Presented by Alfasoft.
"We achieve 5.5µs of replication latency on lossy Ethernet, which is faster than or comparable to specialized replication systems that use programmable switches, FPGAs, or RDMA." "At some point, the e-mail I send over WiFi will hit a wire, of course." matthewstoller: I just looked at Netflix’s 10K.
Tue-Thu Apr 25-27: High-Performance and Low-Latency C++ (Stockholm). On April 25-27, I’ll be in Stockholm (Kista) giving a three-day seminar on “High-Performance and Low-Latency C++.” If you’re interested in attending, please check out the links, and I look forward to meeting and re-meeting many of you there.
As developers, we rightfully obsess about the customer experience, relentlessly working to squeeze every millisecond out of the critical rendering path, optimize input latency, and eliminate jank. And, of course, the result needs to be seamless and delightful — dare we say, even fun — to develop and maintain. Ilya Grigorik.
Shredder is "a low-latency multi-tenant cloud store that allows small units of computation to be performed directly within storage nodes." V8 is lightweight enough that it can easily support thousands of concurrent tenants. "Running end-user compute inside the datastore is not without its challenges, of course."
Edge servers are the middle ground – more compute power than a mobile device, but with latency of just a few ms. The client MWW combines these estimates with an estimate of the input/output transmission time (latency) to find the worker with the minimum overall execution latency.
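A sketch of that minimum-latency worker selection (field names and numbers are illustrative assumptions; the paper's estimator also accounts for factors this toy model omits):

```python
def pick_worker(workers, input_bytes, output_bytes):
    """Choose the worker minimizing estimated total latency:
    round trip + transmission time (bytes / bandwidth) + compute time."""
    def total_latency(w):
        transmit = (input_bytes + output_bytes) / w["bandwidth_bps"]
        return w["rtt_s"] + transmit + w["est_compute_s"]
    return min(workers, key=total_latency)

workers = [
    {"name": "phone", "rtt_s": 0.000, "bandwidth_bps": 1e12, "est_compute_s": 0.900},
    {"name": "edge",  "rtt_s": 0.005, "bandwidth_bps": 5e7,  "est_compute_s": 0.080},
    {"name": "cloud", "rtt_s": 0.060, "bandwidth_bps": 5e7,  "est_compute_s": 0.030},
]
print(pick_worker(workers, input_bytes=2_000_000, output_bytes=10_000)["name"])
# -> "edge": slower silicon than the cloud, but far cheaper to reach
```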
Here are the bombshell paragraphs: "Our datacenter applications seek ever more CPU-efficient and lower-latency communication, which Pony Express delivers" (emphasis mine). The desire for CPU efficiency and lower latencies is easy to understand. Upgrades are also rolled out progressively across the cluster, of course.
The new AWS Africa (Cape Town) Region will have three Availability Zones and provide lower latency to end users across Sub-Saharan Africa. Another program for higher education institutes is AWS Academy , which provides AWS-authorized courses for students to acquire in-demand cloud computing skills.
On the Cloudburst design team’s wish list: a running function’s ‘hot’ data should be kept physically nearby for low-latency access. A low-latency autoscaling KVS can serve as both global storage and a DHT-like overlay network. Oh, and there’s a scheduler too, of course, to keep all the plates spinning.
Of course, there’s more to making improvements than just relying on tools and metrics; you also need to develop and follow processes. We’re experiencing high latency in responses. Distillation (making a smaller, faster model from a big one) lets you use cheaper, faster models with less delay (latency).
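For readers unfamiliar with the technique, here is a minimal numpy sketch of the core of distillation: scoring the small model against the big model's softened output distribution. The logit values and temperature are illustrative assumptions, not anything from the article:

```python
import numpy as np

def softmax(z, T=1.0):
    z = z / T
    e = np.exp(z - z.max())
    return e / e.sum()

def distillation_loss(teacher_logits, student_logits, T=2.0):
    """Cross-entropy of the student against the teacher's softened
    (temperature-scaled) distribution -- the core distillation signal."""
    p_teacher = softmax(teacher_logits, T)
    log_q_student = np.log(softmax(student_logits, T) + 1e-12)
    return -np.sum(p_teacher * log_q_student)

teacher = np.array([4.0, 1.0, 0.2])   # big, slow model's logits
student = np.array([3.5, 1.2, 0.1])   # small, fast model's logits
print(f"loss = {distillation_loss(teacher, student):.4f}")
```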
Lesson 4: prediction serving latency matters. In an experiment introducing synthetic latency, Booking.com found that an increase of about 30% in latency cost about 0.5%. Even mathematically simple models have the potential of introducing relevant latency.
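A sketch of how such a synthetic-latency experiment is typically wired up (the bucketing scheme, delay value, and function names are my assumptions, not Booking.com's implementation):

```python
import hashlib
import time

def in_treatment(user_id, experiment="latency-test", pct=50):
    """Deterministic bucketing: the same user always lands in the same arm."""
    h = hashlib.sha256(f"{experiment}:{user_id}".encode()).digest()
    return h[0] % 100 < pct

def handle_request(user_id, handler, synthetic_delay_s=0.030):
    """Add ~30ms of artificial delay for the treatment arm only;
    conversion is then compared between arms offline."""
    if in_treatment(user_id):
        time.sleep(synthetic_delay_s)
    return handler(user_id)
```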
It’s limited by the laws of physics in terms of end-to-end latency. We saw earlier that there is end-user pressure to replace batch systems with much lower-latency online systems. We are observing significant demand from users in terms of avoiding batch telemetry pipelines altogether. (Emphasis mine.)
One is that the latency within a zone is incredibly low. Others: to provide geographic diversity and lower latencies to end users, and to allow Regions to act as standby sites for each other. Of course, all sorts of automation immediately kick in to mitigate any impact to even that subset. "Silo your traffic or not – you choose."
Also, if you are not using a content delivery network (CDN) or multiple CDNs to map users to the closest edge regions for reduced latencies (skipping this is known as suboptimal routing), you might slow down the start of the video. And of course, you need to host and deliver your images. You don’t have to offload your content.
Options 1 and 2 are of course the ‘scale out’ options (improving the efficiency with which we can coordinate work across a collection of units; see the Universal Scalability Law, sketched below), whereas option 3 is ‘scale up’ (increasing the amount of work we can do on a single unit). This makes the whole system latency sensitive.
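For reference, the Universal Scalability Law models how coordination costs erode scale-out gains. A minimal sketch, with illustrative (assumed) contention and coherency coefficients:

```python
def usl_throughput(n, lam=1.0, sigma=0.05, kappa=0.001):
    """Universal Scalability Law: relative capacity at n units.
    lam   = throughput of a single unit,
    sigma = contention (serialization) penalty,
    kappa = coherency (crosstalk) penalty -- the coordination cost
            that grows with every pair of units."""
    return (lam * n) / (1 + sigma * (n - 1) + kappa * n * (n - 1))

for n in (1, 8, 32, 128, 512):
    print(f"n={n:>3}: throughput ~ {usl_throughput(n):.1f}x")
```

With these coefficients throughput peaks and then goes retrograde as n grows: adding units past a point makes the system slower, which is exactly why coordination efficiency is its own scaling option.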
Instead, I’ll be the panel moderator, and we have a great slate of panelists again this year: Bjarne Stroustrup (of course), Bryce Adelstein Lelbach (library evolution subgroup chair), Hana Dusíková (compile-time programming subgroup chair), Inbal Levi (Israel national chair), JC Van Winkel (Netherlands national chair and teaching subgroup chair), (..)
Technically, “performance” metrics are those relating to the responsiveness or latency of the app, including startup time. Both of these metrics will fluctuate over the course of a test, so we post metric values at regular intervals throughout the test, and often before they are even committed to the codebase.
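A sketch of that interval-based posting (the sampling and posting callbacks are placeholders; the article doesn't show its harness):

```python
import threading
import time

def post_metrics_periodically(sample_fn, post_fn, interval_s=10, stop_event=None):
    """Sample a metric (e.g., startup time, input latency) every
    `interval_s` seconds and post it, so fluctuation over the course
    of a test run stays visible instead of being averaged away."""
    stop_event = stop_event or threading.Event()

    def loop():
        while not stop_event.wait(interval_s):
            post_fn(time.time(), sample_fn())

    threading.Thread(target=loop, daemon=True).start()
    return stop_event  # caller sets this to stop the sampler
```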
DLVs (Dedicated Log Volumes) are particularly advantageous for databases with large allocated storage, high I/O per second (IOPS) requirements, or latency-sensitive workloads. The DLV, of course, affects WRITE operations the most, and hence the write-only test should be given the most consideration when comparing the DLV configuration vs. standard RDS.
DynamoDB delivers predictable performance and single digit millisecond latencies for reads and writes to your application, whether you're just getting started and want to perform hundreds of reads or writes per second in dev and test, or you're operating at scale in production performing millions of reads and writes per second.
In ProtoCache (a component of a widely used Google application), 27% of its latency when using a traditional S+RInK design came from marshalling/un-marshalling. There is also the network latency of fetching data over the network, even considering fast data center networks. Horses for courses! Fetching too much data in a single query (i.e.,
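To see why marshalling can dominate, here is a small timing sketch; Python's pickle stands in for protobuf serialization, so the absolute numbers are only illustrative. The point is the gap between a serialize/deserialize round trip and touching an object that is already in memory:

```python
import pickle
import timeit

record = {"id": 1, "tags": ["a"] * 50, "payload": "x" * 4096}

# Round-trip through a serialized form, as an S+RInK design must do.
marshal = timeit.timeit(lambda: pickle.loads(pickle.dumps(record)), number=10_000)
# Direct access to an object already resident in memory.
direct = timeit.timeit(lambda: record["payload"], number=10_000)

print(f"marshal round-trip: {marshal * 100:.2f} us/op")  # total_s / 10_000 * 1e6
print(f"direct access:      {direct * 100:.2f} us/op")
```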
Of course, there are “ClientWrite” events, the wait event related to writing data to the client (pg_dump, in this case). The following wait events are captured from a really fast/low-latency network. If the network is fast, we may see a good amount of CPU utilization and “DataFileRead” as a wait event.
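A sketch of how such wait events can be sampled live, assuming the psycopg2 driver and the stock pg_stat_activity view (the post uses its own tooling):

```python
import psycopg2  # assumed driver; any PostgreSQL client works

WAIT_EVENT_SQL = """
SELECT wait_event_type, wait_event, count(*)
FROM   pg_stat_activity
WHERE  wait_event IS NOT NULL
GROUP  BY 1, 2
ORDER  BY 3 DESC;
"""

# Each run is a point-in-time snapshot; poll in a loop to see how
# ClientWrite vs. DataFileRead waits shift with network speed.
with psycopg2.connect("dbname=postgres") as conn, conn.cursor() as cur:
    cur.execute(WAIT_EVENT_SQL)
    for event_type, event, n in cur.fetchall():
        print(f"{event_type:>10} | {event:<20} | {n}")
```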
They can run applications in Sweden, serve end users across the Nordics with lower latency, and leverage advanced technologies such as containers, serverless computing, and more. We help Supercell to quickly develop, deploy, and scale their games to cope with varying numbers of gamers accessing the system throughout the course of the day.
This is a complex topic, but to borrow from a recent post , web performance expands access to information and services by reducing latency and variance across interactions in a session, with a particular focus on the tail of the distribution (P75+). Consistent performance matters just as much as low average latency. What's the problem?
These guidelines work well for a wide range of applications, though the optimal settings, of course, depend on the workload. Although there have been many blogs about adjusting MySQL variables for better performance since then, I think this topic deserves a blog update since the last update was a decade ago, and MySQL 5.7
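As a hedged illustration of adjusting such variables at runtime: the variable names below are real MySQL variables, but the values are placeholders only; as the post stresses, the right settings depend on the workload and available memory.

```python
import mysql.connector  # assumed client library; any MySQL driver works

# Commonly tuned variables -- values are illustrative, not recommendations.
settings = {
    "innodb_buffer_pool_size": 8 * 1024**3,  # often ~70-80% of RAM on a dedicated DB host
    "innodb_flush_log_at_trx_commit": 1,     # durability vs. write-latency trade-off
    "max_connections": 500,
}

conn = mysql.connector.connect(host="127.0.0.1", user="admin", password="…")
cur = conn.cursor()
for name, value in settings.items():
    cur.execute(f"SET GLOBAL {name} = %s", (value,))  # dynamic in MySQL 5.7+
cur.execute("SHOW GLOBAL VARIABLES LIKE 'innodb_buffer_pool_size'")
print(cur.fetchone())
```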
" Of course, no technology change happens in isolation, and at the same time NoSQL was evolving, so was cloud computing. Performant – DynamoDB consistently delivers single-digit millisecond latencies even as your traffic volume increases.
This work is latency critical, because volume IO is blocked until it is complete; thus the configuration master is under stress just when you need it the most. And, of course, it is designed to minimise the blast radius of any failures that do occur. Larger cells have better tolerance of tail latency (e.g. Physalia in the large.
Take rebuffers per hour, for example, where the number of rebuffers a member has in the course of an experiment (the numerator) is usually 0, and a few members may have 1 to 5 rebuffers. We do not gain much reduction in data size by compressing the numerator as, in practice, we find that the number of unique numerator values is small.