“Latency” is the duration between the execution of a load instruction (to an address that misses in all the caches) and the completion of that load instruction when the data is returned from memory. The example below is for a 2005-era processor with 60 ns memory latency.
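The original excerpt's worked example is truncated here. As a rough stand-in, the sketch below applies Little's Law to that 60 ns latency; the 6.4 GB/s peak-bandwidth figure is an assumption for illustration and does not come from the excerpt.

```python
# Little's Law applied to memory: to keep the memory bus busy, a core must
# keep (latency x bandwidth) bytes in flight at all times.
# The 6.4 GB/s peak bandwidth is an assumed, illustrative figure;
# only the 60 ns latency comes from the excerpt above.

LATENCY_S = 60e-9          # 60 ns memory latency
BANDWIDTH_BPS = 6.4e9      # assumed peak bandwidth, bytes/second
CACHE_LINE_BYTES = 64      # typical cache-line size

bytes_in_flight = LATENCY_S * BANDWIDTH_BPS
lines_in_flight = bytes_in_flight / CACHE_LINE_BYTES

print(f"{bytes_in_flight:.0f} bytes in flight "
      f"= {lines_in_flight:.1f} outstanding cache lines")
# -> 384 bytes = 6.0 outstanding cache lines
```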
Its partitioned log architecture supports both queuing and publish-subscribe models, allowing it to handle large-scale event processing with minimal latency. Apache Kafka uses a custom TCP/IP protocol for high throughput and low latency. Apache Kafka, designed for distributed event streaming, maintains low latency at scale.
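As a rough illustration of that queuing/publish-subscribe duality, here is a minimal sketch using the kafka-python client; the broker address, topic, and group names are placeholders, not details from the excerpt.

```python
# Minimal sketch with the kafka-python client; broker address, topic and
# group ids are illustrative placeholders.
from kafka import KafkaProducer, KafkaConsumer

producer = KafkaProducer(bootstrap_servers="localhost:9092")
producer.send("events", b"order-created:42")
producer.flush()

# Queuing semantics: consumers sharing a group_id split the partitions,
# so each record is processed by only one member of the group.
worker = KafkaConsumer("events", group_id="order-workers",
                       bootstrap_servers="localhost:9092",
                       auto_offset_reset="earliest")

# Publish-subscribe semantics: a consumer with a different group_id
# receives its own full copy of the stream.
auditor = KafkaConsumer("events", group_id="audit-log",
                        bootstrap_servers="localhost:9092",
                        auto_offset_reset="earliest")

for record in worker:
    print(record.value)
    break
```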
Sydney, we have a disk write latency problem! It was on August 25th at 14:00 when Davis first alerted on disk write latency issues to Elastic File System (EFS) on one of our EC2 instances in AWS’s Sydney data center. The AWS team confirmed a known hardware issue affecting a certain number of EC2 machines in that region.
CPU isolation and efficient system management are critical for any application which requires low-latency and high-performance computing. In modern production environments, there are numerous hardware and software hooks that can be adjusted to improve latency and throughput.
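One of the simplest such hooks is pinning a latency-sensitive process to dedicated cores so the scheduler does not migrate it. The sketch below uses Linux CPU affinity from Python; the core numbers are chosen purely for illustration and assume those cores have already been isolated from general scheduling.

```python
# Pin the current process to two dedicated cores (Linux-only sketch;
# core ids 2 and 3 are illustrative and assume they have been isolated
# from the general scheduler, e.g. via kernel boot parameters).
import os

ISOLATED_CORES = {2, 3}

os.sched_setaffinity(0, ISOLATED_CORES)   # 0 = the calling process
print("now restricted to cores:", os.sched_getaffinity(0))
```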
The network latency between cluster nodes should be around 10 ms or less. Our Premium High Availability comes with the following features: Active-active deployment model for optimum hardware utilization. Save on costs for hardware and network bandwidth to optimize total cost of ownership. Self-contained turnkey solution.
It enables multiple operating systems to run simultaneously on the same physical hardware and integrates closely with Windows-hosted services. Therefore, they experience how the application code functions and how the application operations depend on the underlying hardware resources and the operating system managed by Hyper-V.
This is where Lambda comes in: Developers can deploy programs with no concern for the underlying hardware, connecting to services in the broader ecosystem, creating APIs, preparing data, or sending push notifications directly in the cloud, to list just a few examples. AWS continues to improve how it handles latency issues.
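For readers who have not seen one, a Lambda function reduces to a single entry point the platform invokes; the minimal sketch below assumes an API Gateway-style event, and the field names are illustrative rather than taken from the excerpt.

```python
# Minimal sketch of an AWS Lambda handler in Python; the event shape shown
# (an API Gateway-style request) is an illustrative assumption.
import json

def lambda_handler(event, context):
    # AWS invokes this entry point; there are no servers to provision or manage.
    name = (event.get("queryStringParameters") or {}).get("name", "world")
    return {
        "statusCode": 200,
        "body": json.dumps({"message": f"hello, {name}"}),
    }
```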
The first—and often most surprising—thing that I want to draw your attention to is that TTFB counts one whole round trip of latency. The reason is that mobile networks are, as a rule, high-latency connections. Last mile latency deals with the disproportionate complexity toward the terminus of a connection.
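To make "time to first byte" concrete, here is a rough measurement sketch using only the Python standard library; the URL is a placeholder, and because the timing includes DNS, TCP, and TLS setup it approximates, rather than reproduces, a browser-grade TTFB metric.

```python
# Rough TTFB measurement sketch using only the standard library.
# The host and path are placeholders; the figure includes connection setup,
# so it approximates what a browser reports as TTFB.
import http.client
import time

HOST, PATH = "example.com", "/"          # illustrative target

start = time.perf_counter()
conn = http.client.HTTPSConnection(HOST, timeout=10)
conn.request("GET", PATH)
resp = conn.getresponse()                # returns once response headers arrive
ttfb_ms = (time.perf_counter() - start) * 1000
resp.read()
conn.close()

print(f"approximate TTFB for https://{HOST}{PATH}: {ttfb_ms:.1f} ms")
```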
This allows teams to sidestep much of the cost and time associated with managing hardware, platforms, and operating systems on-premises, while also gaining the flexibility to scale rapidly and efficiently. When an idle application is triggered, the cold start adds latency while the application spins up, and the same delay recurs whenever it needs to restart.
Because microprocessors are so fast, computer architecture design has evolved towards adding various levels of caching between compute units and the main memory, in order to hide the latency of bringing the bits to the brains. This avoids thrashing caches too much for B and evens out the pressure on the L3 caches of the machine.
While clustering across wide-area networks (WANs) is discouraged due to latency issues, leased links can mitigate some connectivity challenges. Keeping queues short minimizes latency, keeps message delivery efficient, and maintains a responsive RabbitMQ setup.
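One way to enforce short queues is to declare a maximum length so the broker sheds load instead of letting a backlog grow. The sketch below uses the pika client; the queue name, limit, and overflow behaviour are illustrative choices, not recommendations from the excerpt.

```python
# Declare a length-capped queue with the pika RabbitMQ client.
# Queue name, limit, and overflow behaviour are illustrative choices.
import pika

connection = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
channel = connection.channel()

channel.queue_declare(
    queue="orders",
    durable=True,
    arguments={
        "x-max-length": 10_000,          # cap the backlog
        "x-overflow": "reject-publish",  # push back on publishers instead of growing
    },
)
connection.close()
```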
Besides the traditional system hardware, storage, routers, and software, ITOps also includes virtual components of the network and cloud infrastructure. Although modern cloud systems simplify tasks, such as deploying apps and provisioning new hardware and servers, hybrid cloud and multicloud environments are often complex.
By bringing computation closer to the data source, edge-based deployments reduce latency, enhance real-time capabilities, and optimize network bandwidth. Use hardware-based encryption and ensure regular over-the-air updates to maintain device security. Increased latency during peak loads. Data interception during transit.
It requires purchasing, powering, and configuring physical hardware, training and retaining the staff capable of servicing and securing the machines, operating a data center, and so on. They need enough hardware to serve their anticipated volume and keep things running smoothly without buying too much or too little.
Complementing the hardware is the software on the RAE and in the cloud, and bridging the software on both ends is a bi-directional control plane. When a new hardware device is connected, the Local Registry detects and collects a set of information about it, such as networking information and ESN.
Balancing Low Latency, High Availability and Cloud Choice Cloud hosting is no longer just an option — it’s now, in many cases, the default choice. As a result, IT teams picked hardware somewhat blindly but with a strong bias towards oversizing for the sake of expanding the budget, leading to systems running at 10-15% of maximum capacity.
When performance tuning an application, both the code and the hardware running the code should be accounted for. For low latency, applications use the Concurrent Mark Sweep (CMS) or G1 garbage collector. Learn how to make your Java applications perform perfectly. You may also like: How to Properly Plan JVM Performance Tuning.
This makes the whole system latency sensitive. So we need low latency, but we also need very high throughput: A recurring theme in IDS/IPS literature is the gap between the workloads they need to handle and the capabilities of existing hardware/software implementations. The target FPGA for Pigasus has 16MB of BRAM.
To be robust and scalable, this key/value store needs to be distributed for durability and availability, to protect against network partitions or hardware failures. This architecture affords Amazon ECS high availability, low latency, and high throughput because the data store is never pessimistically locked.
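"Never pessimistically locked" usually means optimistic concurrency control: each record carries a version, and a write succeeds only if the version has not changed underneath the writer. Below is a generic, illustrative sketch of that idea; it is not Amazon's actual data-store implementation.

```python
# Generic optimistic-concurrency sketch (illustrative, not ECS's real store).
# Each value carries a version; a write succeeds only if the caller read the
# latest version, so no reader or writer ever holds a lock while waiting.
class VersionConflict(Exception):
    pass

class OptimisticStore:
    def __init__(self):
        self._data = {}            # key -> (version, value)

    def get(self, key):
        return self._data.get(key, (0, None))

    def put(self, key, value, expected_version):
        current_version, _ = self._data.get(key, (0, None))
        if current_version != expected_version:
            raise VersionConflict(f"{key}: expected v{expected_version}, "
                                  f"store has v{current_version}")
        self._data[key] = (current_version + 1, value)
        return current_version + 1

store = OptimisticStore()
version, _ = store.get("task-42")
store.put("task-42", {"state": "RUNNING"}, expected_version=version)      # succeeds
try:
    store.put("task-42", {"state": "STOPPED"}, expected_version=version)  # stale read
except VersionConflict as err:
    print("retry with a fresh read:", err)
```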
We are standing on the eve of the 5G era… 5G, as a monumental shift in cellular communication technology, holds tremendous potential for spurring innovations across many vertical industries, with its promised multi-Gbps speed, sub-10 ms latency, and massive connectivity.
In these modern environments, every hardware, software, and cloud infrastructure component and every container, open-source tool, and microservice generates records of every activity. Observability relies on telemetry derived from instrumentation that comes from the endpoints and services in your multi-cloud computing environments.
Amazon DynamoDB offers low, predictable latencies at any scale. This is not just predictability of median performance and latency, but also at the end of the distribution (the 99.9th percentile), so we could provide acceptable performance for virtually every customer.
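Tail percentiles like the 99.9th are simply order statistics over a window of latency samples. The sketch below shows one way to compute them; the sample latencies are invented purely for illustration.

```python
# Minimal sketch: median vs. tail latency from a window of samples.
# The sample latencies are invented for illustration.
import random

random.seed(7)
# Mostly-fast requests with an occasional slow outlier.
samples_ms = [random.gauss(5, 1) for _ in range(10_000)]
samples_ms += [random.uniform(50, 200) for _ in range(20)]

def percentile(values, pct):
    """Nearest-rank percentile over a list of samples."""
    ordered = sorted(values)
    rank = round(pct / 100 * (len(ordered) - 1))
    return ordered[max(0, min(len(ordered) - 1, rank))]

print(f"p50   = {percentile(samples_ms, 50):6.1f} ms")
print(f"p99   = {percentile(samples_ms, 99):6.1f} ms")
print(f"p99.9 = {percentile(samples_ms, 99.9):6.1f} ms")
```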
Tue-Thu Apr 25-27: High-Performance and Low-Latency C++ (Stockholm). On April 25-27, I’ll be in Stockholm (Kista) giving a three-day seminar on “High-Performance and Low-Latency C++.”
This is a given, whether you are using the highest quality hardware or lowest cost components. When customers left the constraining, old world of IT hardware and datacenters behind, they started to develop systems with new and interesting usage patterns that no one had ever seen before. Primitives not frameworks. No gatekeepers.
This was a chance to talk about other things I've been working on, such as the present and future of hardware performance. The video is on YouTube, and the slides are on SlideShare and available as a PDF. I work on many areas of performance, but recently I've had a lot of demand to talk about BPF.
Things always feel fast when we’re developing because, more often than not, we’re working on high-spec machines on dedicated networks, and also serving from localhost, which removes the bulk of the latency and bandwidth issues that a real user would suffer. Unfortunately, most issues do not get captured at this point.
This is why our BYOC pricing is less than our Dedicated Hosting pricing, as the costs listed for BYOC are only what you pay for ScaleGrid and don’t include your hardware costs. Deploying your application and database on the same VPC also provides the lowest possible latency path. Where to host your cloud database?
This enables customers to serve content to their end users with low latency, giving them the best application experience. In 2011, AWS opened a Point of Presence (PoP) in Stockholm to enable customers to serve content to their end users with low latency. As well as AWS Regions, we also have 24 AWS Edge Network Locations in Europe.
Lift & Shift is where you basically just move physical or virtual hosts to the cloud – essentially you just run your host on somebody else’s hardware. Remember: this is a critical aspect, as you do not want to migrate a service and suddenly introduce high latency or costs to a system that you forgot you had a dependency on!
Key takeaways: Critical performance indicators such as latency, CPU usage, memory utilization, hit rate, and the number of connected clients/slaves/evictions must be monitored to maintain Redis’s high-throughput and low-latency capabilities. Similarly, increased throughput signals a more intensive workload on the server and, typically, higher latency.
Identifying key Redis metrics such as latency, CPU usage, and memory metrics is crucial for effective Redis monitoring. To monitor Redis instances effectively, collect Redis metrics focusing on cache hit ratio, memory allocated, and latency threshold. It is important to understand these challenges properly to find solutions for them.
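As a rough illustration of collecting the metrics just mentioned, here is a sketch using the redis-py client; the connection details are placeholders, the hit ratio is derived from INFO counters, and latency is approximated with a timed PING.

```python
# Sketch of basic Redis monitoring with redis-py (connection details are
# placeholders). Hit ratio comes from INFO counters; latency from a timed PING.
import time
import redis

r = redis.Redis(host="localhost", port=6379)

info = r.info()
hits = info.get("keyspace_hits", 0)
misses = info.get("keyspace_misses", 0)
hit_ratio = hits / (hits + misses) if (hits + misses) else float("nan")

start = time.perf_counter()
r.ping()
ping_latency_ms = (time.perf_counter() - start) * 1000

print(f"cache hit ratio  : {hit_ratio:.2%}")
print(f"used memory      : {info.get('used_memory_human')}")
print(f"connected clients: {info.get('connected_clients')}")
print(f"evicted keys     : {info.get('evicted_keys')}")
print(f"PING latency     : {ping_latency_ms:.2f} ms")
```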
For example, the most fundamental abstraction trade-off has always been latency versus throughput. Modern CPUs strongly favor lower latency of operations, with clock cycles in the nanoseconds, and we have built general-purpose software architectures that can exploit these low latencies very well.
PostgreSQL cluster: one coordinator node (citus-coord-01) and three worker nodes (citus1, citus2, citus3). Hardware: AWS c5.xlarge instance, Ubuntu Server 20.04, 64-bit (x86), SSD volume type. And now, execute the benchmark on the coordinator node: pgbench -c 20 -j 3 -T 60 -P 3 pgbench. The results are not pretty.
The innodb_io_capacity_max parameter was set to 2000, so the hardware should be able to deliver that many IOPS without major issues. Yes, this is true: io2 volumes are expensive, and honestly, I think they should be used only where really high IO capacity at expected latencies is required, and this didn’t seem to be the case.
Hardware (memory): The amount of RAM to be provisioned for database servers can vary greatly depending on the size of the database and the specific requirements of the company. Some servers may need a few GBs of RAM, while others may need hundreds of GBs or even terabytes of RAM.
Last week we learned about the increased tail-latency sensitivity of microservices-based applications with high RPC fan-outs. Seer uses estimates of queue depths to mitigate latency spikes on the order of 10-100 ms, in conjunction with a cluster manager. So what we have here is a glimpse of the limits for low-latency RPCs under load.
Here are the bombshell paragraphs: Our datacenter applications seek ever more CPU-efficient and lower-latency communication, which Pony Express delivers. The desire for CPU efficiency and lower latencies is easy to understand. When under/over provisioning is not a concern, this can deliver very low latency. (Emphasis mine.)
Server-generated assets, since client-side generation would require the retrieval of many individual images, which would increase latency and time-to-render. To reduce latency, assets should be generated in an offline fashion and not in real time. Different assets for different device types and screen sizes.
Identifying key Redis® metrics such as latency, CPU usage, and memory metrics is crucial for effective Redis monitoring. To monitor Redis® instances effectively, collect Redis metrics focusing on cache hit ratio, memory allocated, and latency threshold. It is important to understand these challenges properly to find solutions for them.
Improved performance : MongoDB continually fine-tunes its database engine, resulting in faster query execution and reduced latency. You should also review your hardware resources, how you use MongoDB, and any custom configurations.
Edge servers are the middle ground – more compute power than a mobile device, but with latency of just a few ms. These use their regression models to estimate processing time (which will depend on the hardware available, current load, etc.). Why would we want to live migrate web workers?
In particular this has been true for applications based on algorithms - often MPI-based - that depend on frequent low-latency communication and/or require significant cross sectional bandwidth. There is no more need for hardware tinkering to keep the clusters up and running (I spent many nights doing this; there is no glory in it).
In traditional database architectures, database engines often run a small search engine or data warehouse engines on the same hardware as the database. No matter which mechanism you choose to use, we make the stream data available to you instantly (latency in milliseconds) and how fast you want to apply the changes is up to you.