The Multicore Era. Over the past ~15 years, server processors from Intel and AMD have evolved from the early quad-core processors to the current monsters with over 50 cores per socket. The example below is for a 2005-era processor with 60 ns memory latency and 6.4 … If we want to sustain full bandwidth, we need 64/2 = 32 cache lines in flight.
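The sizing arithmetic behind claims like this is Little's law: bytes in flight = bandwidth × latency. A minimal sketch of that calculation, assuming the truncated "6.4" above is GB/s of memory bandwidth and a 64-byte cache line; the excerpt's own "32 cache lines" figure presumably rests on numbers lost in the truncation:

```python
# Little's law for memory: to keep the pipe full, the CPU must have
# bandwidth * latency bytes of requests outstanding at all times.
# Assumptions (not confirmed by the source): 6.4 GB/s bandwidth, 64 B lines.

BANDWIDTH_BYTES_PER_S = 6.4e9   # assumed unit: GB/s (plausible for 2005)
LATENCY_S = 60e-9               # 60 ns memory latency, from the excerpt
CACHE_LINE_BYTES = 64

bytes_in_flight = BANDWIDTH_BYTES_PER_S * LATENCY_S
lines_in_flight = bytes_in_flight / CACHE_LINE_BYTES
print(f"{bytes_in_flight:.0f} B in flight = "
      f"{lines_in_flight:.0f} outstanding cache lines to sustain full bandwidth")
```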
Its partitioned log architecture supports both queuing and publish-subscribe models, allowing it to handle large-scale event processing with minimal latency. Kafka clusters can be deployed in Kubernetes using Helm charts to simplify scaling and management across multiple servers.
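As a hedged illustration of those two models, here is a minimal sketch using the confluent-kafka Python client; the broker address, topic name, and group id are placeholders, not values from the article:

```python
# Producing to and consuming from a Kafka topic; the same partitioned
# log serves both queuing and publish-subscribe, depending on group.id.
from confluent_kafka import Producer, Consumer

producer = Producer({"bootstrap.servers": "localhost:9092"})
producer.produce("events", key="user-42", value=b'{"action": "login"}')
producer.flush()  # block until the broker acknowledges the message

# Queuing: consumers sharing a group.id split the topic's partitions,
# so each event is processed by exactly one member of the group.
# Publish-subscribe: give each consumer its own group.id and every
# group receives the full stream.
consumer = Consumer({
    "bootstrap.servers": "localhost:9092",
    "group.id": "billing-service",
    "auto.offset.reset": "earliest",
})
consumer.subscribe(["events"])
msg = consumer.poll(timeout=5.0)
if msg is not None and msg.error() is None:
    print(msg.key(), msg.value())
consumer.close()
```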
A critical component of this success was that the Dynatrace team itself uses the Dynatrace Platform to monitor every single Dynatrace cluster in the cloud, and trusts the Dynatrace Davis AI to alert if there are any issues, whether with a new feature, a configuration change, or the infrastructure our servers run on.
A lot of people surmise that TTFB is merely time spent on the server, but that is only a small fraction of the true extent of things. The first thing I want to draw your attention to, and often the most surprising for people to learn, is that TTFB includes one whole round trip of network latency. But what else is in TTFB?
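A rough way to see this from the client side is to time the request yourself. A sketch in Python with the requests library (the URL is a placeholder): the measured span covers DNS, TCP and TLS setup, sending the request, server processing, and the round trip back, not just server time.

```python
# Crude client-side TTFB: time from issuing the request until the
# first byte of the response body is available.
import time
import requests

url = "https://example.com/"
start = time.perf_counter()
resp = requests.get(url, stream=True)             # returns once headers arrive
first_byte = next(resp.iter_content(chunk_size=1))  # pull the first body byte
ttfb = time.perf_counter() - start
print(f"TTFB (including connection setup): {ttfb * 1000:.1f} ms")
```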
The network latency between cluster nodes should be around 10 ms or less. Our Premium High Availability comes with the following features: an active-active deployment model for optimum hardware utilization, savings on hardware and network bandwidth costs to reduce total cost of ownership, and a self-contained turnkey solution.
It enables multiple operating systems to run simultaneously on the same physical hardware and integrates closely with Windows-hosted services. As a result, they experience how the application code functions and how application operations depend on the underlying hardware resources and the operating system managed by Hyper-V.
The 2014 launch of AWS Lambda marked a milestone in how organizations use cloud services to deliver their applications more efficiently, by running functions at the edge of the cloud without the cost and operational overhead of on-premises servers. AWS continues to improve how it handles latency issues. What is AWS Lambda?
This allows teams to sidestep much of the cost and time associated with managing hardware, platforms, and operating systems on-premises, while also gaining the flexibility to scale rapidly and efficiently. When an application is triggered after sitting idle, the startup itself introduces latency; this cold-start cost recurs whenever instances need to restart.
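A minimal sketch of where that cold-start latency lives in a Python Lambda: module-level code runs once per container start, while the handler runs on every invocation. The handler shape is the standard Lambda convention; the commented-out init call is a hypothetical placeholder.

```python
import json
import time

CONTAINER_STARTED = time.time()       # runs once, at cold start
# connection = build_db_connection()  # hypothetical: pay heavy init here, once

def lambda_handler(event, context):
    # Runs on every invocation; on a warm container only this part executes.
    age = time.time() - CONTAINER_STARTED
    return {
        "statusCode": 200,
        "body": json.dumps({"container_age_seconds": round(age, 1)}),
    }

if __name__ == "__main__":
    print(lambda_handler({}, None))   # local smoke test
```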
Besides the traditional system hardware, storage, routers, and software, ITOps also includes virtual components of the network and cloud infrastructure. Computer operations manages the physical location of the servers (cooling, electricity, and backups) and monitors and responds to alerts. What does IT operations do?
Achieving 100 Gbps intrusion prevention on a single server, Zhao et al. Today's paper choice is a wonderful example of pushing the state of the art on a single server. This makes the whole system latency sensitive. Moreover, Pigasus wants to do all this on a single server! Can you really do all this on a single server?
It requires purchasing, powering, and configuring physical hardware, training and retaining the staff capable of servicing and securing the machines, operating a data center, and so on. They need enough hardware to serve their anticipated volume and keep things running smoothly without buying too much or too little. Reduced cost.
Balancing Low Latency, High Availability and Cloud Choice. Cloud hosting is no longer just an option; in many cases it is now the default choice. Before that shift, IT teams picked hardware somewhat blindly, with a strong bias towards oversizing, leading to systems running at 10-15% of maximum capacity.
Behind the scenes, Amazon DynamoDB automatically spreads the data and traffic for a table over a sufficient number of servers to meet the request capacity specified by the customer. Amazon DynamoDB offers low, predictable latencies at any scale, in contrast to SimpleDB's read latency, which grows as dataset sizes grow. Consistency.
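For flavor, a minimal boto3 sketch of the key-value access pattern DynamoDB exposes; the table name and key schema are hypothetical, and the table is assumed to already exist with capacity provisioned and AWS credentials configured:

```python
import boto3

dynamodb = boto3.resource("dynamodb")
table = dynamodb.Table("user-sessions")   # hypothetical table

table.put_item(Item={"user_id": "42", "last_seen": "2024-01-01T00:00:00Z"})
resp = table.get_item(Key={"user_id": "42"}, ConsistentRead=False)
print(resp.get("Item"))

# ConsistentRead=False (the default) uses eventually consistent reads,
# which are cheaper and typically lower latency; set True when the
# application needs read-after-write consistency.
```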
You will need to know which Redis monitoring metrics to watch, and a tool to monitor these critical server metrics, to ensure the server's health. Understanding Redis Performance Indicators: Redis is designed to handle high traffic at low latency with its in-memory data store and efficient data structures.
Hardware: Memory. The amount of RAM to be provisioned for database servers can vary greatly depending on the size of the database and the specific requirements of the company. Some servers may need a few GBs of RAM, while others may need hundreds of GBs or even terabytes of RAM. Benchmark before you decide.
Identifying key Redis metrics such as latency, CPU usage, and memory metrics is crucial for effective Redis monitoring. To monitor Redis instances effectively, collect Redis metrics focusing on cache hit ratio, memory allocated, and latency threshold.
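A minimal sketch of collecting those three signals with the redis-py client; host and port are placeholders:

```python
import time
import redis

r = redis.Redis(host="localhost", port=6379)
info = r.info()  # Redis INFO command, parsed into a dict

hits = info.get("keyspace_hits", 0)
misses = info.get("keyspace_misses", 0)
total = hits + misses
if total:
    print(f"cache hit ratio : {hits / total:.2%}")
print(f"memory allocated: {info.get('used_memory_human')}")

# Crude latency probe: round-trip time of a PING.
start = time.perf_counter()
r.ping()
print(f"PING round trip : {(time.perf_counter() - start) * 1000:.2f} ms")
```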
I summarized these topics and more as a plenary conference talk, including my own predictions (as a senior performance engineer) for the future of computing performance, with a focus on back-end servers. This was a chance to talk about other things I've been working on, such as the present and future of hardware performance.
72: signals sensed from a distant galaxy using AI; 12M: reddit posts per month; 10 trillion: test inputs per day Google generated with 100s of servers for several months using OSS-Fuzz; 200%: growth in Cloud Native technologies used in production; $13 trillion: potential economic impact of AI by 2030; 1.8… They'll love you even more.
We are standing on the eve of the 5G era… 5G, as a monumental shift in cellular communication technology, holds tremendous potential for spurring innovations across many vertical industries, with its promised multi-Gbps speed, sub-10 ms latency, and massive connectivity. Throughput, latency, and energy consumption.
Lift & Shift is where you basically just move physical or virtual hosts to the cloud; essentially, you just run your host on somebody else's hardware. Remember: this is a critical aspect, as you do not want to migrate a service and suddenly introduce high latency or costs through a system you forgot you had a dependency on!
This is why our BYOC pricing is less than our Dedicated Hosting pricing, as the costs listed for BYOC are only what you pay for ScaleGrid and don’t include your hardware costs. Deploying your application and database on the same VPC also provides the lowest possible latency path. Where to host your cloud database? Security Groups.
An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems, Gan et al. The paper examines the implications of microservices at the hardware, OS and networking stack, cluster management, and application framework levels, as well as the impact of tail latency.
This is a given, whether you are using the highest quality hardware or lowest cost components. When customers left the constraining, old world of IT hardware and datacenters behind, they started to develop systems with new and interesting usage patterns that no one had ever seen before. Primitives not frameworks. APIs are forever.
After receiving these messages for some time, they eventually hit performance issues, to the point that the server becomes unresponsive for a few minutes. The innodb_io_capacity_max parameter was set to 2000, so the hardware should be able to deliver that many IOPS without major issues. After that, things went back to normal.
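To make the parameter concrete, a sketch of inspecting the InnoDB I/O capacity settings with the PyMySQL client; connection details are placeholders:

```python
import pymysql

conn = pymysql.connect(host="localhost", user="root", password="secret")
with conn.cursor() as cur:
    # Shows both innodb_io_capacity and innodb_io_capacity_max.
    cur.execute("SHOW GLOBAL VARIABLES LIKE 'innodb_io_capacity%'")
    for name, value in cur.fetchall():
        print(f"{name} = {value}")
    # innodb_io_capacity_max caps the IOPS InnoDB may use for background
    # flushing bursts; it should reflect what the storage can actually
    # sustain, not an optimistic guess.
conn.close()
```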
Last week we learned about the increased tail-latency sensitivity of microservices-based applications with high RPC fan-outs. Seer uses estimates of queue depths to mitigate latency spikes on the order of 10-100 ms, in conjunction with a cluster manager. So what we have here is a glimpse of the limits for low-latency RPCs under load.
Edge servers are the middle ground – more compute power than a mobile device, but with latency of just a few ms. The kind of edge server envisaged here might, for example, be integrated with your WiFi access point. As such, web workers are a natural target to offload to a more powerful server.
This enables customers to serve content to their end users with low latency, giving them the best application experience. In 2011, AWS opened a Point of Presence (PoP) in Stockholm for exactly that purpose. As well as AWS Regions, we also have 24 AWS Edge Network Locations in Europe.
PostgreSQL Cluster: one coordinator node (citus-coord-01) and three worker nodes (citus1, citus2, citus3). Hardware: AWS instances, c5.xlarge, Ubuntu Server 20.04, 64-bit (x86), SSD volume type. Steps: Provisioning. The first step is to provision the four nodes with both PostgreSQL and Citus. psql pgbench <<_eof1_ \qecho adding node citus3 …
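The truncated psql snippet appears to register worker nodes with the coordinator. A hedged reconstruction in Python with psycopg2, using the node names above; the database name and the citus_add_node() call are assumptions (older Citus releases used master_add_node instead):

```python
import psycopg2

# Connect to the coordinator; dbname "pgbench" is inferred from the
# truncated "psql pgbench" fragment and may not match the source post.
conn = psycopg2.connect(host="citus-coord-01", dbname="pgbench", user="postgres")
conn.autocommit = True
with conn.cursor() as cur:
    for worker in ("citus1", "citus2", "citus3"):
        print(f"adding node {worker}")
        # Registers the worker with the coordinator's metadata.
        cur.execute("SELECT citus_add_node(%s, %s)", (worker, 5432))
conn.close()
```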
Historically, NoSQL paid a lot of attention to tradeoffs between consistency, fault-tolerance and performance to serve geographically distributed systems, low-latency or highly available applications. A database should accommodate itself to different data distributions, cluster topologies and hardware configurations. Data Placement.
Server-generated assets, since client-side generation would require the retrieval of many individual images, which would increase latency and time-to-render. To reduce latency, assets should be generated in an offline fashion and not in real time. Localized images for each of the titles.
As a MySQL database administrator, keeping a close eye on the performance of your MySQL server is crucial to ensure optimal database operations. However, simply deploying a monitoring tool is not enough; you need to know which Key Performance Indicators (KPIs) to monitor to gain insights into your MySQL server’s health and performance.
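As a starting point, a sketch of pulling a handful of commonly watched KPIs from SHOW GLOBAL STATUS with PyMySQL; the credentials and the particular KPI list are illustrative, not a definitive set:

```python
import pymysql

KPIS = ("Threads_connected", "Threads_running", "Questions",
        "Slow_queries", "Innodb_buffer_pool_reads",
        "Innodb_buffer_pool_read_requests")

conn = pymysql.connect(host="localhost", user="monitor", password="secret")
with conn.cursor() as cur:
    cur.execute("SHOW GLOBAL STATUS")
    status = dict(cur.fetchall())   # rows are (Variable_name, Value)
conn.close()

for kpi in KPIS:
    print(f"{kpi:35s} {status.get(kpi)}")

# Buffer pool miss rate: reads that had to go to disk relative to all
# read requests; a rising ratio suggests the working set no longer fits.
reads = int(status["Innodb_buffer_pool_reads"])
requests = int(status["Innodb_buffer_pool_read_requests"])
if requests:
    print(f"buffer pool miss rate: {reads / requests:.4%}")
```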
For example, the most fundamental abstraction trade-off has always been latency versus throughput. Modern CPUs strongly favor lower latency of operations, with clock cycles in the nanoseconds, and we have built general-purpose software architectures that can exploit these low latencies very well. General-purpose GPU programming.
A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. These storage nodes collaborate to manage and disseminate the data across numerous servers spanning multiple data centers.
Questions
Q: I have a MySQL server with 500 GB of RAM; my data set is 100 GB. Keep in mind that setting the buffer pool size too high may result in other processes on your server competing for memory, which can impact performance.
Q: I have a MySQL server, and my application is writing at a rate of 100 MB/hour to my redo logs.
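Back-of-the-envelope arithmetic for both questions, as a sketch; the 75% ceiling and the one-hour redo window are common rules of thumb rather than the source's exact advice, and as the excerpt says: benchmark before you decide.

```python
ram_gb = 500
dataset_gb = 100

# Q1: with 500 GB of RAM and a 100 GB data set, the whole working set
# fits in memory; a buffer pool a bit larger than the data leaves the
# rest of the RAM for connections, the OS cache, and other processes.
buffer_pool_gb = min(dataset_gb * 1.2, ram_gb * 0.75)
print(f"suggested innodb_buffer_pool_size ~= {buffer_pool_gb:.0f} GB")

# Q2: at 100 MB/hour of redo writes, redo logs sized to absorb roughly
# an hour of writes avoid overly aggressive checkpointing.
write_rate_mb_per_hour = 100
redo_window_hours = 1          # assumed rule of thumb
redo_log_mb = write_rate_mb_per_hour * redo_window_hours
print(f"suggested total redo log size ~= {redo_log_mb} MB")
```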
Shredder is "a low-latency multi-tenant cloud store that allows small units of computation to be performed directly within storage nodes." Entering and exiting V8 contexts is less expensive than hardware-based isolation mechanisms, keeping request-processing latency low and throughput high.
In particular, this has been true for applications based on algorithms, often MPI-based, that depend on frequent low-latency communication and/or require significant cross-sectional bandwidth. There is no more need for hardware tinkering to keep the clusters up and running (I spent many nights doing this; there is no glory in it).
My personal opinion is that I don't see a widespread need for more capacity, given horizontal scaling and servers that can already exceed 1 TByte of DRAM; bandwidth is also helpful, but I'd be concerned about the increased latency of adding a hop to more memory. Ford, et al., “TCP
VM Import allows our customers to move virtual machine images from their datacenters to the Cloud and Amazon Direct Connect makes the network latencies and bandwidth between on-premises and AWS more predictable. AWS Identity and Access Management brings together on-premises and cloud identity management.
Early web applications relied less on client-side behavior and more on the server side for all of their navigation, query handling, and updates. A request is sent from the client, and an HTTP listener waiting on the server port receives the message, processes it, and then sends back the response. Connection closed by the server.
Do you have a web server? Is the web server running? The last item to check is whether the web server is able to talk to the database. These systems can include physical servers, containers, virtual machines, or even a device, or node, that connects and communicates with the network. Do you have a database? Peer-to-Peer.
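That checklist reduces to reachability probes. A minimal sketch with plain TCP connect tests; the hosts and ports are placeholders:

```python
import socket

# Hypothetical endpoints standing in for "web server" and "database".
CHECKS = {
    "web server": ("app.example.com", 443),
    "database":   ("db.example.com", 5432),
}

for name, (host, port) in CHECKS.items():
    try:
        # Succeeds only if something is listening and reachable.
        with socket.create_connection((host, port), timeout=3):
            print(f"{name:10s} reachable at {host}:{port}")
    except OSError as exc:
        print(f"{name:10s} NOT reachable at {host}:{port} ({exc})")
```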
Thankfully, I found some older linux-devel mailing list archives, rescued from server backups, often stored as tarballs of digests. It's time in some cgroup paths, but this server is not doing much disk I/O. Latency was acceptable and no one complained. My search was starting to feel cursed.
As we saw with the SOAP paper last time out, even with a fixed model variant and hardware there are a lot of different ways to map a training workload over the available hardware. First off, there still is a model of course (but then there are servers hiding behind a serverless abstraction too!). Autoscaling.