Benchmarking, Cache and Design - Technology Performance Pulse

Benchmark (YCSB) numbers for Redis, MongoDB, Couchbase2, Yugabyte and BangDB

High Scalability

FEBRUARY 17, 2021

Application example: user profile cache, where profiles are constructed elsewhere (e.g., The latency table shows that 99th percentile latency for Yugabyte is quite high compared to others (lower is better). Workload C: Read only. This workload is 100% read. However, Very high Read latency for MonoDB makes it the last db to finish the task.

Benchmarking

Benchmarking Latency C++ Database

Introducing Configurable Metaflow

The Netflix TechBlog

DECEMBER 19, 2024

This has been a guiding design principle with Metaflow since its inception. this could take a few minutes) All packages already cached in s3. All environments already cached in s3. Take a look at two interesting examples of this pattern in the documentation. training.mP4eIStG.yaml run --prediction_date20241006 Metaflow 2.12.39+nflxfastdata(2.13.5);nflx(2.13.5);metaboost(0.0.27)

Best Practices

Best Practices Cache Metrics Code

Distance-Based ISA for Efficient Register Management

ACM Sigarch

APRIL 2, 2025

To create a CPU core that can execute a large number of instructions in parallel, it is necessary to improve both the architecturewhich includes the overall CPU design and the instruction set architecture (ISA) designand the microarchitecture, which refers to the hardware design that optimizes instruction execution.

Efficiency

Efficiency Hardware Architecture Design

Why Tcl is 700% faster than Python for database benchmarking

HammerDB

JULY 5, 2022

Python is a popular programming language, especially for beginners, and consequently we see it occurring in places where it just shouldn’t be used, such as database benchmarking. We use stored procedures because, as the introductory post shows, using single SQL statements turns our database benchmark into a network test).

Benchmarking

Benchmarking Database C++ Virtualization

PostgreSQL: Pgpool-II Use Cases and Benefits

Percona

APRIL 6, 2023

Query caching Pgpool-II can cache frequently used queries in memory, reducing the load on your PostgreSQL servers and improving response times. This means that when a query is executed, pgpool-II can check the cache first to see if the results are already available rather than sending the query to the database server.

Cache

Cache Open Source Benchmarking Database

What Web Designers Can Do To Speed Up Mobile Websites

Smashing Magazine

JUNE 26, 2019

What Web Designers Can Do To Speed Up Mobile Websites. What Web Designers Can Do To Speed Up Mobile Websites. I recently wrote a blog post for a web designer client about page speed and why it matters. However, their focus has always been on making a great-looking and effective design. Suzanne Scacca. Minification.

Website

Website Speed Mobile Design

Crucial Redis Monitoring Metrics You Must Watch

Scalegrid

JANUARY 25, 2024

Key metrics like throughput, request latency, and memory utilization are essential for assessing Redis health, with tools like the MONITOR command and Redis-benchmark for latency and throughput analysis and MEMORY USAGE/STATS commands for evaluating memory. offers the Software Watchdog specifically designed for this purpose.

Metrics

Metrics Monitoring Latency Cache

Compress objects, not cache lines: an object-based compressed memory hierarchy

The Morning Paper

MAY 23, 2019

Compress objects, not cache lines: an object-based compressed memory hierarchy Tsai & Sanchez, ASPLOS’19. One of the important attributes of their design was easy and rapid deployment across an existing fleet. Consider a B-Tree node from the B-tree Java benchmark: Uncompressed, it’s memory layout looks like (a) below.

Cache

Cache Benchmarking Hardware Java

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

Key Takeaways Redis offers complex data structures and additional features for versatile data handling, while Memcached excels in simplicity with a fast, multi-threaded architecture for basic caching needs. Redis is better suited for complex data models, and Memcached is better suited for high-throughput, string-based caching scenarios.

Cache

Cache Storage Scalability Architecture

The evolution of single-core bandwidth in multicore processors

John McCalpin

APRIL 25, 2023

For most high-end processors these values have remained in the range of 75% to 85% of the peak DRAM bandwidth of the system over the past 15-20 years — an amazing accomplishment given the increase in core count (with its associated cache coherence issues), number of DRAM channels, and ever-increasing pipelining of the DRAMs themselves.

Benchmarking

Benchmarking Cache Latency Tuning

Characterizing, modeling, and benchmarking RocksDB key-value workloads at Facebook

The Morning Paper

MARCH 10, 2020

Characterizing, modeling, and benchmarking RocksDB key-value workloads at Facebook , Cao et al., Or in the case of key-value stores, what you benchmark. So if you want to design a system that will offer good real-world performance, it’s really useful to have benchmarks that accurately represent real-world workloads.

Benchmarking

Benchmarking Storage Cache Open Source

Core Web Vitals Tools To Boost Your Web Performance Scores

Smashing Magazine

AUGUST 9, 2022

PageSpeed Compare is a page speed evaluation and benchmarking tool. It also lists cached resources and identifies unused Javascript. With Treo Site Speed, you can also benchmark a website against competitors. Continue reading below ↓ Meet Smashing Email Newsletter with useful tips on front-end, design & UX.

Performance

Performance Benchmarking Website Metrics

20X Faster Backup Preparation With Percona XtraBackup 8.0.33-28!

Percona

JULY 25, 2023

After the “data dictionary” (DD) engine and DD cache are initialized on a server, the Storage Engines can ask for a table definition. Initializing a DD engine and the cache adds complexity and other server dependencies. Old design (until Percona XtraBackup 8.0.33-27): New design (from Percona XtraBackup 8.0.33-28)

Cache

Cache Servers Benchmarking Design

Percona Monitoring and Management 2 Scaling and Capacity Planning

Percona

MARCH 17, 2023

These updates are designed to keep databases running at peak performance and simplify database operations. Please note that the focus of these tests was around standard metrics gathering and display, we’ll use a future blog post to benchmark some of the more intensive query analytics (QAN) performance numbers.

Monitoring

Monitoring Scalability Database Cache

Improving The Performance Of An Online Store (Case Study)

Smashing Magazine

JUNE 2, 2021

The design was made by a design and UX agency that also handled the HTML prototype (based on Bootstrap 4). For the launch, we mostly focused on getting the new design out the door, but once the website’s relaunch went live, we started focusing our attention on turning the red and orange scores to greens. Large preview ).

Performance

Performance Cache Website Mobile

The Return of the Frame Pointers

Brendan Gregg

MARCH 16, 2024

Only in extreme circumstances does the cost (in processor time and I-cache footprint) translate to a tangible benefit - circumstances which usually resort to hand-coded assembly anyway. It shouldn't be 10%, unless it's cache effects. And for leaf routines (which never establish a frame), this is a non-issue.

Java

Java Cache Google Hardware

HammerDB for Managers

HammerDB

JUNE 27, 2022

HammerDB is a software application for database benchmarking. Databases are highly sophisticated software, and to design and run a fair benchmark workload is a complex undertaking. However, although these results are the gold standard of database benchmarking to do so, requires time, expertise and not insignificant cost.

Benchmarking

Benchmarking Open Source C++ Cache

AppFabric Caching: Retry Later

ScaleOut Software

MAY 15, 2014

Given all this, we thought it would be a good opportunity to see how we are doing relative to the competition, and in particular, relative to Microsoft’s AppFabric caching for Windows on-premise servers. One or more specified cache servers are unavailable, which could be caused by busy network or servers. …). Please retry later.

Cache

Cache Servers Network Design

Tuning PostgreSQL Database Parameters to Optimize Performance

Percona

AUGUST 31, 2018

This parameter sets how much dedicated memory will be used by PostgreSQL for cache. If your working set of data can easily fit into your RAM, then you might want to increase the shared_buffer value to contain your entire database, so that the whole working set of data can reside in cache. wal_buffers. effective_cache_size.

Tuning

Tuning Database Performance Benchmarking

Choosing a cloud DBMS: architectures and tradeoffs

The Morning Paper

AUGUST 29, 2019

use the TPC-H benchmark to assess Redshift, Redshift Spectrum, Athena, Presto, Hive, and Vertica to find out what works best and the trade-offs involved. The design space. in the TPC-H Benchmark Standard for details of the queries). Query performance is measured from both warm and cold caches. System initialisation time.

Architecture

Architecture Cloud Storage Serverless

Understanding MySQL Triggers: Exploring How Triggers Impact MySQL Memory Allocation

Percona

SEPTEMBER 1, 2023

However, they can also introduce performance bottlenecks, especially if poorly designed. Well-designed indexes can significantly improve query performance within triggers. Carefully design transactions to avoid unnecessary locking and contention. Benchmark different trigger implementations to identify the most efficient option.

Cache

Cache Database Games Healthcare

Comparisons of Proxies for MySQL

Percona

MARCH 20, 2023

When designing an architecture, many components need to be considered before deciding on the best solution. In this context, features like filtering, firewalling, or caching are redundant and may consume resources that could be allocated to scaling. MySQL Router was never in the game.

Games

Games Latency Traffic Cache

Fixing a slow site iteratively

CSS - Tricks

APRIL 1, 2021

Google’s industry benchmarks from 2018 also provide a striking breakdown of how each second of loading affects bounce rates. Compressing, minifying and caching assets. We can compress our assets, minify our styles and scripts, and cache things responsibly so we’re serving what the user needs in the most efficient way possible.

Cache

Cache Social Media Media Website

Don’t Sink Your Website With Third Parties

Smashing Magazine

JUNE 1, 2022

You’ve spent months putting together a great website design, crowd-pleasing content, and a business plan to bring it all together. You’ve focused on making the web design responsive to ensure that the widest audience of visitors can access your content. You’ve agonized over design patterns and usability. Ken Harker.

Website

Website Design Network Testing

Towards multiverse databases

The Morning Paper

JUNE 16, 2019

Such a design rules out an entire class of application errors, protecting private data from accidentally leaking. Although state-of-the-are databases have security features designed for exactly this purpose, such as row-level access policies and grants of views, these features are too limiting for many web applications.

Database

Database Cache Benchmarking Efficiency

Progress Delayed Is Progress Denied

Alex Russell

APRIL 29, 2021

As an engineer on a browser team, I'm privy to the blow-by-blow of various performance projects, benchmark fire drills, and the ways performance marketing (deeply) impacts engineering priorities. With each team, benchmarks lost are understood as bugs. For example, an area where Apple is absolutely killing it is in mobile CPU design.

Media

Media Games Education Engineering

The Surprising Effectiveness of Non-Overlapping, Sensitivity-Based Performance Models

John McCalpin

APRIL 2, 2020

The presentation discusses a family of simple performance models that I developed over the last 20 years — originally in support of processor and system design at SGI (1996-1999), IBM (1999-2005), and AMD (2006-2008), but more recently in support of system procurements at The Texas Advanced Computing Center (TACC) (2009-present).

Benchmarking

Benchmarking Performance Latency Architecture

How to Assess MySQL Performance

HammerDB

APRIL 19, 2023

GHz 4th Generation Intel Xeon Scalable processors (code-named Sapphire Rapids) Up to 20% higher compute performance than z1d instances Up to 50 Gbps of networking speed Up to 40 Gbps of bandwidth to the Amazon Elastic Block Store (EBS) We can also verify these capabilities by running some simple benchmarks on the different subsystems.

Performance

Performance Benchmarking Cache Storage

The Ultimate Guide to Database High Availability

Percona

JUNE 22, 2023

Typically, the servers are configured in a primary/replica configuration, with one server designated as the primary server that handles all incoming requests and the others designated as replica servers that monitor the primary and take over its workload if it fails. This flexibility can be crucial in designing a scalable architecture.

Availability

Availability Database Open Source Hardware

How I Used Brotli to Get Even Smaller CSS and JavaScript Files at CDN Scale

CSS - Tricks

JUNE 5, 2020

The service categorized all the optimizations in three groups consisting of several “Content,” “Delivery,” and “Cache” optimizations. In my benchmarks, Brotli:11 takes several hundred milliseconds to compress a single minified jQuery file. of Common Compression (A) Brotli:11 (B) ( A) / (B) – 1 Ant Design 1,938.99

Cache

Cache Website Servers Open Source

Azure Virtual Machines for SQL Server Usage

SQL Performance

DECEMBER 17, 2019

Microsoft currently has eight main types of virtual machines designed for different types of workloads. Microsoft's Mine Tokus has a good article where she ran a series of scaled down TPC-E benchmarks against an E64s_v3 VM that was using the older Broadwell processor. GHz, 128MB of L3 cache, 128 PCIe 4.0

Azure

Azure Virtualization Servers Storage

PostgreSQL Performance Tuning: Optimizing Database Parameters for Maximum Efficiency

Percona

MAY 1, 2023

Schema design: Evaluating the database schema design and making adjustments such as partitioning large tables, eliminating redundant data, and denormalizing tables for frequently accessed information can improve performance. This parameter sets how much dedicated memory will be used by PostgreSQL for the cache.

Tuning

Tuning Database Efficiency Performance

The evolution of single-core bandwidth in multicore processors

John McCalpin

APRIL 25, 2023

For most high-end processors these values have remained in the range of 75% to 85% of the peak DRAM bandwidth of the system over the past 15-20 years — an amazing accomplishment given the increase in core count (with its associated cache coherence issues), number of DRAM channels, and ever-increasing pipelining of the DRAMs themselves.

Benchmarking

Benchmarking Cache Latency Tuning

Supercomputing Predictions: Custom CPUs, CXL3.0, and Petalith Architectures

Adrian Cockcroft

JANUARY 20, 2023

Here’s some predictions I’m making: Jack Dongarra’s efforts to highlight the low efficiency of the HPCG benchmark as an issue will influence the next generation of supercomputer architectures to optimize for sparse matrix computations. In early January a related paper was published by Satoshi Matsuoka et. petaflops, which is 0.8%

Architecture

Architecture Latency Benchmarking AWS

Can You Afford It?: Real-world Web Performance Budgets

Alex Russell

OCTOBER 22, 2017

Budgets are scaled to a benchmark network & device. Deciding what benchmark to use for a performance budget is crucial. Simulated packet loss and variable latency, however, can make benchmarking extremely difficult and slow. They use “do what it takes” language to describe the efforts to get and stay fast.

Performance

Performance Network Benchmarking Mobile

The Surprising Effectiveness of Non-Overlapping, Sensitivity-Based Performance Models

John McCalpin

APRIL 2, 2020

The presentation discusses a family of simple performance models that I developed over the last 20 years — originally in support of processor and system design at SGI (1996-1999), IBM (1999-2005), and AMD (2006-2008), but more recently in support of system procurements at The Texas Advanced Computing Center (TACC) (2009-present).

Benchmarking

Benchmarking Performance Latency Architecture

Performance tips for building responsive sites

Speed Curve

DECEMBER 25, 2013

Responsive Web Design (RWD) is now a well established technique yet it’s adoption is still surprisingly low. All CMS's should be offering this level of image manipulation and caching. There are great tools available to monitor the actual in browser speed and benchmark your site against others.

Performance

Performance Mobile Website Benchmarking

MySQL Performance Tuning 101: Key Tips to Improve MySQL Database Performance

Percona

SEPTEMBER 1, 2023

These issues often arise from suboptimal query design, missing or ineffective indexes, or dealing with large datasets. Inefficient query design, such as utilizing SELECT * instead of specifying necessary columns, can escalate data transfer and processing overhead. Another highly beneficial caching method is key-value caching.

Tuning

Tuning Database Performance Hardware

AMD EPYC Processors in Azure Virtual Machines

SQL Performance

APRIL 8, 2019

The L3 cache size is 64MB. I wrote about using CPU-Z to benchmark the Intel Xeon E5-2673 v3 processor in an Azure VM in this article. Figure 1: CPU-Z Benchmark Results for LS16v2. The L3 cache size is 64MB. Like all AMD EPYC 7000 series processors, this particular SKU supports 128 PCIe 3.0 lanes for I/O connectivity.

Azure

Azure Virtualization Storage Benchmarking

Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices

The Morning Paper

MAY 14, 2019

Last time around we looked at the DeathStarBench suite of microservices-based benchmark applications and learned that microservices systems can be especially latency sensitive, and that hotspots can propagate through a microservices architecture in interesting ways. ASPLOS’19. Distributed tracing and instrumentation.

Big Data

Big Data Cloud Performance Hardware

Upcoming of the learned data structures

Abhishek Tiwari

DECEMBER 13, 2017

This has far-reaching implications how future data systems and algorithms will be designed. They demonstrated that neural nets based learned index outperforms cache-optimized B-Tree index by up to 70% in speed while saving an order-of-magnitude in memory.

Artificial Intelligence

Artificial Intelligence Hardware Google Benchmarking

SQL Server I/O Basics Chapter #1

SQL Server According to Bob

JANUARY 11, 2020

Stable media is commonly physical disk storage, but other devices and certain caching facilities qualify as well. Many high-end disk subsystems provide high-speed cache facilities to reduce the latency of read and write operations. This cache is often supported by a battery-powered backup facility.

Servers

Servers Cache Media Hardware

HOW IT WORKS: SQL Server Scheduler Affinity

SQL Server According to Bob

AUGUST 23, 2021

Manual affinity instructs SQL Server to set the worker’s CPU affinity to only the CPU designated for the scheduler. MANUAL affinity provides the best, top end performance (for benchmarks by utilizing L1 caches) but is susceptible to noisy, CPU neighbors.

Servers

Servers Azure Operating System Database

Five Data-Loading Patterns To Improve Frontend Performance

Smashing Magazine

SEPTEMBER 28, 2022

On your first try, you can use it as a benchmark for optimizations later. On design systems, UX, web performance and CSS/JS. Active Memory Caching. When you want to get data that you already had quickly, you need to do caching — caching stores data that a user recently retrieved. Caching Schemes.

Cache

Cache Performance Servers Architecture

Benchmark (YCSB) numbers for Redis, MongoDB, Couchbase2, Yugabyte and BangDB

Introducing Configurable Metaflow

Trending Sources

Distance-Based ISA for Efficient Register Management

Why Tcl is 700% faster than Python for database benchmarking

PostgreSQL: Pgpool-II Use Cases and Benefits

What Web Designers Can Do To Speed Up Mobile Websites

Crucial Redis Monitoring Metrics You Must Watch

Compress objects, not cache lines: an object-based compressed memory hierarchy

Redis vs Memcached in 2024

The evolution of single-core bandwidth in multicore processors

Characterizing, modeling, and benchmarking RocksDB key-value workloads at Facebook

Core Web Vitals Tools To Boost Your Web Performance Scores

20X Faster Backup Preparation With Percona XtraBackup 8.0.33-28!

Percona Monitoring and Management 2 Scaling and Capacity Planning

Improving The Performance Of An Online Store (Case Study)

The Return of the Frame Pointers

HammerDB for Managers

AppFabric Caching: Retry Later

Tuning PostgreSQL Database Parameters to Optimize Performance

Choosing a cloud DBMS: architectures and tradeoffs

Understanding MySQL Triggers: Exploring How Triggers Impact MySQL Memory Allocation

Comparisons of Proxies for MySQL

Fixing a slow site iteratively

Don’t Sink Your Website With Third Parties

Towards multiverse databases

Progress Delayed Is Progress Denied

The Surprising Effectiveness of Non-Overlapping, Sensitivity-Based Performance Models

How to Assess MySQL Performance

The Ultimate Guide to Database High Availability

How I Used Brotli to Get Even Smaller CSS and JavaScript Files at CDN Scale

Azure Virtual Machines for SQL Server Usage

PostgreSQL Performance Tuning: Optimizing Database Parameters for Maximum Efficiency

The evolution of single-core bandwidth in multicore processors

Supercomputing Predictions: Custom CPUs, CXL3.0, and Petalith Architectures

Can You Afford It?: Real-world Web Performance Budgets

The Surprising Effectiveness of Non-Overlapping, Sensitivity-Based Performance Models

Performance tips for building responsive sites

MySQL Performance Tuning 101: Key Tips to Improve MySQL Database Performance

AMD EPYC Processors in Azure Virtual Machines

Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices

Upcoming of the learned data structures

SQL Server I/O Basics Chapter #1

HOW IT WORKS: SQL Server Scheduler Affinity

Five Data-Loading Patterns To Improve Frontend Performance

Stay Connected