Benchmarking, Storage and Systems - Technology Performance Pulse

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

Introduction to Message Brokers Message brokers enable applications, services, and systems to communicate by acting as intermediaries between senders and receivers. This decoupling simplifies system architecture and supports scalability in distributed environments.

Latency

Latency Analytics Architecture Storage

Block Size and Its Impact on Storage Performance

DZone

JUNE 21, 2024

This article analyzes the correlation between block sizes and their impact on storage performance. This paper deals with definitions and understanding of structured data vs unstructured data, how various storage segments react to block size changes, and differences between I/O-driven and throughput-driven workloads.

Storage

Storage Performance Benchmarking Processing

PostgreSQL Benchmark: ScaleGrid vs. Amazon RDS

Scalegrid

NOVEMBER 4, 2024

Performance Benchmarking of PostgreSQL on ScaleGrid vs. AWS RDS Using Sysbench This article evaluates PostgreSQL’s performance on ScaleGrid and AWS RDS, focusing on versions 13, 14, and 15. This study benchmarks PostgreSQL performance across two leading managed database platforms—ScaleGrid and AWS RDS—using versions 13, 14, and 15.

Benchmarking

Benchmarking AWS Tuning Metrics

Measuring the importance of data quality to causal AI success

Dynatrace

JANUARY 4, 2024

Traditional analytics and AI systems rely on statistical models to correlate events with possible causes. It removes much of the guesswork of untangling complex system issues and establishes with certainty why a problem occurred. Fragmented and siloed data storage can create inconsistencies and redundancies. Timeliness.

Government

Government Analytics Benchmarking Storage

What is infrastructure monitoring and why is it mission-critical in the new normal?

Dynatrace

NOVEMBER 2, 2020

IT infrastructure is the heart of your digital business and connects every area – physical and virtual servers, storage, databases, networks, cloud services. This shift requires infrastructure monitoring to ensure all your components work together across applications, operating systems, storage, servers, virtualization, and more.

Infrastructure

Infrastructure Monitoring Virtualization Serverless

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

which is difficult when troubleshooting distributed systems. Troubleshooting a session in Edgar When we started building Edgar four years ago, there were very few open-source distributed tracing systems that satisfied our needs. Investigating a video streaming failure consists of inspecting all aspects of a member account.

Infrastructure

Infrastructure Transportation Storage Open Source

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

4:45pm-5:45pm NFX 202 A day in the life of a Netflix Engineer Dave Hahn , SRE Engineering Manager Abstract : Netflix is a large, ever-changing ecosystem serving millions of customers across the globe through cloud-based systems and a globally distributed CDN. We explore all the systems necessary to make and stream content from Netflix.

AWS

AWS Entertainment Open Source Benchmarking

Further improved handling and reliability of OneAgent deployments

Dynatrace

NOVEMBER 11, 2020

Dynatrace OneAgent deployment and life-cycle management are already widely considered to be industry benchmarks for reliability and efficiency. Easier rollout thanks to log storage best practices. Easier rollout thanks to log storage best practices. Dynatrace news. Advanced customization of OneAgent deployments made easy.

Best Practices

Best Practices Storage Java Benchmarking

Building a Media Understanding Platform for ML Innovations

The Netflix TechBlog

MARCH 14, 2023

We implemented a batch processing system for users to submit their requests and wait for the system to generate the output. This limited pilot system greatly reduced the time spent by our users to manually analyze the content. Maintaining disparate systems posed a challenge. Processing took several hours to complete.

Media

Media Innovation Energy Architecture

Maximizing Performance of AWS RDS for MySQL with Dedicated Log Volumes

Percona

DECEMBER 11, 2023

A Dedicated Log Volume (DLV) is a specialized storage volume designed to house database transaction logs separately from the volume containing the database tables. DLVs are particularly advantageous for databases with large allocated storage, high I/O per second (IOPS) requirements, or latency-sensitive workloads.

AWS

AWS Benchmarking Performance Traffic

PostgreSQL vs. Oracle: Difference in Costs, Ease of Use & Functionality

Scalegrid

JULY 13, 2020

Oracle Database is a commercial, proprietary multi-model database management system produced by Oracle Corporation, and the largest relational database management system (RDBMS) in the world. Compare ease of use across compatibility, extensions, tuning, operating systems, languages and support providers. Compare Ease of Use.

Open Source

Open Source Tuning C++ Database

Characterizing, modeling, and benchmarking RocksDB key-value workloads at Facebook

The Morning Paper

MARCH 10, 2020

Characterizing, modeling, and benchmarking RocksDB key-value workloads at Facebook , Cao et al., Or in the case of key-value stores, what you benchmark. So if you want to design a system that will offer good real-world performance, it’s really useful to have benchmarks that accurately represent real-world workloads.

Benchmarking

Benchmarking Storage Cache Open Source

Grafana Dashboards: A PoC Implementing the PostgreSQL Extension pg_stat_monitor

Percona

DECEMBER 26, 2023

Querying the data While it is reasonable to create panels showing real-time load in order to explore better the types of queries that can be run against pg_stat_monitor, it is more practical to copy and query the data into tables after the benchmarking has completed its run. A script executing a benchmarking run: #!/bin/bash

Benchmarking

Benchmarking Metrics C++ Database

InnoDB Performance Optimization Basics

Percona

MARCH 23, 2023

Storage The type of storage and disk used for database servers can have a significant impact on performance and reliability. Operating system Linux is the most common operating system for high-performance MySQL servers. Benchmark before you decide. Transparent huge pages (THP) disabled.

Performance

Performance Hardware Tuning Storage

How To Scale a Single-Host PostgreSQL Database With Citus

Percona

NOVEMBER 3, 2023

Rather than listing the concepts, function calls, etc, available in Citus, which frankly is a bit boring, I’m going to explore scaling out a database system starting with a single host. And now, execute the benchmark: -- execute the following on the coordinator node pgbench -c 20 -j 3 -T 60 -P 3 pgbench The results are not pretty.

Database

Database Benchmarking Latency C++

Evaluating the Evaluation: A Benchmarking Checklist

Brendan Gregg

JUNE 30, 2018

These have inspired me to summarize another performance activity: evaluating benchmark accuracy. Accurate benchmarking rewards engineering investment that actually improves performance, but, unfortunately, inaccurate benchmarking is more common. If the benchmark reported 20k ops/sec, you should ask: why not 40k ops/sec?

Benchmarking

Benchmarking Latency Cache Network

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

4:45pm-5:45pm NFX 202 A day in the life of a Netflix Engineer Dave Hahn , SRE Engineering Manager Abstract : Netflix is a large, ever-changing ecosystem serving millions of customers across the globe through cloud-based systems and a globally distributed CDN. We explore all the systems necessary to make and stream content from Netflix.

AWS

AWS Entertainment Open Source Benchmarking

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

4:45pm-5:45pm NFX 202 A day in the life of a Netflix Engineer Dave Hahn , SRE Engineering Manager Abstract : Netflix is a large, ever-changing ecosystem serving millions of customers across the globe through cloud-based systems and a globally distributed CDN. We explore all the systems necessary to make and stream content from Netflix.

AWS

AWS Entertainment Open Source Benchmarking

MySQL Key Performance Indicators (KPI) With PMM

Percona

JUNE 22, 2023

Indexing efficiency Monitoring indexing efficiency in MySQL involves analyzing query performance, using EXPLAIN statements, utilizing performance monitoring tools, reviewing error logs, performing regular index maintenance, and benchmarking/testing. This KPI is also directly related to Query Performance and helps improve it.

Performance

Performance Monitoring Traffic Database

Virtual consensus in Delos

The Morning Paper

NOVEMBER 8, 2020

While ultimately this new system should be able to take advantage of the latest advances in consensus for improved performance, that’s not realistic given a 6-9 month in-production target. It’s such a powerful idea that I can imagine distributed systems implementers everywhere adopting it from now on. What does the VirtualLog give us?

Virtualization

Virtualization Latency Storage Systems

RPC vs. Messaging – which is faster?

Particular Software

SEPTEMBER 20, 2021

Why RPC is “faster” It’s tempting to simply write a micro-benchmark test where we issue 1000 requests to a server over HTTP and then repeat the same test with asynchronous messages. If you did such a benchmark, here’s an incomplete picture you might end up with: Graph of microbenchmark showing RPC is faster than messaging.

Benchmarking

Benchmarking Latency Servers Systems

Percona Monitoring and Management 2 Scaling and Capacity Planning

Percona

MARCH 17, 2023

PMM2 uses VictoriaMetrics (VM) as its metrics storage engine. Please note that the focus of these tests was around standard metrics gathering and display, we’ll use a future blog post to benchmark some of the more intensive query analytics (QAN) performance numbers.

Monitoring

Monitoring Scalability Database Cache

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

This article will explore how they handle data storage and scalability, perform in different scenarios, and, most importantly, how these factors influence your choice. It uses a hash table to manage these pairs, divided into fixed-size buckets with linked lists for key-value storage. High data availability is achieved.

Cache

Cache Storage Scalability Architecture

What Is a Workload in Cloud Computing

Scalegrid

JANUARY 12, 2024

Simply put, it’s the set of computational tasks that cloud systems perform, such as hosting databases, enabling collaboration tools, or running compute-intensive algorithms. Such demanding use cases place a great value on systems capable of fast and reliable execution, a need that spans across various industry segments.

Cloud

Cloud Virtualization Storage Efficiency

Lerner?—?using RL agents for test case scheduling

The Netflix TechBlog

MAY 21, 2019

Netflix engineers run a series of tests and benchmarks to validate the device across multiple dimensions including compatibility of the device with the Netflix SDK, device performance, audio-video playback quality, license handling, encryption and security. We also provide an API client in Python.

Testing

Testing AWS Lambda Network

Choosing a cloud DBMS: architectures and tradeoffs

The Morning Paper

AUGUST 29, 2019

use the TPC-H benchmark to assess Redshift, Redshift Spectrum, Athena, Presto, Hive, and Vertica to find out what works best and the trade-offs involved. We focused on OLAP-oriented parallel data warehouse products available for AWS and restricted our attention to commercially available systems. Key findings. Key findings.

Architecture

Architecture Cloud Storage Serverless

DBaaS vs Self-Managed Cloud Databases

Scalegrid

DECEMBER 6, 2023

Understanding this DBaaS system might take some adjusting too similar just how traveling on cruises has its tempo which requires us to get used to optimally benefit from what it provides. The great thing about this is that it provides companies with an unprecedented level of freedom when configuring their system.

Database

Database Cloud Hardware Storage

Evaluating the Evaluation: A Benchmarking Checklist

Brendan Gregg

JUNE 29, 2018

These have inspired me to summarize another performance activity: evaluating benchmark accuracy. Accurate benchmarking rewards engineering investment that actually improves performance, but, unfortunately, inaccurate benchmarking is more common. If the benchmark reported 20k ops/sec, you should ask: why not 40k ops/sec?

Benchmarking

Benchmarking Latency Cache Network

Crucial Redis Monitoring Metrics You Must Watch

Scalegrid

JANUARY 25, 2024

Key metrics like throughput, request latency, and memory utilization are essential for assessing Redis health, with tools like the MONITOR command and Redis-benchmark for latency and throughput analysis and MEMORY USAGE/STATS commands for evaluating memory. It depends upon your application workload and its business logic.

Metrics

Metrics Monitoring Latency Cache

WAL Compression in PostgreSQL and Recent Improvements in Version 15

Percona

JANUARY 24, 2023

This will be clearly visible in PostgreSQL performance benchmarks as a “ Sawtooth wave ” pattern observed by Vadim in his tests: As we can see, the throughput suddenly drops after every checkpoint due to heavy WAL writing and gradually picks up until the next checkpoint. But this comes with a considerable performance implication.

Database

Database Benchmarking Open Source Latency

Fine-grained, secure and efficient data provenance on blockchain systems

The Morning Paper

SEPTEMBER 15, 2019

Fine-grained, secure and efficient data provenance on blockchain systems Ruan et al., That’s hard to do in today’s blockchain systems for two reasons: Provenance can only be determined by querying and replaying all on-chain transactions, which is inefficient and an offline activity. VLDB’19.

Blockchain

Blockchain Efficiency Systems Storage

The top 5 reasons to run your own database benchmarks

HammerDB

JANUARY 5, 2019

Some opinions claim that “Benchmarks are meaningless”, “benchmarks are irrelevant” or “benchmarks are nothing like your real applications” However for others “Benchmarks matter,” as they “account for the processing architecture and speed, memory, storage subsystems and the database engine.”

Benchmarking

Benchmarking Database Social Media Scalability

The Most Important MySQL Setting

Percona

APRIL 7, 2023

To illustrate this, I ran the Sysbench-TPCC synthetic benchmark against two different GCP instances running a freshly installed Percona Server for MySQL version 8.0.31 In MySQL, considering the standard storage engine, InnoDB , the data cache is called Buffer Pool. In PostgreSQL, it is called shared buffers.

Tuning

Tuning Cache Servers Benchmarking

2019 PostgreSQL Trends Report: Private vs. Public Cloud, Migrations, Database Combinations & Top Reasons Used

High Scalability

APRIL 3, 2019

PostgreSQL is an open source object-relational database system that has soared in popularity over the past 30 years from its active, loyal, and growing community. For the 2nd year in a row, PostgreSQL has kept the title of #1 fastest growing database in the world according to the DBMS of the Year report by the experts at DB-Engines.

Database

Database Cloud Open Source Systems

Is It a Read Intensive or a Write Intensive Workload?

Percona

AUGUST 30, 2018

Let’s examine the TPC-C Benchmark from this point of view, or more specifically its implementation in Sysbench. The illustrations below are taken from Percona Monitoring and Management (PMM) while running this benchmark. Let’s now look at the operating system level. Analyzing read/write workload by counts.

Benchmarking

Benchmarking Database Operating System Architecture

Mergeable replicated data types – Part II

The Morning Paper

NOVEMBER 26, 2019

An OCaml compiler extension for generating merge functions, and also for serializing and deserializing data structures for replication, using the third component of Quark… A content-addressable distributed storage abstraction, called the Quark store.

C++

C++ Storage Benchmarking Efficiency

How to Assess MySQL Performance

HammerDB

APRIL 19, 2023

Therefore, before we attempt to measure our database performance, we should know the system or cloud instance to be tested in detail. As database performance is heavily influenced by the performance of storage, network, memory, and processors, we must understand the upper limit of these key components. Operating System: Ubuntu 22.04

Performance

Performance Benchmarking Cache Storage

Azure Virtual Machines for SQL Server Usage

SQL Performance

DECEMBER 17, 2019

This removes the burden of purchasing and maintaining your hardware, storage and networking infrastructure, while still giving you a very familiar experience with Windows and SQL Server itself. You will still have to maintain your operating system, SQL Server and databases just like you would in an on-premises scenario.

Azure

Azure Virtualization Servers Storage

The Ultimate Guide to Database High Availability

Percona

JUNE 22, 2023

Defining high availability In general terms, high availability refers to the continuous operation of a system with little to no interruption to end users in the event of hardware or software failures, power outages, or other disruptions. Some disruption might occur, but it will be minimal. Fault tolerance aims for zero downtime and data loss.

Availability

Availability Database Open Source Hardware

HammerDB for Managers

HammerDB

JUNE 27, 2022

HammerDB is a software application for database benchmarking. HammerDB has graphical and command line interfaces for the Windows and Linux operating systems. Databases are highly sophisticated software, and to design and run a fair benchmark workload is a complex undertaking. Why HammerDB was developed. HammerDB Licensing.

Benchmarking

Benchmarking Open Source C++ Cache

Kubernetes for Big Data Workloads

Abhishek Tiwari

DECEMBER 27, 2017

faster access to external storage and data locality (I/O, bandwidth). A recent performance benchmark completed by Intel and BlueData using the BigBench benchmarking kit has shown that the performance ratios for container-based Hadoop workloads on BlueData EPIC are equal to and in some cases, better than bare-metal Hadoop [7].

Big Data

Big Data Storage Benchmarking Hardware

New (Old) Paper.

n0derunner

MAY 6, 2019

A 2007 paper, that still has lots to say on the subject of benchmarking storage and filesystems. Primarily aimed at researchers and developers, but is relevant to anyone about to embark on a benchmarking effort. A Nine year study of filesystem and storage benchmarking Download.

Benchmarking

Benchmarking Storage Cache Testing

Towards multiverse databases

The Morning Paper

JUNE 16, 2019

If we do that naively though, we’re going to end up with a lot of universes to store and maintain and the storage requirements alone will be prohibitive. Specifically, scalable, parallel streaming dataflow computing systems now support partially-stateful and dynamically-changing dataflows. It runs to about 2,000 lines of Rust.

Database

Database Cache Benchmarking Efficiency

The Power of Cosmos DB Comes to NServiceBus

Particular Software

OCTOBER 26, 2020

Backed by Cosmos DB, a fully managed, globally distributed, elastically scaled, pay-as-you-go service, your NServiceBus-based systems can benefit from guaranteed single-digit-millisecond latency with 99.999% availability. How does this compare with Azure Storage Persistence?

Azure

Azure Storage Benchmarking Latency

RabbitMQ vs. Kafka: Key Differences

Block Size and Its Impact on Storage Performance

Trending Sources

PostgreSQL Benchmark: ScaleGrid vs. Amazon RDS

Measuring the importance of data quality to causal AI success

What is infrastructure monitoring and why is it mission-critical in the new normal?

Building Netflix’s Distributed Tracing Infrastructure

Netflix at AWS re:Invent 2019

Further improved handling and reliability of OneAgent deployments

Building a Media Understanding Platform for ML Innovations

Maximizing Performance of AWS RDS for MySQL with Dedicated Log Volumes

PostgreSQL vs. Oracle: Difference in Costs, Ease of Use & Functionality

Characterizing, modeling, and benchmarking RocksDB key-value workloads at Facebook

Grafana Dashboards: A PoC Implementing the PostgreSQL Extension pg_stat_monitor

InnoDB Performance Optimization Basics

How To Scale a Single-Host PostgreSQL Database With Citus

Evaluating the Evaluation: A Benchmarking Checklist

Netflix at AWS re:Invent 2019

Netflix at AWS re:Invent 2019

MySQL Key Performance Indicators (KPI) With PMM

Virtual consensus in Delos

RPC vs. Messaging – which is faster?

Percona Monitoring and Management 2 Scaling and Capacity Planning

Redis vs Memcached in 2024

What Is a Workload in Cloud Computing

Lerner?—?using RL agents for test case scheduling

Choosing a cloud DBMS: architectures and tradeoffs

DBaaS vs Self-Managed Cloud Databases

Evaluating the Evaluation: A Benchmarking Checklist

Crucial Redis Monitoring Metrics You Must Watch

WAL Compression in PostgreSQL and Recent Improvements in Version 15

Fine-grained, secure and efficient data provenance on blockchain systems

The top 5 reasons to run your own database benchmarks

The Most Important MySQL Setting

2019 PostgreSQL Trends Report: Private vs. Public Cloud, Migrations, Database Combinations & Top Reasons Used

Is It a Read Intensive or a Write Intensive Workload?

Mergeable replicated data types – Part II

How to Assess MySQL Performance

Azure Virtual Machines for SQL Server Usage

The Ultimate Guide to Database High Availability

HammerDB for Managers

Kubernetes for Big Data Workloads

New (Old) Paper.

Towards multiverse databases

The Power of Cosmos DB Comes to NServiceBus

Stay Connected