
Introducing Configurable Metaflow

The Netflix TechBlog

Practitioners frequently want to experiment with variants of these flows, testing new data, new parameterizations, or new algorithms, while keeping the overall structure of the flow or flows intact. A natural solution is to make flows configurable using configuration files, so variants can be defined without changing the code.
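A minimal sketch of the pattern the post describes, assuming Metaflow's Config object; the configuration keys (algorithm, learning_rate) and file name are hypothetical examples, not taken from the article.

```python
# Sketch of a configurable flow: settings live in a JSON file, so a variant
# means a different config file rather than a code change.
from metaflow import FlowSpec, Config, step


class TrainingFlow(FlowSpec):
    # Assumed usage of Metaflow's Config object; keys below are illustrative.
    config = Config("config", default="config.json")

    @step
    def start(self):
        # Read the experiment variant from the config instead of hardcoding it.
        print("algorithm:", self.config.algorithm)
        print("learning rate:", self.config.learning_rate)
        self.next(self.end)

    @step
    def end(self):
        print("done")


if __name__ == "__main__":
    TrainingFlow()
```

A variant can then be selected at run time by pointing the flow at a different configuration file, leaving the flow structure untouched.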


An Engineer's Guide to AI Code Model Evals

Addy Osmani

When you develop traditional software, you likely write tests to ensure your code works as intended. A crucial part of the process is evaluation, often abbreviated as "evals". In the context of AI models, evals refer to structured tests or benchmarks we use to measure a model's performance on specific tasks.
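A minimal sketch of what such an eval looks like in practice: structured test cases with pass/fail checks, scored against a model. The `generate` function is a hypothetical stand-in for whatever code model is under test, not an API from the article.

```python
# Tiny eval harness: each case pairs a prompt with a pass/fail check,
# and the harness reports the fraction of cases the model passes.
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalCase:
    prompt: str
    check: Callable[[str], bool]  # returns True if the model output passes


def run_evals(generate: Callable[[str], str], cases: list[EvalCase]) -> float:
    passed = sum(1 for case in cases if case.check(generate(case.prompt)))
    return passed / len(cases)


cases = [
    EvalCase("Write a Python function that reverses a string.",
             lambda out: "def" in out and "return" in out),
    EvalCase("What is 2 + 2? Answer with a number only.",
             lambda out: out.strip() == "4"),
]

# score = run_evals(my_model.generate, cases)  # hypothetical model hook
```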



Generative AI in the Real World: Stefania Druga on Designing for the Next Generation

O'Reilly

We have open source models that are multimodal and can run on devices, so you don't need to send your data to the cloud. We first created a benchmark of misconceptions, starting with math, and then tested whether multimodal LLMs can pick up those misconceptions from pictures of kids' handwritten exercises.
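A rough sketch of how such a benchmark could be scored; `detect_misconception` is a hypothetical wrapper around whatever on-device multimodal model is being tested, and the labels are made-up examples, not data from the interview.

```python
# Score a model against a labeled misconception benchmark: each item is an
# image of a handwritten exercise plus the misconception it exhibits (or None).
benchmark = [
    {"image": "exercise_01.png", "misconception": "adds denominators of fractions"},
    {"image": "exercise_02.png", "misconception": None},  # no misconception present
]


def score(detect_misconception, benchmark):
    correct = 0
    for item in benchmark:
        predicted = detect_misconception(item["image"])  # hypothetical model call
        if predicted == item["misconception"]:
            correct += 1
    return correct / len(benchmark)
```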


Improving PHP Performance for Web Applications

KeyCDN

Over time, he added more features to the language, such as dynamic generation of HTML pages, and released it as an open-source project in 1995. A more sensible approach is to conduct tests during the development process; otherwise, you may find yourself rewriting large chunks of code to make your application function properly.


What Comes After the LLM: Human-Centered AI, Spatial Intelligence, and the Future of Practice

O'Reilly

How do we debug or test agents when the output isn't just text but spatial behavior? It emerges from ecosystems: funding systems, research labs, open source communities, and public education. She's not trying to chase benchmarks; she's trying to shape institutions that can adapt over time. It's a transition.


How to understand TPC-C tpmC and TPROC-C NOPM and what is ‘good’ performance?

HammerDB

tpmC is the transactions-per-minute metric measured by the official TPC-C benchmark from the TPC Council. Without exception, the terms TPC-C and tpmC can only be used for official, audited TPC-C benchmarks published by the TPC Council. Why this is the case is straightforward.
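A back-of-the-envelope sketch of the distinction the post draws for HammerDB's TPROC-C numbers: NOPM counts only New Order transactions per minute, while the overall transaction rate counts all database transactions. The figures below are made-up example numbers, not results from the article.

```python
# Illustrative arithmetic only: NOPM = New Orders per minute,
# TPM = all database transactions per minute over the measured interval.
new_orders_completed = 1_200_000   # New Order transactions during the run
total_db_transactions = 2_760_000  # all database transactions during the run
elapsed_minutes = 10

nopm = new_orders_completed / elapsed_minutes
tpm = total_db_transactions / elapsed_minutes

print(f"NOPM: {nopm:,.0f}")  # 120,000
print(f"TPM:  {tpm:,.0f}")   # 276,000
```

Neither figure is a tpmC result; that term is reserved for audited TPC-C publications.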


5 powerful use cases beyond debugging for Dynatrace Live Debugger

Dynatrace

White box testing: the nicest thing about deploying UI changes to production is that you can immediately see the changes in action. You can see when a new version is deployed, test it to ensure everything works as expected, and you're done. Test data collection: accurate test data can mean life or death.