Performance, Storage and Tuning - Technology Performance Pulse

Catching up with OpenTelemetry in 2025

Dynatrace

FEBRUARY 27, 2025

OpenTelemetry is enhancing GenAI observability : By defining semantic conventions for GenAI and implementing Python-based instrumentation for OpenAI, OpenTel is moving towards addressing GenAI monitoring and performance tuning needs. The Collector is expected to be ready for prime time in 2025, reaching the v1.0

Tuning

Tuning Open Source Innovation Monitoring

Dynatrace + Metis: Helping developers & SREs solve Database issues with AI

Dynatrace

MARCH 5, 2025

Site Reliability Engineers (SREs) also face significant challenges in maintaining database reliability, ensuring performance, and preventing disruptions in highly dynamic and distributed environments. Why this matters Databases are the backbone of modern applications, but they can also be a major source of performance bottlenecks.

Database

Database Development Tuning DevOps

Optimizing data warehouse storage

The Netflix TechBlog

DECEMBER 21, 2020

At this scale, we can gain a significant amount of performance and cost benefits by optimizing the storage layout (records, objects, partitions) as the data lands into our warehouse. We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits.

Storage

Storage Latency Efficiency Data Engineering

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

The enriched data is seamlessly accessible for both real-time applications via Kafka and historical analysis through storage in an Apache Iceberg table. Automating Performance Tuning with Autoscalers Tuning the performance of our Apache Flink jobs is currently a manual process.

Tuning

Tuning Latency Efficiency Storage

Speed Trino Queries With These Performance-Tuning Tips

DZone

NOVEMBER 27, 2023

An open-source distributed SQL query engine, Trino is widely used for data analytics on distributed data storage. In this article, we will show you how to tune Trino by helping you identify performance bottlenecks and provide tuning tips that you can practice. But how do we do that?

Tuning

Tuning Speed Performance Open Source

Notes on tuning postgres for cpu and memory benchmarking

n0derunner

OCTOBER 18, 2024

Recently I wanted to measure the impact of NUMA placement and Hugepages on the performance of postgres running in a VM on a Nutanix node. To do this I needed to drive postgres to do real transactions but have very little jitter/noise from the filesystem and storage.

Benchmarking

Benchmarking Tuning Storage Performance

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

This counting service, built on top of the TimeSeries Abstraction, enables distributed counting at scale while maintaining similar low latency performance. After selecting a mode, users can interact with APIs without needing to worry about the underlying storage mechanisms and counting methods.

Latency

Latency Cache Infrastructure Strategy

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

This article outlines the key differences in architecture, performance, and use cases to help determine the best fit for your workload. Message brokers handle validation, routing, storage, and delivery, ensuring efficient and reliable communication. What is RabbitMQ?

Latency

Latency Analytics Architecture Storage

The Challenges of Ajax CDN

DZone

AUGUST 4, 2022

For the longest time, hosting static files on CDNs was the de facto standard for performance tuning website pages. The host offered browser caching advantages, better stability, and storage on fast edge servers across strategic geolocations. Not only did it have performance benefits, but it was also convenient for developers.

Cache

Cache Tuning Storage Website

Comparing PostgreSQL DigitalOcean Performance & Pricing – ScaleGrid vs. DigitalOcean Managed Databases

Scalegrid

JUNE 4, 2020

In this post, we are going to compare the performance and pricing of DigitalOcean PostgreSQL vs. ScaleGrid PostgreSQL to help you determine the best PostgreSQL hosting service on DigitalOcean. On average, ScaleGrid provides over 30% more storage vs. DigitalOcean for PostgreSQL at the same affordable price. Compare Pricing. Single Node.

Database

Database Latency Benchmarking Performance

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

As Netflix scaled, we faced the mounting challenge of providing accurate, timely answers to increasingly complex queries about title performance and discoverability. By logging all titles as they are displayed, we can process these logs to identify anomalies and gain insights into system performance.

Traffic

Traffic Scalability Strategy Monitoring

Metadata Synchronization in Alluxio: Design, Implementation, and Optimization

DZone

DECEMBER 14, 2021

Metadata synchronization (sync) is a core feature in Alluxio that keeps files and directories consistent with their source of truth in under-storage systems, thus making it simple for users to reason the data retrieved from Alluxio. Meanwhile, understanding the internal process is important in order to tune the performance.

Design

Design Storage Tuning Systems

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

Data storage and distribution through HollowFeeds Netflix Hollow is an Open Source java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high performance read-only access.

Traffic

Traffic Strategy Entertainment Innovation

Best MySQL DigitalOcean Performance – ScaleGrid vs. DigitalOcean Managed Databases

Scalegrid

JUNE 22, 2020

ScaleGrid provides 30% more storage on average vs. DigitalOcean for MySQL at the same affordable price. MySQL DigitalOcean Performance Benchmark. We are going to use a common, popular plan size using the below configurations for this performance benchmark: Comparison Overview. Compare Pricing. DigitalOcean. Instance Type.

Database

Database Benchmarking Latency Performance

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

Firstly, developers struggled to reason about consistency, durability and performance in this complex global deployment across multiple stores. This flexibility allows our Data Platform to route different use cases to the most suitable storage system based on performance, durability, and consistency needs.

Latency

Latency Storage Cache Efficiency

Storage Strategies for PostgreSQL on Kubernetes

Percona

DECEMBER 11, 2023

There are a wealth of options on how you can approach storage configuration in Percona Operator for PostgreSQL , and in this blog post, we review various storage strategies — from basics to more sophisticated use cases. For example, you can choose the public cloud storage type – gp3, io2, etc, or set file system.

Storage

Storage Strategy Cloud Azure

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

Flexible Storage : The service is designed to integrate with various storage backends, including Apache Cassandra and Elasticsearch , allowing Netflix to customize storage solutions based on specific use case requirements. DistinctAggregation) , this endpoint performs the given aggregation within a given time interval.

Latency

Latency Storage Traffic Tuning

InnoDB Performance Optimization Basics

Percona

MARCH 23, 2023

This blog is in reference to our previous ones for ‘Innodb Performance Optimizations Basics’ 2007 and 2013. Although there have been many blogs about adjusting MySQL variables for better performance since then, I think this topic deserves a blog update since the last update was a decade ago, and MySQL 5.7

Performance

Performance Hardware Tuning Storage

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

MAY 4, 2023

Migrating Critical Traffic At Scale with No Downtime — Part 1 Shyam Gala , Javier Fernandez-Ivern , Anup Rokkam Pratap , Devang Shah Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This approach has a handful of benefits.

Traffic

Traffic Latency Tuning Systems

Privacy spotlight: Ensure compliance by hard deleting individual records in Grail

Dynatrace

JULY 11, 2024

Dynatrace Grail™ is a data lakehouse optimized for high performance, automated data collection and processing, and queries of petabytes of data in real time. You can use the Grail Storage Record Deletion API to trigger a deletion request. To delete the records, use the Storage Record Deletion API.

Storage

Storage Best Practices Government Media

Netflix Cloud Packaging in the Terabyte Era

The Netflix TechBlog

SEPTEMBER 24, 2021

The ProRes codec family provides great editing performance and image quality. From chunk encoding to assembly and packaging, the result of each previous processing step must be uploaded to cloud storage and then downloaded by the next processing step. Uploading and downloading data always come with a penalty, namely latency.

Cloud

Cloud Media Storage Cache

Bulldozer: Batch Data Moving from Data Warehouse to Online Key-Value Stores

The Netflix TechBlog

OCTOBER 27, 2020

Iceberg is widely adopted in Netflix as a data warehouse table format that addresses many of the usability and performance problems with Hive tables. Then the KV DAL handles writing to the appropriate underlying storage engines depending on latency, availability, cost, and durability requirements.

Latency

Latency Storage Big Data Tuning

MySQL General Tablespaces: A Powerful Storage Option for Your Data

Percona

JANUARY 4, 2024

Managing storage and performance efficiently in your MySQL database is crucial, and general tablespaces offer flexibility in achieving this. In contrast to the single system tablespace that holds system tables by default, general tablespaces are user-defined storage containers for multiple InnoDB tables.

Storage

Storage Engineering Database Open Source

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

Scaling RabbitMQ ensures your system can handle growing traffic and maintain high performance. Optimizing RabbitMQ performance through strategies such as keeping queues short, enabling lazy queues, and monitoring health checks is essential for maintaining system efficiency and effectively managing high traffic loads.

Best Practices

Best Practices Traffic Strategy Efficiency

Faster time to value with enhanced handling of OneAgent runtime data

Dynatrace

SEPTEMBER 23, 2020

Storage mount points in a system might be larger or smaller, local or remote, with high or low latency, and various speeds. Sometimes these locations landed on mount points which, due to capacity, availability, or access constraints, weren’t well suited for large runtime storage. Customizable location of large runtime files.

Storage

Storage Latency Operating System Network

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

Our distributed tracing infrastructure is grouped into three sections: tracer library instrumentation, stream processing, and storage. An additional implication of a lenient sampling policy is the need for scalable stream processing and storage infrastructure fleets to handle increased data volume. Storage: don’t break the bank!

Infrastructure

Infrastructure Transportation Storage Open Source

Conducting log analysis with an observability platform and full data context

Dynatrace

APRIL 20, 2023

At Dynatrace Perform 2023 , Maciej Pawlowski, senior director of product management for infrastructure monitoring at Dynatrace, and a senior software engineer at a U.K.-based Business leaders can decide which logs they want to use and tune storage to their data needs. Seamless integration.

Analytics

Analytics Infrastructure Storage Architecture

PostgreSQL vs. Oracle: Difference in Costs, Ease of Use & Functionality

Scalegrid

JULY 13, 2020

Compare ease of use across compatibility, extensions, tuning, operating systems, languages and support providers. There are a wide range of tools and extensions for every conceivable scenario, like performance profiling, auditing, etc. pg_repack – reorganizes tables online to reclaim storage. Compare Ease of Use.

Open Source

Open Source Tuning C++ Database

Dynatrace log collection for ARM unlocks power-efficient architecture for your enterprise

Dynatrace

DECEMBER 5, 2023

This growth was spurred by mobile ecosystems with Android and iOS operating systems, where ARM has a unique advantage in energy efficiency while offering high performance. Huge performance leaps in recent years The top priority is often performance, where ARM resources have improved significantly.

Efficiency

Efficiency Architecture Energy Monitoring

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

This challenge has given rise to the discipline of observability engineering, which concentrates on the details of telemetry data to fine-tune observability use cases. But often, we use additional services and solutions within our environment for backups, storage, networking, and more. Observability engineering success!

Metrics

Metrics Engineering Energy Tuning

Building a Media Understanding Platform for ML Innovations

The Netflix TechBlog

MARCH 14, 2023

In addition, we were able to perform a handful of A/B tests to validate or negate our hypotheses for tuning the search experience. Users have flexibility to perform multimodal search with input being a simple text term, image or short video. This service leverages Cassandra and Elasticsearch for data storage and retrieval.

Media

Media Innovation Energy Architecture

PostgreSQL Indexes Can Hurt You: Negative Effects and the Costs Involved

Percona

APRIL 24, 2023

Indexes are generally considered to be the panacea when it comes to SQL performance tuning, and PostgreSQL supports different types of indexes catering to different use cases. I keep seeing many articles and talks on “tuning” discussing how creating new indexes speeds up SQL but rarely ones discussing removing them.

Tuning

Tuning Cache Storage Database

No need to compromise visibility in public clouds with new Azure services supported by Dynatrace (Part 2)

Dynatrace

AUGUST 28, 2020

Azure Data Lake Storage Gen1. Understand the performance and status of Azure Logic Apps workflows. We’ll release additional monitoring support for new services soon, so stay tuned for further updates. Azure Logic Apps. Azure Container Instance. Azure Data Factory v1. Azure Data Factory v2. Azure Data Lake Analytics. What’s next?

Azure

Azure Cloud Tuning Monitoring

Using SLOs to become the optimization athlete with Dynatrace

Dynatrace

JUNE 8, 2021

This post was co-authored by Jean-Louis Lormeau, Digital Performance Architect at Dynatrace. . You’ll learn how to create production SLOs, to continuously improve the performance of services, and I’ll guide you on how to become a champion of your sport by: Creating calculated metrics with the help of multidimensional analysis.

Metrics

Metrics Tuning Programming Systems

5 SRE best practices you can implement today

Dynatrace

JULY 6, 2022

Virtualization has revolutionized system administration by making it possible for software to manage systems, storage, and networks. By removing physical dependencies, automation can help perform SRE at scale. Design, implement, and tune effective SLOs. This number will likely increase as the SRE discipline matures.

Best Practices

Best Practices Open Source Tuning Infrastructure

Understand and replay iOS app crashes with Session Replay

Dynatrace

FEBRUARY 9, 2021

As an app developer analyzing a crash detected by Dynatrace, you can see the sequence of steps that were performed by the user along with the crash stack trace: With this information in hand, you can immediately see in which method the error occurred. The masking API we provide allows you to fine-tune the masking configuration to your needs.

Mobile

Mobile Tuning Monitoring Storage

Top 10 Tips for Making the Spark + Alluxio Stack Blazing Fast

DZone

JULY 10, 2019

In addition, compute and storage are increasingly being separated causing larger latencies for queries. Alluxio is leveraged as compute-side virtual storage to improve performance. But to get the best performance, like any technology stack, you need to follow the best practices. A Note on Data Locality.

Best Practices

Best Practices Storage Latency Tuning

New Prometheus-based extensions enable intelligent observability for more than 200 additional technologies

Dynatrace

FEBRUARY 14, 2022

Among these, you can find essential elements of application and infrastructure stacks, from app gateways (like HAProxy), through app fabric (like RabbitMQ), to databases (like MongoDB) and storage systems (like NetApp, Consul, Memcached, and InfluxDB, just to name a few). documentation. Prometheus Data Source documentation.

Technology

Technology Technology Metrics Infrastructure

How Bloom Filters Work in MyRocks

Percona

FEBRUARY 15, 2023

Tuning In terms of tuning, two parameters can be tuned, the size of the bitmap and the number of bits set by every value. For good performance, the filter blocks are cached in the RocksDB block cache and normally stay there since they are accessed frequently. In most cases a bitmap of a few hundred bytes is sufficient.

Storage

Storage Tuning Cache Engineering

Easy SLA and SLO reporting for all your API endpoints with public synthetic HTTP monitors

Dynatrace

JUNE 26, 2020

Dynatrace Digital Experience Monitoring , as part of the Dynatrace Software Intelligence Platform, connects front-end monitoring and the outside-in user perspective with application performance to understand the impact of performance issues across your full stack on user experience and business outcomes. So stay tuned!

Monitoring

Monitoring Azure AWS Traffic

Data lakehouse innovations advance the three pillars of observability for more collaborative analytics

Dynatrace

FEBRUARY 16, 2023

In his keynote address on the first day of Perform 2023 in Las Vegas, Dynatrace Chief Technology Officer Bernd Greifeneder and his colleagues discussed how organizations struggle with this problem and how Dynatrace is meeting the moment. Grail combines the big-data storage of a data warehouse with the analytical flexibility of a data lake.

Analytics

Analytics Innovation Metrics Database

Why log monitoring and log analytics matter in a hyperscale world

Dynatrace

NOVEMBER 15, 2021

With the help of log monitoring software, teams can collect information and trigger alerts if something happens that affects system performance and health. Log monitoring and analytics work in conjunction to ensure an application is performing as it should be, and to determine how a system could be improved. Increased collaboration.

Analytics

Analytics Monitoring DevOps Artificial Intelligence

How digital experience monitoring helps deliver business observability

Dynatrace

APRIL 26, 2022

Understanding why a user is experiencing transactional or performance issues enables organizations to achieve greater observability that goes beyond metrics, traces and logs. Digital experience monitoring (DEM) allows an organization to optimize customer experiences by taking into account the context surrounding digital experience metrics.

Monitoring

Monitoring Social Media IoT Metrics

Building In-Video Search

The Netflix TechBlog

NOVEMBER 6, 2023

We have built an internal system that allows someone to perform in-video search across the entire Netflix video catalog, and we’d like to share our experience in building this system. During GPU computation, we stream mp4 video shots from S3 directly to the GPUs using a data loader that performs prefetching and preprocessing.

Media

Media Social Media Tuning Engineering

Catching up with OpenTelemetry in 2025

Dynatrace + Metis: Helping developers & SREs solve Database issues with AI

Trending Sources

Optimizing data warehouse storage

Introducing Impressions at Netflix

Speed Trino Queries With These Performance-Tuning Tips

Notes on tuning postgres for cpu and memory benchmarking

Netflix’s Distributed Counter Abstraction

RabbitMQ vs. Kafka: Key Differences

The Challenges of Ajax CDN

Comparing PostgreSQL DigitalOcean Performance & Pricing – ScaleGrid vs. DigitalOcean Managed Databases

Title Launch Observability at Netflix Scale

Metadata Synchronization in Alluxio: Design, Implementation, and Optimization

Title Launch Observability at Netflix Scale

Best MySQL DigitalOcean Performance – ScaleGrid vs. DigitalOcean Managed Databases

Introducing Netflix’s Key-Value Data Abstraction Layer

Storage Strategies for PostgreSQL on Kubernetes

Introducing Netflix TimeSeries Data Abstraction Layer

InnoDB Performance Optimization Basics

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

Privacy spotlight: Ensure compliance by hard deleting individual records in Grail

Netflix Cloud Packaging in the Terabyte Era

Bulldozer: Batch Data Moving from Data Warehouse to Online Key-Value Stores

MySQL General Tablespaces: A Powerful Storage Option for Your Data

Best Practices for Scaling RabbitMQ

Faster time to value with enhanced handling of OneAgent runtime data

Building Netflix’s Distributed Tracing Infrastructure

Conducting log analysis with an observability platform and full data context

PostgreSQL vs. Oracle: Difference in Costs, Ease of Use & Functionality

Dynatrace log collection for ARM unlocks power-efficient architecture for your enterprise

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Building a Media Understanding Platform for ML Innovations

PostgreSQL Indexes Can Hurt You: Negative Effects and the Costs Involved

No need to compromise visibility in public clouds with new Azure services supported by Dynatrace (Part 2)

Using SLOs to become the optimization athlete with Dynatrace

5 SRE best practices you can implement today

Understand and replay iOS app crashes with Session Replay

Top 10 Tips for Making the Spark + Alluxio Stack Blazing Fast

New Prometheus-based extensions enable intelligent observability for more than 200 additional technologies

How Bloom Filters Work in MyRocks

Easy SLA and SLO reporting for all your API endpoints with public synthetic HTTP monitors

Data lakehouse innovations advance the three pillars of observability for more collaborative analytics

Why log monitoring and log analytics matter in a hyperscale world

How digital experience monitoring helps deliver business observability

Building In-Video Search

Stay Connected