Middleware plays a pivotal role in connecting the components of distributed systems. Efficient database operations in middleware can dramatically improve overall system performance, reduce latency, and enhance user experience.
However, effectively scaling Elasticsearch can be nuanced: it requires a proper understanding of the underlying architecture and its performance tradeoffs. The extra network overhead of coordinating across multiple nodes can easily result in increased latency compared to a single-node architecture where data access is straightforward.
“Latency” is the duration from the execution of a load instruction (to an address that misses in all the caches) to the completion of that load instruction, when the data is returned from memory. The example below is for a 2005-era processor with 60 ns memory latency and 6.4 GB/s of memory bandwidth.
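As a back-of-the-envelope check of why those two numbers matter together, Little's Law gives the concurrency a core must sustain to keep the memory pipeline full. The 6.4 GB/s figure is the bandwidth reconstructed from the excerpt above, and 64-byte cache lines are an assumption:

```python
# Little's Law: concurrency = latency x bandwidth.
# Assumed figures: 60 ns latency, 6.4 GB/s bandwidth, 64 B cache lines.
latency_s = 60e-9          # 60 ns memory latency
bandwidth_bps = 6.4e9      # 6.4 GB/s sustained memory bandwidth
cache_line_bytes = 64      # typical x86 cache line size

in_flight_bytes = latency_s * bandwidth_bps           # 384 bytes
in_flight_lines = in_flight_bytes / cache_line_bytes  # 6 cache lines

print(f"{in_flight_bytes:.0f} bytes = {in_flight_lines:.1f} cache lines in flight")
```

In other words, about six cache-line misses must be in flight at all times just to saturate that memory system.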
This article outlines the key differences in architecture, performance, and use cases to help determine the best fit for your workload. RabbitMQ follows a message broker model with advanced routing, while Kafka’s event streaming architecture uses partitioned logs for distributed processing.
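To make the partitioned-log model concrete, here is a minimal producer sketch using the kafka-python client; the topic name, bootstrap address, and key are illustrative assumptions, not from the article:

```python
# Minimal Kafka producer sketch (kafka-python). Messages with the same key
# land in the same partition, preserving per-key ordering in the log.
from kafka import KafkaProducer

producer = KafkaProducer(bootstrap_servers="localhost:9092")  # assumed address
producer.send(
    "ride-events",                 # hypothetical topic
    key=b"user-42",                # same key -> same partition -> ordered
    value=b'{"event": "ride_requested"}',
)
producer.flush()  # block until the broker acknowledges the batch
```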
Handling multimodal data spanning text, images, videos, and sensor inputs requires a resilient architecture to manage the diversity of formats and scale.
By Rajiv Shringi, Oleksii Tkachuk, and Kartik Sathyanarayanan. In our previous blog post, we introduced Netflix’s TimeSeries Abstraction, a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.
The first pivotal step in managing impressions begins with the creation of a Source-of-Truth (SOT) dataset (see the Impression Source-of-Truth architecture diagram). Maintaining the highest quality of impressions is a top priority.
Timestone: Netflix’s High-Throughput, Low-Latency Priority Queueing System with Built-in Support for Non-Parallelizable Workloads, by Kostas Christidis. Timestone is a high-throughput, low-latency priority queueing system we built in-house to support the needs of Cosmos, our media encoding platform. Over the past 2.5
With the rise of microservices architecture, there has been a rapid acceleration in the modernization of legacy platforms, leveraging cloud infrastructure to deliver highly scalable, low-latency, and more responsive services. Traditional blocking architectures often struggle to sustain performance, especially under high load.
This series has covered uploading files with HTML, uploading files with JavaScript, receiving uploads in Node.js (Nuxt.js), optimizing storage costs with Object Storage, optimizing performance with a CDN, and securing uploads with malware scans. Today, we’ll do more architectural work, but this time it’ll be focused on optimizing performance.
Decoupling producers from consumers is crucial in modern architectures where scalability and fault tolerance are paramount. The architecture of RabbitMQ is meticulously designed for complex message routing, enabling dynamic and flexible interactions between producers and consumers, as sketched below. Keeping queues short maintains a responsive and efficient RabbitMQ setup.
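As a sketch of that routing flexibility, here is a minimal pika example using a topic exchange; the exchange name, routing keys, and connection details are illustrative assumptions:

```python
# Minimal RabbitMQ topic-exchange sketch (pika). The broker routes each
# message to every queue whose binding pattern matches the routing key.
import pika

conn = pika.BlockingConnection(pika.ConnectionParameters("localhost"))  # assumed host
ch = conn.channel()

ch.exchange_declare(exchange="orders", exchange_type="topic")  # hypothetical exchange
ch.queue_declare(queue="eu-orders")
ch.queue_bind(queue="eu-orders", exchange="orders", routing_key="order.eu.*")

# Matches the binding pattern above, so it is delivered to eu-orders.
ch.basic_publish(exchange="orders", routing_key="order.eu.created", body=b"{}")
conn.close()
```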
Multi-layered caching involves using multiple levels of cache to store and retrieve data. Leveraging this hierarchical structure can significantly reduce latency and improve overall performance.
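A minimal sketch of a two-level cache, assuming an in-process dict as L1 and Redis (via redis-py) as L2; the TTL and the load_from_db fallback are hypothetical:

```python
# Two-level cache sketch: in-process L1 dict, shared Redis L2, database last.
import redis

r = redis.Redis(host="localhost", port=6379)  # assumed L2 cache
l1: dict[str, bytes] = {}                     # per-process L1 cache

def load_from_db(key: str) -> bytes:
    # Hypothetical stand-in for the real database read.
    return b"value-for-" + key.encode()

def get(key: str) -> bytes:
    if key in l1:                  # L1 hit: nanoseconds, no network hop
        return l1[key]
    value = r.get(key)             # L2 hit: one network round trip
    if value is None:
        value = load_from_db(key)  # miss: fall through to the database
        r.setex(key, 300, value)   # populate L2 with a 5-minute TTL
    l1[key] = value                # populate L1 for subsequent calls
    return value
```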
Putting an external cache in front of the database is commonly used to compensate for subpar latency stemming from various factors, such as inefficient database internals, driver usage, infrastructure choices, traffic spikes, and so on. Yet external caches can also be one of the more problematic components of a distributed application architecture.
The service should serve real-time UI applications, so CRUD and search operations must be achieved with low latency. It will be used by many internal UI applications, and search latency for generic text queries is in the milliseconds.
When undertaking system migrations, one of the main challenges is establishing confidence and seamlessly transitioning the traffic to the upgraded architecture without adversely impacting the customer experience. It provides a good read on the availability and latency ranges under different production conditions.
These include challenges with tail latency and idempotency, managing “wide” partitions with many rows, handling single large “fat” columns, and slow response pagination. At its core, the KV abstraction is built around a two-level map architecture, which is useful for keeping the “n-newest” records or performing prefix path deletion.
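A minimal sketch of what a two-level map can look like, assuming a record ID as the outer key and a sorted inner map of item keys; the class and method names are hypothetical, not Netflix’s actual API:

```python
# Two-level map sketch: outer key = record ID, inner key = sorted item key.
from bisect import bisect_left
from collections import defaultdict

class TwoLevelKV:
    def __init__(self):
        # id_key -> sorted list of (item_key, value) pairs
        self._data: dict[str, list[tuple[str, bytes]]] = defaultdict(list)

    def put(self, id_key: str, item_key: str, value: bytes) -> None:
        items = self._data[id_key]
        i = bisect_left(items, (item_key,))
        if i < len(items) and items[i][0] == item_key:
            items[i] = (item_key, value)        # overwrite existing item
        else:
            items.insert(i, (item_key, value))  # keep inner map sorted

    def get_range(self, id_key: str, prefix: str) -> list[tuple[str, bytes]]:
        # Sorted inner keys make prefix scans (and prefix deletes) cheap.
        return [(k, v) for k, v in self._data[id_key] if k.startswith(prefix)]
```

Keeping the inner map sorted is what makes “n-newest” reads and prefix deletions cheap range operations rather than full scans.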
The new Amazon capability enables customers to improve the startup latency of their functions from several seconds to as low as sub-second (up to 10 times faster) at P99 (the 99th latency percentile). This can cause latency outliers and may lead to a poor end-user experience for latency-sensitive applications.
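Assuming the capability described is AWS Lambda SnapStart, enabling it is a one-call configuration change via boto3; the function name here is illustrative:

```python
# Enable Lambda SnapStart on published versions to cut cold-start latency.
import boto3

client = boto3.client("lambda")
client.update_function_configuration(
    FunctionName="my-latency-sensitive-fn",      # hypothetical function
    SnapStart={"ApplyOn": "PublishedVersions"},  # snapshot initialized state
)
# New published versions are then restored from a snapshot instead of
# running full initialization on every cold start.
```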
As more organizations embrace microservices-based architecture to deliver goods and services digitally, maintaining customer satisfaction has become exponentially more challenging, so define SLOs for each service. Latency is the time that it takes a request to be served; this is what Dynatrace captures as response time.
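As a sketch of turning that definition into an SLO check, here is a percentile computation over observed response times; the 300 ms target and the sample values are assumptions:

```python
# Check a latency SLO: "95% of requests are served in under 300 ms".
import statistics

response_times_ms = [120, 95, 310, 180, 250, 90, 400, 170, 130, 220]  # sample data
slo_target_ms = 300

# quantiles(n=100) returns the 1st..99th percentile cut points.
p95 = statistics.quantiles(response_times_ms, n=100)[94]
print(f"p95 = {p95:.0f} ms -> SLO {'met' if p95 <= slo_target_ms else 'violated'}")
```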
Compare latency: ScaleGrid delivers lower latency than DigitalOcean for PostgreSQL. Now, let’s take a look at the throughput and latency performance of our comparison. Next, we are going to test and compare the latency performance between ScaleGrid and DigitalOcean for PostgreSQL, reporting latency averages in milliseconds.
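A minimal way to take such latency averages yourself, assuming a psycopg2 connection and a trivial query; the DSN is a placeholder:

```python
# Measure average round-trip latency for a trivial PostgreSQL query.
import time
import psycopg2

conn = psycopg2.connect("postgresql://user:pass@localhost:5432/testdb")  # placeholder DSN
cur = conn.cursor()

samples = []
for _ in range(100):
    start = time.perf_counter()
    cur.execute("SELECT 1;")   # round trip dominated by network + parse cost
    cur.fetchone()
    samples.append((time.perf_counter() - start) * 1000)  # ms

print(f"avg latency: {sum(samples) / len(samples):.2f} ms")
conn.close()
```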
Cloud-based application architectures commonly leverage microservices. You receive an alert message from Dynatrace (your infrastructure observability hub) letting you know that the average response latency of all deployed APIs has tripled, with high latency or a lack of responses and a soaring number of active connections.
As companies accelerate digital transformation, cloud services such as AWS Lambda help companies to modernize their application architectures to quickly adapt to the needs of their customers while offloading the operational complexity to their cloud vendor.
Example 1: Architecture boundaries. First, they took a big step back and looked at their end-to-end architecture (Figure 2), then defined an SLO dashboard along each architectural boundary. In their new dashboard, they added dimensions for load, latency, and open problems for each component. Not all attempts succeed on the first try.
One approach to such a challenging scenario is stream processing, a computing paradigm and software architectural style for data-intensive software systems that emerged to cope with requirements for near real-time processing of massive amounts of data.
The following figure shows the high-level architecture, where any load testing solution can drive traffic. The optimization goal was to improve application efficiency, that is, to improve the ratio between service throughput and cloud costs while not increasing application latency (e.g., below 500 ms) or error rates (e.g., lower than 2%).
Reduced latency: serverless architecture makes it possible to host code anywhere, rather than relying on an origin server, and by using cloud providers with multiple server sites, organizations can reduce function latency for end users. It also optimizes resources, though it brings architectural complexity and can be difficult to monitor.
Allegro experimented with different performance optimization options to improve Apache Kafka producer tail latency and eventually switched all its clusters to the XFS filesystem. The company used Kafka protocol sniffing, JVM profiling, and eBPF, which proved instrumental in identifying and eliminating performance bottlenecks.
Trace your application: imagine a microservices architecture with hundreds of dependencies. Without distributed tracing, pinpointing the cause of increased latency could take hours or even days. Interact with data intuitively and easily and benefit from immediate, AI-supported insights.
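As a generic illustration of distributed tracing instrumentation (using the OpenTelemetry API rather than any Dynatrace-specific one; the service and span names are invented):

```python
# Generic tracing sketch with the OpenTelemetry API: each service wraps its
# work in spans so a backend can reconstruct the end-to-end request path.
from opentelemetry import trace

tracer = trace.get_tracer("checkout-service")  # hypothetical service name

def handle_checkout(order_id: str) -> None:
    with tracer.start_as_current_span("handle_checkout") as span:
        span.set_attribute("order.id", order_id)
        with tracer.start_as_current_span("reserve_inventory"):
            pass  # downstream call; its latency is recorded on this span
        with tracer.start_as_current_span("charge_payment"):
            pass  # a slow span here pinpoints the latency source

handle_checkout("ord-123")
```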
Table 1 gives movie and file size examples. A simplified view of our initial cloud video processing pipeline is illustrated in the following diagram (Figure 1: A Simplified Video Processing Pipeline). With this architecture, chunk encoding is very efficient and processed in distributed cloud computing instances.
The original assumptions and architectural choices were no longer viable: we started seeing increased response latencies and leader servers running at dangerously high utilization. The figure below depicts a simplified high-level architecture of a single Titus cluster.
With the rapid growth in the Netflix member base and the increasing complexity of our systems, our architecture has evolved into an asynchronous one that enables both online and offline computation. As shown in the diagram above, the RENO service can be broken down into the following components.
To accomplish this, Uber relies heavily on making data-driven decisions at every level, from forecasting rider demand during high-traffic events to identifying and addressing bottlenecks, as described in “Uber’s Big Data Platform: 100+ Petabytes with Minute Latency” on the Uber Engineering Blog.
Moving to a multithreaded architecture would require extensive rewrites. But the process-per-connection model causes a problem with PostgreSQL’s architecture: forking a process becomes expensive when transactions are very short, as the common wisdom dictates they should be. This is where a connection pool comes in.
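A minimal sketch of client-side pooling with psycopg2, which amortizes the fork-and-connect cost across many short transactions; the DSN and pool sizes are assumptions, and in production this role is often played by PgBouncer instead:

```python
# Reuse a fixed set of PostgreSQL connections instead of forking per request.
from psycopg2.pool import SimpleConnectionPool

pool = SimpleConnectionPool(
    minconn=2,
    maxconn=10,  # assumed sizing; tune to your workload
    dsn="postgresql://user:pass@localhost:5432/appdb",  # placeholder DSN
)

def run_short_transaction(user_id: int):
    conn = pool.getconn()  # borrow an already-open connection
    try:
        with conn, conn.cursor() as cur:  # `with conn` commits or rolls back
            cur.execute("SELECT name FROM users WHERE id = %s", (user_id,))
            return cur.fetchone()
    finally:
        pool.putconn(conn)  # return it for the next caller

print(run_short_transaction(1))
```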
Within this paradigm, it is possible to run entire architectures without touching a traditional virtual server, either locally or in the cloud. In a serverless architecture, applications are distributed to meet demand and scale requirements efficiently. When an application is triggered after sitting idle, the startup (a “cold start”) can add noticeable latency.
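One common mitigation is to do expensive initialization once per container rather than once per invocation; here is a minimal AWS Lambda-style sketch, with the table name and event shape as assumptions:

```python
# Initialize clients at module load: this runs once per cold start,
# then every warm invocation reuses the same connection.
import boto3

dynamodb = boto3.resource("dynamodb")  # created once per container
table = dynamodb.Table("orders")       # hypothetical table name

def handler(event, context):
    # Per-invocation work only; no client setup on the hot path.
    item = table.get_item(Key={"order_id": event["order_id"]})
    return item.get("Item", {})
```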
As a discipline, SRE focuses on improving software system reliability across key categories including availability, performance, latency, efficiency, capacity, and incident response. At a system level, SRE specialists develop tooling that coordinates releases and launches, evaluates system architecture readiness, and meets system-wide SLOs.
By Rajiv Shringi, Vinay Chella, Kaidan Fullerton, Oleksii Tkachuk, and Joey Lynch. As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming, the ability to ingest and store vast amounts of temporal data (often reaching petabytes) with millisecond access latency has become increasingly vital.
Because of its scalability and distributed architecture, thousands of companies trust Cassandra to run their cloud and hybrid-based workloads at high availability without compromising performance. With the Dynatrace Data Explorer, you can easily analyze metrics, such as client read/write latency by Cassandra nodes and disk space usage by keyspaces.
It supports both high throughput services that consume hundreds of thousands of CPUs at a time, and latency-sensitive workloads where humans are waiting for the results of a computation. The subsystems all communicate with each other asynchronously via Timestone, a high-scale, low-latency priority queuing system.
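To illustrate the core idea of priority queueing (not Timestone’s actual implementation), here is a minimal sketch with Python’s heapq; the priorities and payloads are invented:

```python
# Minimal priority queue sketch with heapq: lowest priority value pops first.
import heapq
import itertools

queue: list[tuple[int, int, str]] = []
counter = itertools.count()  # tie-breaker keeps insertion order for equal priorities

def enqueue(priority: int, task: str) -> None:
    heapq.heappush(queue, (priority, next(counter), task))

def dequeue() -> str:
    _, _, task = heapq.heappop(queue)
    return task

enqueue(2, "encode-chunk-17")
enqueue(0, "human-waiting-preview")  # latency-sensitive work jumps the line
enqueue(1, "batch-reencode")
print(dequeue())  # -> human-waiting-preview
```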
This architecture shift greatly reduced processing latency and increased system resiliency. We expanded pipeline support to serve our studio/content-development use cases, which had different latency and resiliency requirements compared to the traditional streaming use case. The first step of the pipeline divides the input video into small chunks.
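A toy sketch of that chunked model, fanning chunk work out to a pool of workers with concurrent.futures; the chunk size and the encode stub are invented stand-ins:

```python
# Toy chunked pipeline: split the input, encode chunks in parallel, reassemble.
from concurrent.futures import ProcessPoolExecutor

def split_into_chunks(data: bytes, chunk_size: int = 4) -> list[bytes]:
    return [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]

def encode_chunk(chunk: bytes) -> bytes:
    # Stand-in for real video encoding of one independent chunk.
    return chunk[::-1]

if __name__ == "__main__":
    video = b"0123456789abcdef"
    chunks = split_into_chunks(video)
    with ProcessPoolExecutor() as pool:
        encoded = list(pool.map(encode_chunk, chunks))  # order is preserved
    print(b"".join(encoded))
```

Because each chunk is independent, the slowest chunk (not the whole title) bounds the latency of an encode pass.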
But we cannot search or present low-latency retrievals from files. Marken’s architecture diagram is as follows. Using memcache allows us to keep latencies for our search low (most of our queries are less than 100 ms).
Customers can use AWS Lambda Response Streaming to improve Time to First Byte (TTFB) performance for latency-sensitive applications and to return larger payload sizes.
Organizations are rapidly adopting multicloud architectures to achieve the agility needed to drive customer success through new digital service channels. For example, if there is a latency on a particular service, Dynatrace will flag this and trace its source – even if the source is a third party.
Reduced tail latencies: in both our gRPC and DGS Framework services, GC pauses are a significant source of tail latencies. We considered the overhead an acceptable trade-off, as avoiding pauses provided benefits that would outweigh it; in fact, we’ve found for our services and architecture that there is no such trade-off.
As dynamic systems architectures increase in complexity and scale, IT teams face mounting pressure to track and respond to conditions and issues across their multi-cloud environments. As teams begin collecting and working with observability data, they are also realizing its benefits to the business, not just IT.