Architecture, Latency and Storage - Technology Performance Pulse

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

Dynatrace

APRIL 10, 2025

As an executive, I am always seeking simplicity and efficiency to make sure the architecture of the business is as streamlined as possible. Simplify data ingestion and up-level storage for better, faster querying : With Dynatrace, petabytes of data are always hot for real-time insights, at a cold cost.

Strategy

Strategy Storage Network Architecture

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

This article outlines the key differences in architecture, performance, and use cases to help determine the best fit for your workload. RabbitMQ follows a message broker model with advanced routing, while Kafkas event streaming architecture uses partitioned logs for distributed processing. What is RabbitMQ? What is Apache Kafka?

Latency

Latency Analytics Architecture Storage

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

By: Rajiv Shringi , Oleksii Tkachuk , Kartik Sathyanarayanan Introduction In our previous blog post, we introduced Netflix’s TimeSeries Abstraction , a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.

Latency

Latency Cache Infrastructure Strategy

Efficient Multimodal Data Processing: A Technical Deep Dive

DZone

FEBRUARY 27, 2025

Handling multimodal data spanning text, images, videos, and sensor inputs requires resilient architecture to manage the diversity of formats and scale.

Efficiency

Efficiency Processing Latency Storage

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

Architecture Overview The first pivotal step in managing impressions begins with the creation of a Source-of-Truth (SOT) dataset. The enriched data is seamlessly accessible for both real-time applications via Kafka and historical analysis through storage in an Apache Iceberg table.

Tuning

Tuning Latency Efficiency Storage

Optimizing data warehouse storage

The Netflix TechBlog

DECEMBER 21, 2020

At this scale, we can gain a significant amount of performance and cost benefits by optimizing the storage layout (records, objects, partitions) as the data lands into our warehouse. We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits.

Storage

Storage Latency Efficiency Data Engineering

CDNs: Speed Up Performance by Reducing Latency

DZone

MAY 3, 2023

In the previous posts, we covered things we had to do to upload files on the front end, things we had to do on the back end, and optimizing costs by moving file uploads to object storage.

Latency

Latency Speed Performance Storage

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

These include challenges with tail latency and idempotency, managing “wide” partitions with many rows, handling single large “fat” columns, and slow response pagination. Data Model At its core, the KV abstraction is built around a two-level map architecture. Developers just provide their data problem rather than a database solution!

Latency

Latency Storage Cache Servers

Netflix Cloud Packaging in the Terabyte Era

The Netflix TechBlog

SEPTEMBER 24, 2021

Table 1: Movie and File Size Examples Initial Architecture A simplified view of our initial cloud video processing pipeline is illustrated in the following diagram. Figure 1: A Simplified Video Processing Pipeline With this architecture, chunk encoding is very efficient and processed in distributed cloud computing instances.

Cloud

Cloud Media Storage Cache

Designing Instagram

High Scalability

JANUARY 11, 2022

Architecture. Firstly, the synchronous process which is responsible for uploading image content on file storage, persisting the media metadata in graph data-storage, returning the confirmation message to the user and triggering the process to update the user activity. Sending and receiving messages from other users.

Design

Design Media Storage Logistics

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

Rajiv Shringi Vinay Chella Kaidan Fullerton Oleksii Tkachuk Joey Lynch Introduction As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming , the ability to ingest and store vast amounts of temporal data — often reaching petabytes — with millisecond access latency has become increasingly vital.

Latency

Latency Storage Traffic Tuning

Comparing PostgreSQL DigitalOcean Performance & Pricing – ScaleGrid vs. DigitalOcean Managed Databases

Scalegrid

JUNE 4, 2020

Compare Latency. lower latency compared to DigitalOcean for PostgreSQL. On average, ScaleGrid provides over 30% more storage vs. DigitalOcean for PostgreSQL at the same affordable price. Now, let’s take a look at the throughput and latency performance of our comparison. PostgreSQL DigitalOcean Latency Averages (ms).

Database

Database Latency Benchmarking Performance

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

This decoupling is crucial in modern architectures where scalability and fault tolerance are paramount. The architecture of RabbitMQ is meticulously designed for complex message routing, enabling dynamic and flexible interactions between producers and consumers. Keeping queues short maintains a responsive and efficient RabbitMQ setup.

Best Practices

Best Practices Traffic Strategy Scalability

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

MAY 4, 2023

When undertaking system migrations, one of the main challenges is establishing confidence and seamlessly transitioning the traffic to the upgraded architecture without adversely impacting the customer experience. It provides a good read on the availability and latency ranges under different production conditions.

Traffic

Traffic Latency Tuning Systems

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Dynatrace

OCTOBER 4, 2022

While data lakes and data warehousing architectures are commonly used modes for storing and analyzing data, a data lakehouse is an efficient third way to store and analyze data that unifies the two architectures while preserving the benefits of both. Unlike data warehouses, however, data is not transformed before landing in storage.

Artificial Intelligence

Artificial Intelligence Storage Analytics Government

Improved Alerting with Atlas Streaming Eval

The Netflix TechBlog

APRIL 27, 2023

While Atlas is architected around compute & storage separation, and we could theoretically just scale the query layer to meet the increased query demand, every query, regardless of its type, has a data component that needs to be pushed down to the storage layer.

Storage

Storage Cache Metrics Database

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

Dynatrace

OCTOBER 3, 2024

Trace your application Imagine a microservices architecture with hundreds of dependencies. Without distributed tracing, pinpointing the cause of increased latency could take hours or even days. There is no need to think about schema and indexes, re-hydration, or hot/cold storage.

Performance

Performance Architecture Innovation Latency

What is a Distributed Storage System

Scalegrid

FEBRUARY 8, 2024

A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. Understanding distributed storage is imperative as data volumes and the need for robust storage solutions rise.

Storage

Storage Systems Big Data Azure

Bulldozer: Batch Data Moving from Data Warehouse to Online Key-Value Stores

The Netflix TechBlog

OCTOBER 27, 2020

At Netflix, we also heavily embrace a microservice architecture that emphasizes separation of concerns. The data warehouse is not designed to serve point requests from microservices with low latency. Therefore, we must efficiently move data from the data warehouse to a global, low-latency and highly-reliable key-value store.

Latency

Latency Storage Big Data Tuning

Data ingestion pipeline with Operation Management

The Netflix TechBlog

MARCH 7, 2023

But we cannot search or present low latency retrievals from files Etc. Marken Architecture Marken’s architecture diagram is as follows. Marken Architecture Marken’s architecture diagram is as follows. Using memcache allows us to keep latencies for our search low (most of our queries are less than 100ms).

Media

Media Latency Architecture Database

Unlock the power of contextual log analytics

Dynatrace

OCTOBER 2, 2024

There is no need to think about schema and indexes, re-hydration, or hot/cold storage. This architecture also means you are not required to determine your log data use cases beforehand or while analyzing logs within the new logs app. Keep in mind that Dynatrace Grail is schema-on-read and indexless, built with scaling in mind.

Analytics

Analytics AWS DevOps Cloud

Observability platform vs. observability tools

Dynatrace

DECEMBER 22, 2021

Metrics are measures of critical system values, such as CPU utilization or average write latency to persistent storage. They are particularly important in distributed systems, such as microservices architectures. Observability platforms are becoming essential as the complexity of cloud-native architectures increases.

Artificial Intelligence

Artificial Intelligence Metrics Architecture DevOps

The Anna Key-Value Store Now Has 355x the Performance of DynamoDB for the Dollar

High Scalability

SEPTEMBER 8, 2018

RISELabs , those wonderfully innovative folks over at Berkeley, have uplifted their Anna datatabase —a shared-nothing, thread-per-core architecture to achieve lightning-fast speeds by avoiding all coordination mechanisms—to become cloud-aware. While database neogenesis has slowed down considerably, it has not gone necrotic.

Storage

Storage Performance AWS Media

Consistent caching mechanism in Titus Gateway

The Netflix TechBlog

NOVEMBER 3, 2022

The original assumptions and architectural choices were no longer viable. Overview The figure below depicts a simplified high-level architecture of a single Titus cluster (a.k.a When a new leader is elected it loads all data from external storage. Active data includes jobs and tasks that are currently running.

Cache

Cache Latency Traffic Systems

Netflix Drive

The Netflix TechBlog

MAY 5, 2021

In the future posts, we will do an architectural deep dive into the several components of Netflix Drive. Netflix Drive relies on a data store that will be the persistent storage layer for assets, and a metadata store which will provide a relevant mapping from the file system hierarchy to the data store entities.

Media

Media Storage Architecture Cloud

These 7 Edge Data Challenges Will Test Companies the Most in 2025

VoltDB

DECEMBER 11, 2024

By bringing computation closer to the data source, edge-based deployments reduce latency, enhance real-time capabilities, and optimize network bandwidth. Data Overload and Storage Limitations As IoT and especially industrial IoT -based devices proliferate, the volume of data generated at the edge has skyrocketed.

IoT

IoT Energy Logistics Latency

Uber’s Big Data Platform: 100+ Petabytes with Minute Latency

Uber Engineering

OCTOBER 17, 2018

To accomplish this, Uber relies heavily on making data-driven decisions at every level, from forecasting rider demand during high traffic events to identifying and addressing bottlenecks … The post Uber’s Big Data Platform: 100+ Petabytes with Minute Latency appeared first on Uber Engineering Blog.

Big Data

Big Data Latency Transportation Traffic

Implementing AWS well-architected pillars with automated workflows

Dynatrace

SEPTEMBER 13, 2023

Because Google offers its own Google Cloud Architecture Framework and Microsoft its Azure Well-Architected Framework , organizations that use a combination of these platforms triple the challenge of integrating their performance frameworks into a cohesive strategy. SRG validates the status of the resiliency SLOs for the experiment period.

AWS

AWS Efficiency Azure Cloud

Amazon DynamoDB ? a Fast and Scalable NoSQL Database.

All Things Distributed

JANUARY 18, 2012

Amazon DynamoDB offers low, predictable latencies at any scale. This architectural pattern was a response to the scaling challenges that had challenged Amazon.com through its first 5 years, when direct database access was one of the major bottlenecks in scaling and operating the business. The growth of Amazonâ??s Consistency.

Scalability

Scalability Database Ecommerce Latency

What is ITOps? Why IT operations is more crucial than ever in a multicloud world

Dynatrace

DECEMBER 15, 2022

Besides the traditional system hardware, storage, routers, and software, ITOps also includes virtual components of the network and cloud infrastructure. This includes response time, accuracy, speed, throughput, uptime, CPU utilization, and latency. The primary goal of ITOps is to provide a high-performing, consistent IT environment.

Artificial Intelligence

Artificial Intelligence DevOps Hardware Virtualization

Using Docker To Deploy Neon Serverless PostgreSQL

Percona

MARCH 13, 2023

Architecture To understand more about deployment procedures, we need to look a little more at Neon architecture. Basically, you can view this as a PostgreSQL instance but without a storage layer Storage Broker Storage Broker is a coordination component between WAL Service and Pageserver. 5454 --listen-http=0.0.0.0:7676

Serverless

Serverless C++ Storage Servers

Managing risk for financial services: The secret to visibility and control during times of volatility

Dynatrace

APRIL 8, 2024

Historically, IT infrastructure performance, IT security, data architecture, and data analytics, have been managed in disparate, unconnected silos deep within IT organizational structures. Maximize performance for high-frequency and low-latency trading strategies. Automated issue resolution. Break down data silos.

Analytics

Analytics Infrastructure Efficiency Technology

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

JUNE 13, 2023

By collecting and analyzing key performance metrics of the service over time, we can assess the impact of the new changes and determine if they meet the availability, latency, and performance requirements. A/B testing is also a key technique in migrations where the updates to the architecture involve changing device contracts as well.

Traffic

Traffic Metrics Systems Strategy

Achieving observability in async workflows

The Netflix TechBlog

MAY 14, 2021

Managing and operating asynchronous workflows can be difficult without the proper tools and architecture that puts observability, debugging, and tracing at the forefront. We are expected to process 1,000 watermarks for a single distribution in a minute, with non-linear latency growth as the number of watermarks increases.

Traffic

Traffic Java Latency Google

Evolution of ML Fact Store

The Netflix TechBlog

APRIL 26, 2022

Figure 1: Netflix ML Architecture Fact: A fact is data about our members or videos. The first version of our logger library optimized for storage by deduplicating facts and optimized for network i/o using different compression methods for each fact. An example of video data is video metadata, like the length of a video.

Storage

Storage Design Scalability Latency

Understanding operational 5G: a first measurement study on its coverage, performance and energy consumption

The Morning Paper

OCTOBER 4, 2020

We are standing on the eve of the 5G era… 5G, as a monumental shift in cellular communication technology, holds tremendous potential for spurring innovations across many vertical industries, with its promised multi-Gbps speed, sub-10 ms low latency, and massive connectivity. Throughput and latency. energy consumption).

Energy

Energy Latency Performance Network

Choosing a cloud DBMS: architectures and tradeoffs

The Morning Paper

AUGUST 29, 2019

Choosing a cloud DBMS: architectures and tradeoffs Tan et al., As it is infeasible to test every OLAP system runnable on AWS, we chose widely-used systems that represented a variety of architectures and cost models. Another interesting experiment here compared the effects on performance of different storage types. VLDB’19.

Architecture

Architecture Cloud Storage Serverless

Netflix Video Quality at Scale with Cosmos Microservices

The Netflix TechBlog

NOVEMBER 2, 2021

The Reloaded system is a well-matured and scalable system, but its monolithic architecture can slow down rapid innovation. This enables us to use our scale to increase throughput and reduce latencies. Here, based on the video length, the throughput and latency requirements, available scale etc., via bug fixes).

Media

Media Innovation Metrics Latency

Amazon EC2 Cluster GPU Instances - All Things Distributed

All Things Distributed

NOVEMBER 15, 2010

Building general purpose architectures has always been hard; there are often so many conflicting requirements that you cannot derive an architecture that will serve all, so we have often ended up focusing on one side of the requirements that allow you to serve that area really well.

AWS

AWS Programming Latency Architecture

Titan Graph Database Integration with DynamoDB: World-class Performance, Availability, and Scale for New Workloads

All Things Distributed

AUGUST 20, 2015

Today, we are releasing a plugin that allows customers to use the Titan graph engine with Amazon DynamoDB as the backend storage layer. It opens up the possibility to enjoy the value that graph databases bring to relationship-centric use cases, without worrying about managing the underlying storage. The importance of relationships.

Database

Database Logistics Availability Social Media

Growth Engineering at Netflix?—?Automated Imagery Generation

The Netflix TechBlog

FEBRUARY 9, 2021

Server-generated assets, since client-side generation would require the retrieval of many individual images, which would increase latency and time-to-render. To reduce latency, assets should be generated in an offline fashion and not in real time. This requires an asset storage solution.

Engineering

Engineering Storage Latency Entertainment

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

Key Takeaways Redis offers complex data structures and additional features for versatile data handling, while Memcached excels in simplicity with a fast, multi-threaded architecture for basic caching needs. It uses a hash table to manage these pairs, divided into fixed-size buckets with linked lists for key-value storage.

Cache

Cache Storage Scalability Architecture

Redis® Monitoring Strategies for 2025

Scalegrid

JANUARY 21, 2025

With its widespread use in modern application architectures, understanding the ins and outs of Redis monitoring is essential for any tech professional. Identifying key Redis metrics such as latency, CPU usage, and memory metrics is crucial for effective Redis monitoring. Redis, a powerful in-memory data store, is no exception.

Strategy

Strategy Monitoring Latency DevOps

Redis® Monitoring Strategies for 2024

Scalegrid

DECEMBER 21, 2023

With its widespread use in modern application architectures, understanding the ins and outs of Redis® monitoring is essential for any tech professional. Identifying key Redis® metrics such as latency, CPU usage, and memory metrics is crucial for effective Redis monitoring. Redis®, a powerful in-memory data store, is no exception.

Strategy

Strategy Monitoring Latency DevOps

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

RabbitMQ vs. Kafka: Key Differences

Trending Sources

Netflix’s Distributed Counter Abstraction

Efficient Multimodal Data Processing: A Technical Deep Dive

Introducing Impressions at Netflix

Optimizing data warehouse storage

CDNs: Speed Up Performance by Reducing Latency

Introducing Netflix’s Key-Value Data Abstraction Layer

Netflix Cloud Packaging in the Terabyte Era

Designing Instagram

Introducing Netflix TimeSeries Data Abstraction Layer

Comparing PostgreSQL DigitalOcean Performance & Pricing – ScaleGrid vs. DigitalOcean Managed Databases

Best Practices for Scaling RabbitMQ

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Improved Alerting with Atlas Streaming Eval

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

What is a Distributed Storage System

Bulldozer: Batch Data Moving from Data Warehouse to Online Key-Value Stores

Data ingestion pipeline with Operation Management

Unlock the power of contextual log analytics

Observability platform vs. observability tools

The Anna Key-Value Store Now Has 355x the Performance of DynamoDB for the Dollar

Consistent caching mechanism in Titus Gateway

Netflix Drive

These 7 Edge Data Challenges Will Test Companies the Most in 2025

Uber’s Big Data Platform: 100+ Petabytes with Minute Latency

Implementing AWS well-architected pillars with automated workflows

Amazon DynamoDB ? a Fast and Scalable NoSQL Database.

What is ITOps? Why IT operations is more crucial than ever in a multicloud world

Using Docker To Deploy Neon Serverless PostgreSQL

Managing risk for financial services: The secret to visibility and control during times of volatility

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

Achieving observability in async workflows

Evolution of ML Fact Store

Understanding operational 5G: a first measurement study on its coverage, performance and energy consumption

Choosing a cloud DBMS: architectures and tradeoffs

Netflix Video Quality at Scale with Cosmos Microservices

Amazon EC2 Cluster GPU Instances - All Things Distributed

Titan Graph Database Integration with DynamoDB: World-class Performance, Availability, and Scale for New Workloads

Growth Engineering at Netflix?—?Automated Imagery Generation

Redis vs Memcached in 2024

Redis® Monitoring Strategies for 2025

Redis® Monitoring Strategies for 2024

Stay Connected