This article outlines the key differences in architecture, performance, and use cases between RabbitMQ and Apache Kafka to help determine the best fit for your workload. RabbitMQ follows a message broker model with advanced routing, while Kafka's event streaming architecture uses partitioned logs for distributed processing. What is RabbitMQ? What is Apache Kafka?
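To make the contrast concrete, here is a minimal sketch (plain Python, no client libraries; all names are illustrative) of Kafka's core idea: producers append records to a partition chosen by hashing the key, and consumers read each partition sequentially by offset.

```python
from hashlib import md5

class PartitionedLog:
    """Toy model of a Kafka-style topic: N append-only partitions."""
    def __init__(self, num_partitions: int = 3):
        self.partitions = [[] for _ in range(num_partitions)]

    def append(self, key: str, value: str) -> tuple[int, int]:
        # The same key always hashes to the same partition, preserving per-key order.
        p = int(md5(key.encode()).hexdigest(), 16) % len(self.partitions)
        self.partitions[p].append(value)
        return p, len(self.partitions[p]) - 1  # (partition, offset)

    def read(self, partition: int, offset: int) -> str:
        # Consumers track their own offsets; the log itself deletes nothing.
        return self.partitions[partition][offset]

log = PartitionedLog()
print(log.append("user-42", "clicked"))    # e.g. (1, 0)
print(log.append("user-42", "purchased"))  # same partition, next offset
```

A broker-model system like RabbitMQ instead routes each message through an exchange to queues and typically removes it once acknowledged, which is why the two excel at different workloads.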
“Latency” is the duration from the execution of a load instruction (to an address that misses in all the caches) to the completion of that load instruction, when the data is returned from memory. The example below is for a 2005-era processor with 60 ns memory latency and 6.4 GB/s of memory bandwidth.
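Assuming the figures above (60 ns latency, 6.4 GB/s of bandwidth), Little's Law gives the concurrency needed to keep the memory system busy. The arithmetic below is a worked illustration:

```python
latency_s = 60e-9        # 60 ns memory latency
bandwidth_bps = 6.4e9    # 6.4 GB/s sustained memory bandwidth
cache_line = 64          # bytes per cache line

# Little's Law: concurrency in flight = latency x bandwidth
in_flight_bytes = latency_s * bandwidth_bps      # 384 bytes
in_flight_lines = in_flight_bytes / cache_line   # 6 cache lines
print(f"{in_flight_bytes:.0f} bytes = {in_flight_lines:.1f} cache lines in flight")
```

In other words, the core must keep roughly six cache-line misses outstanding at all times just to saturate that memory system.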
With the rise of microservices architecture, there has been a rapid acceleration in the modernization of legacy platforms, leveraging cloud infrastructure to deliver highly scalable, low-latency, and more responsive services. Traditional blocking architectures often struggle to maintain performance, especially under high load.
Timestone: Netflix’s High-Throughput, Low-Latency Priority Queueing System with Built-in Support for Non-Parallelizable Workloads, by Kostas Christidis. Timestone is a high-throughput, low-latency priority queueing system we built in-house to support the needs of Cosmos, our media encoding platform. Over the past 2.5
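The excerpt does not describe Timestone's internals, but a minimal sketch of the basic building block, a priority queue that breaks ties in FIFO order so equal-priority work is never reordered, might look like this:

```python
import heapq
import itertools

class PriorityQueue:
    """Min-heap keyed by (priority, insertion order) for stable FIFO ties."""
    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # monotonically increasing tiebreaker

    def push(self, priority: int, item) -> None:
        heapq.heappush(self._heap, (priority, next(self._counter), item))

    def pop(self):
        priority, _, item = heapq.heappop(self._heap)
        return priority, item

q = PriorityQueue()
q.push(2, "encode-trailer")
q.push(1, "encode-episode")  # lower number = higher priority
q.push(1, "encode-recap")
print(q.pop())  # (1, 'encode-episode') -- FIFO among equal priorities
```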
Many recommendation models are confined to a brief temporal window due to constraints in serving latency or training costs. This scenario underscored the need for a new recommender system architecture in which member preference learning is centralized, enhancing accessibility and utility across different models.
Multi-layered caching involves using multiple levels of cache to store and retrieve data. Leveraging this hierarchical structure can significantly reduce latency and improve overall performance.
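A minimal sketch of the idea, with hypothetical names: an in-process dict stands in for the fast first layer, a second dict for a shared cache such as Redis, and a slow function for the origin database.

```python
import time

l1: dict[str, str] = {}   # in-process cache (fastest, smallest)
l2: dict[str, str] = {}   # stand-in for a shared cache such as Redis

def fetch_from_origin(key: str) -> str:
    time.sleep(0.05)       # simulate a slow database call
    return f"value-for-{key}"

def get(key: str) -> str:
    if key in l1:                    # L1 hit: nanoseconds
        return l1[key]
    if key in l2:                    # L2 hit: ~1 ms over the network
        l1[key] = l2[key]            # promote into L1
        return l2[key]
    value = fetch_from_origin(key)   # full miss: pay the slow path once
    l1[key] = l2[key] = value        # populate both layers
    return value

get("user:42")   # slow: misses both layers, hits origin
get("user:42")   # fast: served from L1
```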
The architecture of RabbitMQ is meticulously designed for complex message routing, enabling dynamic and flexible interactions between producers and consumers. This decoupling is crucial in modern architectures where scalability and fault tolerance are paramount. Keeping queues short maintains a responsive and efficient RabbitMQ setup.
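As a hedged illustration of that routing flexibility, assuming a broker on localhost and the pika client (exchange and routing-key names are made up), a topic exchange delivers messages to whichever queues have matching bindings:

```python
import pika  # RabbitMQ client; assumes a broker running on localhost

connection = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
channel = connection.channel()
channel.exchange_declare(exchange="events", exchange_type="topic")

# A consumer binds a queue with a wildcard pattern...
queue = channel.queue_declare(queue="", exclusive=True).method.queue
channel.queue_bind(exchange="events", queue=queue, routing_key="order.*")

# ...and the producer routes by key: "order.created" matches "order.*".
channel.basic_publish(exchange="events",
                      routing_key="order.created",
                      body=b"order 1042 created")
connection.close()
```

Because producers address the exchange rather than a specific queue, consumers can be added, removed, or re-bound without touching producer code.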
First, developers struggled to reason about consistency, durability, and performance in this complex global deployment across multiple stores. Second, developers had to constantly re-learn new data modeling practices and common yet critical data access patterns.
The service should serve real-time (i.e., UI) applications, so CRUD and search operations must be achieved with low latency. Because many internal UI applications will depend on it, search latency for generic text queries must be in the milliseconds.
DigitalOcean is a cost-effective cloud provider that caters to, and is widely adopted by, the developer community. Now, let’s take a look at the throughput and latency performance of our PostgreSQL comparison against DigitalOcean.
As more organizations embrace microservices-based architecture to deliver goods and services digitally, maintaining customer satisfaction has become exponentially more challenging. When organizations implement SLOs, they can improve software development processes, application performance, software quality, and reliability.
Trace your application: imagine a microservices architecture with hundreds of dependencies. Without distributed tracing, pinpointing the cause of increased latency could take hours or even days. Collaborating with your peers based on your software development lifecycle and all data in context has never been easier.
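A minimal sketch of instrumenting one code path with OpenTelemetry's Python SDK, exporting spans to the console for demonstration (service and span names are illustrative):

```python
from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import ConsoleSpanExporter, SimpleSpanProcessor

provider = TracerProvider()
provider.add_span_processor(SimpleSpanProcessor(ConsoleSpanExporter()))
trace.set_tracer_provider(provider)
tracer = trace.get_tracer("checkout-service")

with tracer.start_as_current_span("handle_request"):
    with tracer.start_as_current_span("query_inventory"):  # child span
        pass  # a slow dependency shows up as a long child span in the trace
```

In a real deployment the console exporter would be swapped for an OTLP exporter pointed at a tracing backend, and the parent/child span tree is what lets you pinpoint which dependency added the latency.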
Function cold starts can cause latency outliers and may lead to a poor end-user experience for latency-sensitive applications. The new Amazon capability enables customers to improve the startup latency of their functions from several seconds to as low as sub-second (up to 10 times faster) at P99 (the 99th latency percentile).
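For reference, here is a hedged sketch of how a P99 figure like that is computed from raw latency samples, using the nearest-rank method (the sample data is made up):

```python
def percentile(samples: list[float], pct: float) -> float:
    """Nearest-rank percentile: the value below which pct% of samples fall."""
    ordered = sorted(samples)
    rank = max(0, int(round(pct / 100 * len(ordered))) - 1)
    return ordered[rank]

# Startup latencies in ms; the rare multi-second values are cold starts.
startup_ms = [210, 190, 250, 3100, 205, 220, 198, 2900, 215, 202]
print(percentile(startup_ms, 99))  # 3100 -- P99 is dominated by cold starts
```

This is why P99 is the metric quoted: averages hide the outliers that users actually feel.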
Organizations have multiple stakeholders and almost always have different teams that set up monitoring, operate systems, and develop new functionality. Example 1: architecture boundaries. First, they took a big step back and looked at their end-to-end architecture (Figure 2); the resulting SLO dashboard was defined by architectural boundary.
Code development also benefits from a serverless approach. Application developers can spin up isolated test environments that pose no risk to current operations. Reduced latency: serverless architecture makes it possible to host code anywhere, rather than relying on an origin server. The trade-off is architectural complexity.
SRE applies DevOps principles to developing systems and software that help increase site reliability and performance. As a discipline, SRE focuses on improving software system reliability across key categories including availability, performance, latency, efficiency, capacity, and incident response.
Stream processing: one approach to such a challenging scenario is stream processing, a computing paradigm and software architectural style for data-intensive software systems that emerged to cope with requirements for near-real-time processing of massive amounts of data. This significantly increases event latency.
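A hedged sketch of the paradigm: a tumbling-window count over an unbounded event stream, written with plain Python generators rather than any particular framework.

```python
from typing import Iterable, Iterator

def tumbling_window_counts(events: Iterable[tuple[float, str]],
                           window_s: float = 60.0) -> Iterator[tuple[float, int]]:
    """Emit (window_start, event_count) as each fixed-size window closes."""
    window_start, count = None, 0
    for timestamp, _payload in events:
        if window_start is None:
            window_start = timestamp
        while timestamp >= window_start + window_s:  # window closed: emit it
            yield window_start, count
            window_start += window_s
            count = 0
        count += 1

stream = [(0.0, "a"), (12.5, "b"), (61.0, "c"), (130.0, "d")]
print(list(tumbling_window_counts(stream)))  # [(0.0, 2), (60.0, 1)]
```

The key property is incremental processing: each event is handled as it arrives and state is bounded by the window, rather than accumulating the whole dataset before computing.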
It supports both high throughput services that consume hundreds of thousands of CPUs at a time, and latency-sensitive workloads where humans are waiting for the results of a computation. Modularity and productivity are among its design goals, with local development tools including specialized test runners, code generators, and a command line interface.
Within this paradigm, it is possible to run entire architectures without touching a traditional virtual server, either locally or in the cloud. In a serverless architecture, applications are distributed to meet demand and scale requirements efficiently. When an application is triggered, the cold start as it spins up can cause latency.
This architecture shift greatly reduced the processing latency and increased system resiliency. We expanded pipeline support to serve our studio/content-development use cases, which had different latency and resiliency requirements as compared to the traditional streaming use case.
In that environment, the first PostgreSQL developers decided that forking a process for each connection to the database was the safest choice. Moving to a multithreaded architecture would require extensive rewrites. Developers are often strongly discouraged from holding a database connection while other operations take place.
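One common mitigation for that per-connection cost is pooling. A minimal sketch using psycopg2's built-in pool follows; the DSN and table are placeholders.

```python
from psycopg2.pool import SimpleConnectionPool

# Reuse a small, fixed set of expensive server processes across requests.
pool = SimpleConnectionPool(minconn=1, maxconn=10,
                            dsn="dbname=app user=app host=localhost")

def fetch_user(user_id: int):
    conn = pool.getconn()            # borrow a connection...
    try:
        with conn.cursor() as cur:
            cur.execute("SELECT name FROM users WHERE id = %s", (user_id,))
            return cur.fetchone()
    finally:
        pool.putconn(conn)           # ...and return it immediately, never hold it
```

The try/finally pattern is the point: the connection is released the moment the query completes, in line with the advice above.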
As dynamic systems architectures increase in complexity and scale, IT teams face mounting pressure to track and respond to conditions and issues across their multi-cloud environments. The architects and developers who create the software must design it to be observed.
Table 1: Movie and File Size Examples. Initial architecture: a simplified view of our initial cloud video processing pipeline is illustrated in the following diagram (Figure 1: A Simplified Video Processing Pipeline). With this architecture, chunk encoding is very efficient and processed in distributed cloud computing instances.
Because of its scalability and distributed architecture, thousands of companies trust it to run their cloud and hybrid-based workloads at high availability without compromising performance. It also removes the need for developers and database administrators to manage infrastructure or update database versions.
To this end, we developed a Rapid Event Notification System (RENO) to support use cases that require server-initiated communication with devices in a scalable and extensible manner. Architecture: as shown in the diagram above, the RENO service can be broken down into the following components.
by Rajiv Shringi, Vinay Chella, Kaidan Fullerton, Oleksii Tkachuk, Joey Lynch. As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming, the ability to ingest and store vast amounts of temporal data — often reaching petabytes — with millisecond access latency has become increasingly vital.
How we migrated our Android endpoints out of a monolith into a new microservice, by Rohan Dhruva, Ed Ballot. As Android developers, we usually have the luxury of treating our backends as magic boxes running in the cloud, faithfully returning us JSON. It was a Node.js service. Java…Script? The context around why the Node.js
These include website hosting, database management, backup and restore, IoT capabilities, e-commerce solutions, app development tools and more, with new services released regularly. Lambda’s toolbox of automated processes helps developers build fast, robust, and scalable applications on accelerated timelines.
Retrieval-augmented generation emerges as the standard architecture for LLM-based applications. Given that LLMs can generate factually incorrect or nonsensical responses, retrieval-augmented generation (RAG) has emerged as an industry standard for building GenAI applications.
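A hedged, dependency-free sketch of the RAG pattern: toy word-overlap scoring stands in for vector embeddings, and `call_llm` is a placeholder for any model API.

```python
def similarity(query: str, doc: str) -> float:
    """Toy relevance score via word overlap; real systems use vector embeddings."""
    q, d = set(query.lower().split()), set(doc.lower().split())
    return len(q & d) / (len(q) or 1)

def call_llm(prompt: str) -> str:   # placeholder for a real model API
    return f"[answer grounded in: {prompt[:60]}...]"

docs = ["RabbitMQ uses a broker model with routing.",
        "Kafka stores events in partitioned logs.",
        "SLOs define measurable reliability targets."]

def answer(question: str, k: int = 2) -> str:
    # 1. Retrieve: rank the corpus and keep the top-k passages.
    context = sorted(docs, key=lambda d: similarity(question, d), reverse=True)[:k]
    # 2. Augment: splice retrieved facts into the prompt to ground the model.
    prompt = f"Context: {' '.join(context)}\nQuestion: {question}\nAnswer:"
    # 3. Generate.
    return call_llm(prompt)

print(answer("How does Kafka store events?"))
```

Grounding the prompt in retrieved passages is what reduces the factually incorrect responses the excerpt mentions, since the model answers from supplied evidence rather than memory alone.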
What is observability? Observability gives developers and system operators real-time awareness of a highly distributed system’s current state based on the data it generates. Metrics are measures of critical system values, such as CPU utilization or average write latency to persistent storage.
Currently, he is in the Alexa Shopping organization, where he is developing machine-learning-based solutions to send personalized reorder hints to customers, improving their experience. Architecture: when a user requests a feed, two parallel threads fetch the user’s feeds to optimize for latency, as sketched below.
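A minimal sketch of that fan-out, assuming two hypothetical feed sources and Python threads:

```python
from concurrent.futures import ThreadPoolExecutor

def fetch_followed_posts(user_id: int) -> list[str]:
    return [f"post-for-{user_id}"]        # stand-in for a remote call

def fetch_recommended_posts(user_id: int) -> list[str]:
    return [f"suggestion-for-{user_id}"]  # stand-in for a second remote call

def get_feed(user_id: int) -> list[str]:
    # Both sources are fetched concurrently, so total latency is
    # max(t1, t2) instead of t1 + t2.
    with ThreadPoolExecutor(max_workers=2) as pool:
        followed = pool.submit(fetch_followed_posts, user_id)
        recommended = pool.submit(fetch_recommended_posts, user_id)
        return followed.result() + recommended.result()

print(get_feed(42))
```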
Today we are excited to announce latency heatmaps and improved container support for our on-host monitoring solution, Vector, to the broader community. Remotely view real-time process scheduler latency and TCP throughput with Vector and eBPF. What is Vector? Vector is open source and in use by multiple companies.
The practice uses continuous monitoring and high levels of automation in close collaboration with agile development teams to ensure applications are highly available and perform without friction. Microservices-based architectures and software containers enable organizations to deploy and modify applications with unprecedented speed.
Dynatrace enables various teams, such as developers, threat hunters, business analysts, and DevOps, to effortlessly consume advanced log insights within a single platform. This architecture also means you are not required to determine your log data use cases beforehand or while analyzing logs within the new logs app.
by AIM Team Members Karen Casella, Travis Nelson, Sunny Singh; with prior art and contributions by Justin Ryan, Satyajit Thadeshwar. As most developers can attest, dealing with security protocols and identity tokens, as well as user and device authentication, can be challenging. We are serving over 2.5
High-level architecture: the idea, at a high level, was to avoid the need to query the Atlas database almost entirely and transition most alert queries to streaming evaluation. The results? First and foremost, we have successfully alleviated our initial scalability problem with the polling-based architecture.
While this abundance of dashboards and information is by no means unique to Netflix, it certainly holds true within our microservices architecture. Telltale provides Edgar with latency benchmarks that indicate whether an individual trace’s latency is abnormal for the given service.
As software development grows more complex, managing components using an automated onboarding process becomes increasingly important. With Configuration as Code, developers can manage their observability and security tasks with config files that can be developed alongside source code, conveniently and at scale.
Because Google offers its own Google Cloud Architecture Framework and Microsoft its Azure Well-Architected Framework, organizations that use a combination of these platforms face triple the challenge of integrating their performance frameworks into a cohesive strategy. SRG validates the status of the resiliency SLOs for the experiment period.
As organizations adopt microservices-based architecture, service-level objectives (SLOs) have become a vital way for teams to set specific, measurable targets that ensure users are receiving agreed-upon service levels. Properly set and defined SLOs should have error budgets that give developers space to innovate without impacting operations.
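For concreteness, a hedged sketch of the error-budget arithmetic behind that idea (the target and window are illustrative):

```python
slo_target = 0.999                 # 99.9% availability objective
window_minutes = 30 * 24 * 60      # a 30-day rolling window

error_budget = (1 - slo_target) * window_minutes
print(f"Error budget: {error_budget:.1f} minutes of downtime per 30 days")

consumed = 12.0                    # minutes of downtime so far this window
remaining = error_budget - consumed
print(f"Remaining: {remaining:.1f} minutes "
      f"({remaining / error_budget:.0%} of budget left to spend on releases)")
```

The remaining budget is what gives developers "space to innovate": while it is positive, risky releases are acceptable; once it is spent, the team prioritizes reliability work.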
Today, I want to explore the Amazon ECS architecture and what this architecture enables. Below is a diagram of the basic components of Amazon ECS. How we coordinate the cluster: this architecture affords Amazon ECS high availability, low latency, and high throughput because the data store is never pessimistically locked.
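A hedged sketch of the optimistic pattern that phrase implies: version-checked writes instead of locks (the store and record shapes are made up).

```python
class ConflictError(Exception):
    pass

store = {"cluster-1": {"version": 1, "tasks": 4}}  # toy key-value store

def optimistic_update(key: str, expected_version: int, **changes) -> None:
    """Write succeeds only if nobody else updated the record since we read it."""
    record = store[key]
    if record["version"] != expected_version:
        raise ConflictError("stale read -- re-read and retry")
    record.update(changes)
    record["version"] += 1            # bump version so concurrent writers fail

snapshot = store["cluster-1"]["version"]               # read
optimistic_update("cluster-1", snapshot, tasks=5)      # conditional write succeeds
try:
    optimistic_update("cluster-1", snapshot, tasks=9)  # stale version: rejected
except ConflictError as e:
    print(e)
```

Because no writer ever blocks another, throughput stays high; losers of a race simply re-read and retry rather than waiting on a lock.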
LinkedIn was able to dramatically improve the scalability and performance of its Espresso database by migrating it from HTTP1.1 to HTTP2, resulting in a reduction in the number of connections, latency, and garbage collection times. To achieve these gains, the team had to optimize Netty’s default HTTP2 stack to make it fit their needs.
At Netflix, we also heavily embrace a microservice architecture that emphasizes separation of concerns. The data warehouse is not designed to serve point requests from microservices with low latency. Therefore, we must efficiently move data from the data warehouse to a global, low-latency and highly-reliable key-value store.