Latency and Scalability - Technology Performance Pulse

How to Scale Elasticsearch to Solve Your Scalability Issues

DZone

FEBRUARY 26, 2025

As modern applications serve growing needs for real-time data processing and retrieval, the need for scalability grows, too. This extra network overhead will easily result in increased latency compared to a single-node architecture where data access is straightforward.

Scalability

Scalability Open Source Latency Architecture

Scalable Annotation Service - Marken

The Netflix TechBlog

JANUARY 25, 2023

By Varun Sekhri, Meenakshi Jindal. Introduction: At Netflix, we have hundreds of microservices, each with its own data models or entities. The service should be able to serve real-time (UI) applications, so CRUD and search operations should be achieved with low latency.

Scalability

Scalability Latency Media Architecture

Improve Application Latency With Read Replicas Using YugabyteDB [Video]

DZone

MAY 15, 2023

Scalability and low latency are crucial for any application that relies on real-time data. In this post, we'll discuss how you can use YugabyteDB and its read replica nodes to improve the read latency for users across the globe. One way to achieve this is by storing data closer to the users.

Latency

Latency Scalability

Performance and Scalability Analysis of Redis and Memcached

DZone

JULY 2, 2024

Speed and scalability are significant issues today, at least in the application landscape. We compare throughput, operations per second, and latency under different loads, namely the P90 and P99 percentiles.
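For readers unfamiliar with the P90 and P99 figures mentioned in this excerpt, here is a minimal sketch of how such percentiles can be computed from raw latency samples. It assumes the nearest-rank definition; the helper name `percentile` and the sample values are illustrative and are not taken from the article, whose benchmarking tool may interpolate differently.

```python
import math


def percentile(samples, p):
    """Nearest-rank percentile: the smallest sample such that at least
    p percent of all samples are less than or equal to it."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    # Rank is 1-based; clamp to 1 so very small p still returns a sample.
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]


# Hypothetical request latencies in milliseconds, with one slow outlier.
latencies_ms = [12, 15, 11, 14, 13, 250, 16, 12, 13, 18]

p90 = percentile(latencies_ms, 90)
p99 = percentile(latencies_ms, 99)
print(f"P90 = {p90} ms, P99 = {p99} ms")
```

With these samples, the single outlier sets the P99 value but not the P90, which is why tail percentiles are usually reported alongside averages.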

Scalability

Scalability Performance Benchmarking Games

API Design Principles for Optimal Performance and Scalability

DZone

JUNE 22, 2023

The goal is to help developers, technical managers, and business owners understand the importance of API performance optimization and how they can improve the speed, scalability, and reliability of their APIs. API performance optimization is the process of improving the speed, scalability, and reliability of APIs.

Scalability

Scalability Design Best Practices Performance

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

This decoupling simplifies system architecture and supports scalability in distributed environments. Kafka stores and distributes data through a partitioned log system, which spans multiple brokers to provide fault tolerance and scalability. Apache Kafka uses a custom TCP/IP protocol for high throughput and low latency.

Latency

Latency Analytics Architecture Storage

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

By: Rajiv Shringi, Oleksii Tkachuk, Kartik Sathyanarayanan. Introduction: In our previous blog post, we introduced Netflix’s TimeSeries Abstraction, a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.

Latency

Latency Cache Infrastructure Strategy

The Power of Caching: Boosting API Performance and Scalability

DZone

AUGUST 16, 2023

Benefits of Caching. Improved performance: Caching eliminates the need to retrieve data from the original source every time, resulting in faster response times and reduced latency. Reduced server load: By serving cached content, the load on the server is reduced, allowing it to handle more requests and improving overall scalability.
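As an illustration of the two benefits in this excerpt, the following is a minimal sketch of a time-to-live (TTL) cache placed in front of a slow data source. The class `TTLCache`, the `slow_lookup` function, and the 60-second TTL are assumptions made for this example; the article itself may describe a different caching layer.

```python
import time


class TTLCache:
    """Tiny in-memory cache whose entries expire after ttl_seconds."""

    def __init__(self, ttl_seconds, clock=time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock          # injectable clock, useful for testing
        self._store = {}             # key -> (value, expires_at)

    def get_or_load(self, key, loader):
        now = self._clock()
        entry = self._store.get(key)
        if entry is not None and entry[1] > now:
            return entry[0]          # cache hit: the origin is not contacted
        value = loader(key)          # cache miss or expired: go to the origin
        self._store[key] = (value, now + self._ttl)
        return value


calls = []


def slow_lookup(key):
    """Stand-in for an expensive origin call (database, remote API)."""
    calls.append(key)
    return key.upper()


cache = TTLCache(ttl_seconds=60)
first = cache.get_or_load("user:1", slow_lookup)
second = cache.get_or_load("user:1", slow_lookup)   # served from the cache
print(first, second, "origin calls:", len(calls))
```

The second lookup never reaches `slow_lookup`, which is the source of both the lower latency and the reduced load on the origin. The trade-off is that a cached value can be stale for up to the TTL.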

Cache

Cache Scalability Performance Latency

Efficient Multimodal Data Processing: A Technical Deep Dive

DZone

FEBRUARY 27, 2025

In this article, I will walk through a comprehensive end-to-end architecture for efficient multimodal data processing while striking a balance in scalability, latency, and accuracy by leveraging GPU-accelerated pipelines, advanced neural networks, and hybrid storage platforms.

Efficiency

Efficiency Processing Latency Storage

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

The Challenge of Title Launch Observability: As engineers, we're wired to track system metrics like error rates, latencies, and CPU utilization, but what about metrics that matter to a title's success? The complexity of these operational demands underscored the urgent need for a scalable solution.

Traffic

Traffic Scalability Strategy Monitoring

Stuff The Internet Says On Scalability For January 24th, 2020

High Scalability

JANUARY 24, 2020

Solves compute, latency, and interop. Number Stuff: Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge (which means this post has many more items to read so please keep on reading). Cars become mostly remote controlled pleasure palaces.

Internet

Internet Internet Scalability Latency

Spring WebFlux: publishOn vs subscribeOn for Improving Microservices Performance

DZone

SEPTEMBER 23, 2024

With the rise of microservices architecture, there has been a rapid acceleration in the modernization of legacy platforms, leveraging cloud infrastructure to deliver highly scalable, low-latency, and more responsive services. Why Use Spring WebFlux?

Performance

Performance Latency Architecture Programming

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

Key Takeaways: RabbitMQ improves scalability and fault tolerance in distributed systems by decoupling applications, enabling reliable message exchanges. This decoupling is crucial in modern architectures where scalability and fault tolerance are paramount. Keeping queues short maintains a responsive and efficient RabbitMQ setup.

Best Practices

Best Practices Traffic Strategy Efficiency

Amazon DynamoDB - a Fast and Scalable NoSQL Database.

All Things Distributed

JANUARY 18, 2012

Werner Vogels' weblog on building scalable and robust distributed systems. Amazon DynamoDB - a Fast and Scalable NoSQL Database Service Designed for Internet Scale Applications. The original Dynamo design was based on a core set of strong distributed systems principles resulting in an ultra-scalable and highly reliable database system.

Scalability

Scalability Database Ecommerce Latency

Stuff The Internet Says On Scalability For March 1st, 2019

High Scalability

MARCH 1, 2019

It was made possible by a low latency of 0.1 seconds; the lower the latency, the more responsive the robot. Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge (which means this post has many more items to read so please keep on reading).

Internet

Internet Internet Scalability Blockchain

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

Yet, many are confined to a brief temporal window due to constraints in serving latency or training costs. These insights have shaped the design of our foundation model, enabling a transition from maintaining numerous small, specialized models to building a scalable, efficient system.

Tuning

Tuning Efficiency Latency Strategy

Stuff The Internet Says On Scalability For November 23rd, 2018

High Scalability

NOVEMBER 23, 2018

Delay is Not an Option: Low Latency Routing in Space, Murat). Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge (which means this post has many more items to read so please keep on reading). Here's some fancy FCC reverse engineering magic.

Internet

Internet Internet Scalability Analytics

Stuff The Internet Says On Scalability For May 10th, 2019

High Scalability

MAY 10, 2019

Quotable Stuff: @mjpt777: APIs to IO need to be asynchronous and support batching otherwise the latency of calls dominate throughput and latency profile under burst conditions. $84.4: average yearly Facebook ad revenue per user in North America. Also Thurs are now much more productive. We work too much.

Internet

Internet Internet Scalability Energy

Dynatrace supports SnapStart for Lambda as an AWS launch partner

Dynatrace

NOVEMBER 28, 2022

The new Amazon capability enables customers to improve the startup latency of their functions from several seconds to as low as sub-second (up to 10 times faster) at P99 (the 99th latency percentile). This can cause latency outliers and may lead to a poor end-user experience for latency-sensitive applications.

Lambda

Lambda AWS Serverless Latency

Stuff The Internet Says On Scalability For December 7th, 2018

High Scalability

DECEMBER 7, 2018

It's HighScalability time: This is your 1500ms latency in real life situations - pic.twitter.com/guot8khIPX. Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge (which means this post has many more items to read so please keep on reading).

Internet

Internet Internet Scalability Blockchain

Benchmark (YCSB) numbers for Redis, MongoDB, Couchbase2, Yugabyte and BangDB

High Scalability

FEBRUARY 17, 2021

We note that for MongoDB update latency is really very low (low is better) compared to other dbs, however the read latency is on the higher side. The latency table shows that 99th percentile latency for Yugabyte is quite high compared to others (lower is better). Again Yugabyte latency is quite high. Conclusion.

Benchmarking

Benchmarking Latency C++ Database

Designing Instagram

High Scalability

JANUARY 11, 2022

When a user requests their feed, two parallel threads fetch the user's feeds to optimize for latency. Fun fact: in this talk, Dikang Gu, a software engineer on Instagram's core infra team, describes how they use Cassandra to serve critical use cases, high scalability requirements, and some pain points.

Design

Design Media Storage Logistics

Dynatrace supports the newly released AWS Lambda Response Streaming

Dynatrace

APRIL 7, 2023

Customers can use AWS Lambda Response Streaming to improve performance for latency-sensitive applications and return larger payload sizes. Streaming raises the default 6 MB hard limit to a 20 MB soft limit, adding greater scalability and flexibility to their applications. What is a Lambda serverless function?

Lambda

Lambda AWS Serverless Latency

Stuff The Internet Says On Scalability For September 14th, 2018

High Scalability

SEPTEMBER 14, 2018

SCM slots between DRAM and flash in terms of latency, cost, and density. Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge (which means this post has many more items to read so please keep on reading). So many more quotes.

Internet

Internet Internet Scalability Education

Stuff The Internet Says On Scalability For January 25th, 2019

High Scalability

JANUARY 24, 2019

TServerless: We sat with a solution architect, apparently they are aware of the latency issue and suggested to ditch api gw and build our own solution. For those who sought to control nature through programmable machines, it responds by allowing us to build machines whose nature is that they can no longer be controlled by programs.

Internet

Internet Internet Scalability Games

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Such frameworks support software engineers in building highly scalable and efficient applications that process continuous data streams of massive volume. Stream processing systems, designed for continuous, low-latency processing, demand swift recovery mechanisms to tolerate and mitigate failures effectively.

Engineering

Engineering Tuning Latency Open Source

Optimize Citrix platform performance and user experience with Dynatrace (GA)

Dynatrace

JANUARY 15, 2020

Citrix is a sophisticated, efficient, and highly scalable application delivery platform that is itself comprised of anywhere from hundreds to thousands of servers. Citrix platform performance—optimize your Citrix landscape with insights into user load and screen latency per server. Citrix VDA. SAP server. Citrix VDA. Citrix StoreFront.

Latency

Latency Performance Virtualization Infrastructure

Stuff The Internet Says On Scalability For December 21st, 2018

High Scalability

DECEMBER 21, 2018

It's HighScalability time: Have a very scalable Xmas everyone! Tim Bray: How to talk about [Serverless Latency] · To start with, don’t just say “I need 120ms.” See you in the New Year. Do you like this sort of Stuff? Please support me on Patreon. I'd really appreciate it. Explain the Cloud Like I'm 10.

Internet

Internet Internet Scalability Serverless

Stuff The Internet Says On Scalability For March 22nd, 2019

High Scalability

MARCH 22, 2019

We achieve 5.5 µs of replication latency on lossy Ethernet, which is faster than or comparable to specialized replication systems that use programmable switches, FPGAs, or RDMA.". matthewstoller: I just looked at Netflix’s 10K. At some point, the e-mail I send over WiFi will hit a wire, of course". Yep, there are more quotes.

Internet

Internet Internet Scalability Wireless

What are quality gates? How to use quality gates to deliver better software at speed and scale

Dynatrace

FEBRUARY 21, 2024

This approach supports innovation, ambitious SLOs, DevOps scalability, and competitiveness. These metrics are latency, traffic, errors, and saturation, all of which must be key considerations when curating user experience. In this example, unlike latency, the remaining three signals did not receive a “pass.”

Speed

Speed Software Software Latency

Allegro Reduces Kafka Producer Latency Outliers by 82% After Switching to XFS

InfoQ

APRIL 26, 2024

Allegro experimented with different performance optimization options to improve Apache Kafka producer tail latency and eventually switched all its clusters to the XFS filesystem. The company used Kafka protocol sniffing, JVM profiling, and eBPF, which proved instrumental in identifying and eliminating performance bottlenecks.

Latency

Latency Performance Tuning Scalability

Migrating Critical Traffic At Scale with No Downtime - Part 1

The Netflix TechBlog

MAY 4, 2023

The first phase involves validating functional correctness, scalability, and performance concerns and ensuring the new systems’ resilience before the migration. It provides a good read on the availability and latency ranges under different production conditions.

Traffic

Traffic Latency Tuning Systems

LinkedIn Migrates Espresso to HTTP2 and Reduces Connections by 88% and Latency by 75%

InfoQ

DECEMBER 4, 2023

LinkedIn was able to dramatically improve the scalability and performance of its Espresso database by migrating it from HTTP/1.1 to HTTP/2, resulting in a reduction in the number of connections, latency, and garbage collection times. By Rafal Gancarz

Latency

Latency Scalability Database Performance

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

By Rajiv Shringi, Vinay Chella, Kaidan Fullerton, Oleksii Tkachuk, Joey Lynch. Introduction: As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming, the ability to ingest and store vast amounts of temporal data, often reaching petabytes, with millisecond access latency has become increasingly vital.

Latency

Latency Storage Traffic Tuning

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

Central to this infrastructure is our use of multiple online distributed databases such as Apache Cassandra, a NoSQL database known for its high availability and scalability. It also serves as central configuration of access patterns such as consistency or latency targets. Useful for keeping “n-newest” or prefix path deletion.

Latency

Latency Storage Cache Servers

Consistent caching mechanism in Titus Gateway

The Netflix TechBlog

NOVEMBER 3, 2022

In that scenario, the system would need to deal with the data propagation latency directly, for example, by use of timeouts or client-originated update tracking mechanisms. We started seeing increased response latencies and leader servers running at dangerously high utilization.

Cache

Cache Latency Traffic Systems

The Netflix Cosmos Platform

The Netflix TechBlog

MARCH 1, 2021

It supports both high throughput services that consume hundreds of thousands of CPUs at a time, and latency-sensitive workloads where humans are waiting for the results of a computation. The third generation, called Reloaded, has been online for about seven years and has proven to be stable and massively scalable.

Serverless

Serverless Media Latency Social Media

Dynatrace supports Azure Managed Instance for Apache Cassandra

Dynatrace

MAY 13, 2022

Because of its scalability and distributed architecture, thousands of companies trust it to run their cloud and hybrid-based workloads at high availability without compromising performance. With the Dynatrace Data Explorer, you can easily analyze metrics, such as client read/write latency by Cassandra nodes and disk space usage by keyspaces.

Azure

Azure Latency Metrics Infrastructure

How to maximize serverless benefits and overcome its challenges

Dynatrace

OCTOBER 10, 2022

Many organizations today rely on cloud-native applications for their scalability and agility, among other benefits. Serverless benefits include the following: Dynamic scalability. Reduced latency. By using cloud providers with multiple server sites, organizations can reduce function latency for end users.

Serverless

Serverless Infrastructure Lambda Latency

Stuff The Internet Says On Scalability For July 20th, 2018

High Scalability

JULY 20, 2018

A typical example of modern "microservices-inspired" Java application would function along these lines: Netflix: We observed during experimentation that RAM random read latencies were rarely higher than 1 microsecond whereas typical SSD random read speeds are between 100–500 microseconds. There are a few more quotes.

Internet

Internet Internet Scalability Automotive

Self-Host Your Static Assets

CSS Wizardry

MAY 31, 2019

Every new origin we need to visit needs a connection opening, and that can be very costly: DNS resolution, TCP handshakes, and TLS negotiation all add up, and the story gets worse the higher the latency of the connection is. On a slower, higher-latency connection, the story is much, much worse. All completely avoidable. to just 3.6s.

Cache

Cache Latency Infrastructure Website

Edge Computing Orchestration in IoT: Coordinating Distributed Workloads

DZone

FEBRUARY 2, 2024

This proximity to data generation reduces latency, conserves bandwidth and enables real-time decision-making. However, managing distributed workloads across various edge nodes in a scalable and efficient manner is a complex challenge.

IoT

IoT Artificial Intelligence Latency Internet

Distributed Algorithms in NoSQL Databases

Highly Scalable

SEPTEMBER 18, 2012

Scalability is one of the main drivers of the NoSQL movement. Historically, NoSQL paid a lot of attention to tradeoffs between consistency, fault-tolerance and performance to serve geographically distributed systems, low-latency or highly available applications. Read/Write latency. Read/Write scalability. Data Placement.

Database

Database Latency C++ Scalability

How To Design For High-Traffic Events And Prevent Your Website From Crashing

Smashing Magazine

JANUARY 7, 2025

For example, you can switch to a scalable cloud-based web host, or compress/optimize images to save bandwidth. Choose A Scalable Web Host The most convenient way to design a high-traffic website without worrying about website crashes is to upgrade your web hosting solution.

Traffic

Traffic Website Design Cache
