Architecture, Cache and Efficiency - Technology Performance Pulse

Architectural Insights: Designing Efficient Multi-Layered Caching With Instagram Example

DZone

FEBRUARY 27, 2024

Caching is a critical technique for optimizing application performance by temporarily storing frequently accessed data, allowing for faster retrieval during subsequent requests. Multi-layered caching involves using multiple levels of cache to store and retrieve data.

Cache

Cache Efficiency Architecture Design

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

Best Effort Regional Counter This type of counter is powered by EVCache , Netflix’s distributed caching solution built on the widely popular Memcached. Without an efficient data retention strategy, this approach may struggle to scale effectively. Efficient Aggregation: Each rollup consumer processes a batch of counters simultaneously.

Latency

Latency Cache Infrastructure Strategy

Distance-Based ISA for Efficient Register Management

ACM Sigarch

APRIL 2, 2025

To create a CPU core that can execute a large number of instructions in parallel, it is necessary to improve both the architecturewhich includes the overall CPU design and the instruction set architecture (ISA) designand the microarchitecture, which refers to the hardware design that optimizes instruction execution.

Efficiency

Efficiency Hardware Architecture Design

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

This scenario underscored the need for a new recommender system architecture where member preference learning is centralized, enhancing accessibility and utility across different models. At inference time, when multi-step decoding is needed, we can deploy KV caching to efficiently reuse past computations and maintain lowlatency.

Tuning

Tuning Efficiency Latency Strategy

Sustainable IT: Optimize your hybrid-cloud carbon footprint

Dynatrace

DECEMBER 21, 2023

Evaluating these on three levels—data center, host, and application architecture (plus code)—is helpful. Most approaches focus on improving Power Usage Effectiveness (PUE), a data center energy-efficiency measure. energy-efficient data centers—cloud providers—achieve values closer to 1.2. A PUE of 1.0

Cloud

Cloud Energy Best Practices Cache

Dynatrace accelerates business transformation with new AI observability solution

Dynatrace

JANUARY 31, 2024

Retrieval-augmented generation emerges as the standard architecture for LLM-based applications Given that LLMs can generate factually incorrect or nonsensical responses, retrieval-augmented generation (RAG) has emerged as an industry standard for building GenAI applications.

Cache

Cache Azure Infrastructure Monitoring

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

In this post, we dive deep into how Netflix’s KV abstraction works, the architectural principles guiding its design, the challenges we faced in scaling diverse use cases, and the technical innovations that have allowed us to achieve the performance and reliability required by Netflix’s global operations.

Latency

Latency Storage Cache Servers

AWS serverless services: Exploring your options

Dynatrace

OCTOBER 7, 2021

To get a better understanding of AWS serverless, we’ll first explore the basics of serverless architectures, review AWS serverless offerings, and explore common use cases. Serverless architecture: A primer. Serverless architecture shifts application hosting functions away from local servers onto those managed by providers.

Serverless

Serverless AWS Lambda Storage

Netflix Cloud Packaging in the Terabyte Era

The Netflix TechBlog

SEPTEMBER 24, 2021

Table 1: Movie and File Size Examples Initial Architecture A simplified view of our initial cloud video processing pipeline is illustrated in the following diagram. Figure 1: A Simplified Video Processing Pipeline With this architecture, chunk encoding is very efficient and processed in distributed cloud computing instances.

Cloud

Cloud Media Storage Cache

Managing the Dynatrace API across multiple thousand environments

Dynatrace

JULY 16, 2020

To do that, we need an easy and efficient API access to all of our Dynatrace Environments, without having to create and maintain API access tokens of individual tenants. TenantCache: a cache to store tenant information and API token information and semi-permanent data to avoid unnecessary roundtrips. ? Consolidating the APIs.

Cache

Cache Serverless Efficiency Tuning

Key Advantages of DBMS for Efficient Data Management

Scalegrid

JANUARY 5, 2024

Enhanced data security, better data integrity, and efficient access to information. Despite initial investment costs, DBMS presents long-term savings and improved efficiency through automated processes, efficient query optimizations, and scalability, contributing to enhanced decision-making and end-user productivity.

Efficiency

Efficiency Storage Database Scalability

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Dynatrace

OCTOBER 4, 2022

While data lakes and data warehousing architectures are commonly used modes for storing and analyzing data, a data lakehouse is an efficient third way to store and analyze data that unifies the two architectures while preserving the benefits of both. This is simply not possible with conventional architectures.

Artificial Intelligence

Artificial Intelligence Storage Analytics Government

How to Optimize Digital Experience and Operations with Dynatrace

Dynatrace

AUGUST 30, 2019

Missing Cache Settings – Make sure you cache resources that don’t change often on the browser or use a CDN. Reducing performance and architectural issues in their backend system gave them a 99% performance improvement! Missing caching layers, e.g. provide a read-only cache for static data.

Cache

Cache Database Architecture Government

Re-Architecting the Video Gatekeeper

The Netflix TechBlog

JULY 12, 2019

The Tech Hollow , an OSS technology we released a few years ago, has been best described as a total high-density near cache : Total : The entire dataset is cached on each node?—?there there is no eviction policy, and there are no cache misses. Near : the cache exists in RAM on any instance which requires access to the dataset.

Cache

Cache Architecture Latency Engineering

Predictive CPU isolation of containers at Netflix

The Netflix TechBlog

JUNE 4, 2019

Because microprocessors are so fast, computer architecture design has evolved towards adding various levels of caching between compute units and the main memory, in order to hide the latency of bringing the bits to the brains. This avoids thrashing caches too much for B and evens out the pressure on the L3 caches of the machine.

Cache

Cache Latency Airlines Logistics

Implementing AWS well-architected pillars with automated workflows

Dynatrace

SEPTEMBER 13, 2023

This is a set of best practices and guidelines that help you design and operate reliable, secure, efficient, cost-effective, and sustainable systems in the cloud. The framework comprises six pillars: Operational Excellence, Security, Reliability, Performance Efficiency, Cost Optimization, and Sustainability.

AWS

AWS Efficiency Azure Cloud

Azure Well-Architected Framework: What it is and how to tame it with AI and automation

Dynatrace

APRIL 21, 2022

As organizations adopt microservices architecture with cloud-native technologies such as Microsoft Azure , many quickly notice an increase in operational complexity. The Azure Well-Architected Framework is a set of guiding tenets organizations can use to evaluate architecture and implement designs that will scale over time.

Azure

Azure Monitoring Virtualization Metrics

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

In previous blog posts, we introduced the Key-Value Data Abstraction Layer and the Data Gateway Platform , both of which are integral to Netflix’s data architecture. Instead, we focus on addressing the challenge of storing and accessing extremely high-throughput, immutable temporal event data in a low-latency and cost-efficient manner.

Latency

Latency Storage Traffic Tuning

ABAC on SpiceDB: Enabling Netflix’s Complex Identity Types

The Netflix TechBlog

MAY 19, 2023

Netflix is always looking for security, ergonomic, or efficiency improvements, and this extends to authorization tools. Over time, each node caches a subset of subproblems to support a distributed cache, reduce the datastore load, and achieve SpiceDB’s horizontal scalability.

Cache

Cache Google Open Source Systems

Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

SEPTEMBER 8, 2020

This allows the app to query a list of “paths” in each HTTP request, and get specially formatted JSON (jsonGraph) that we use to cache the data and hydrate the UI. For us, it means that we now need to have ~15 MDN tabs open when writing routes :) Let’s briefly discuss the architecture of this microservice. It was a Node.js

Latency

Latency Cache Java Traffic

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

Key Takeaways Redis offers complex data structures and additional features for versatile data handling, while Memcached excels in simplicity with a fast, multi-threaded architecture for basic caching needs. Introduction Caching serves a dual purpose in web development – speeding up client requests and reducing server load.

Cache

Cache Storage Scalability Architecture

Elasticsearch Indexing Strategy in Asset Management Platform (AMP)

The Netflix TechBlog

MARCH 10, 2023

These applications are built on a microservices architecture, and the Asset Management Platform provides asset management to those dozens of services for various asset types. To avoid the ES query for the list of indices for every indexing request, we keep the list of indices in a distributed cache.

Strategy

Strategy Cache Storage Analytics

Expanding the Cloud: More memory, more caching and more performance for your data

All Things Distributed

SEPTEMBER 3, 2013

Choosing your database architecture may be the most critical decision you’ll make and has a disproportionate impact on the performance, scalability, and availability of your app. No single database architecture or solution can meet all of Amazon.com’s or our customers’ needs.

Cache

Cache Cloud Performance Retail

How Data Inspires Building a Scalable, Resilient and Secure Cloud Infrastructure At Netflix

The Netflix TechBlog

MARCH 5, 2019

the order of the rows on your Netflix home page, issuing content licenses when you click play, finding the Open Connect cache closest to you with the content you requested, and many more). In the Efficiency space, our data teams focus on transparency and optimization.

Infrastructure

Infrastructure Cloud Scalability AWS

Breaking Down Bulky Builds With Netlify And Next.js

Smashing Magazine

JUNE 29, 2021

Without build optimizations (incremental builds, caching, we will get to those soon) this will eventually become unmanageable as well — think about going through all images in a website: resizing, deleting, and/or creating new files over and over again. Jamstack general service architecture ( Large preview ).

Cache

Cache Website Strategy Traffic

Comparisons of Proxies for MySQL

Percona

MARCH 20, 2023

When designing an architecture, many components need to be considered before deciding on the best solution. In this scenario, it is also crucial to be efficient in resource utilization and scaling with frugality. This is due to the multiplexing and the very efficient way ProxySQL uses to deal with high load.

Games

Games Latency Traffic Cache

Building an elastic query engine on disaggregated storage

The Morning Paper

MARCH 8, 2020

When I think about cloud-native architectures, I think about disaggregation (enabling each resource type to scale independently), fine-grained units of resource allocation (enabling rapid response to changing workload demands, i.e. elasticity), and isolation (keeping tenants apart). joins) during query processing.

Storage

Storage Engineering Cache Serverless

The case for a learned sorting algorithm

The Morning Paper

OCTOBER 18, 2020

Radix Sort is carefully designed to make effective use of the L2 cache and sequential memory accesses, whereas Learned Sort is making random accesses all over the destination array. How can learned sort be adapted to make it cache-efficient? Sympathy for the machine. For the evaluation set-up, this meant $f$ was around 1,000.

Cache

Cache C++ Java Tuning

The Winds of Architecture Changes at the USENIX ATC 2019

ACM Sigarch

NOVEMBER 1, 2019

This blog post gives a glimpse of the computer systems research papers presented at the USENIX Annual Technical Conference (ATC) 2019, with an emphasis on systems that use new hardware architectures. As a consequence, the vast majority of the papers in the past has usually focused on conventional X86 or GPU-accelerated architectures.

Architecture

Architecture Hardware Cache Storage

Optimizing CDN Architecture: Enhancing Performance and User Experience

IO River

NOVEMBER 2, 2023

A content delivery network (CDN) is a distributed network of servers strategically located across multiple geographical locations to deliver web content to end users more efficiently. What is CDN Architecture?‍CDN CDN architecture serves as a blueprint or plan that guides the distribution of CDN provider PoPs.

Architecture

Architecture Cache Performance Latency

Optimizing CDN Architecture: Enhancing Performance and User Experience

IO River

NOVEMBER 2, 2023

â€A content delivery network (CDN) is a distributed network of servers strategically located across multiple geographical locations to deliver web content to end users more efficiently. CDNs cache content on edge servers distributed globally, reducing the distance between users and the content they want.â€CDNs

Architecture

Architecture Cache Performance Latency

Handling user-initiated actions in an asynchronous, message-based architecture

O'Reilly Software

DECEMBER 11, 2017

A message-based microservices architecture offers many advantages, making solutions easier to scale and expand with new services. The asynchronous nature of interservice interactions inherent to this architecture, however, poses challenges for user-initiated actions such as create-read-update-delete (CRUD) requests on an object.

Architecture

Architecture Government Latency Efficiency

Embrace event-driven computing: Amazon expands DynamoDB with streams, cross-region replication, and database triggers

All Things Distributed

JULY 14, 2015

In this blog post, I will explain how these three new capabilities empower you to build applications with distributed systems architecture and create responsive, reliable, and high-performance applications using DynamoDB that work at any scale. The point-of-sales system records changes from all the purchases and stores them in DynamoDB.

Database

Database Lambda AWS IoT

Hierarchical Navigation and Faceted Search on Top of Oracle Coherence

Highly Scalable

APRIL 2, 2012

From the business logic point of view, this was a pretty typical eCommerce service for hierarchical and faceted navigation, although not without peculiarities, but high performance requirements led us to the quite advanced architecture and technical design. So, the only way was to cache all necessary data to minimize interaction with RDBMS.

Ecommerce

Ecommerce Cache Storage Architecture

Redis® Monitoring Strategies for 2025

Scalegrid

JANUARY 21, 2025

With its widespread use in modern application architectures, understanding the ins and outs of Redis monitoring is essential for any tech professional. To monitor Redis instances effectively, collect Redis metrics focusing on cache hit ratio, memory allocated, and latency threshold.

Strategy

Strategy Monitoring Latency DevOps

Cloudburst: stateful functions-as-a-service

The Morning Paper

FEBRUARY 6, 2020

Last week we looked at a function shipping solution to the problem; Cloudburst uses the more common data shipping to bring data to caches next to function runtimes (though you could also make a case that the scheduling algorithm placing function execution in locations where the data is cached a flavour of function-shipping too).

Serverless

Serverless Lambda Cache Latency

Mastering Distributed SQL™ Databases in 2025

Scalegrid

JANUARY 10, 2025

Importance of Managing and Scaling Distributed SQL Databases Managing and growing distributed SQL databases is important for modern businesses to work efficiently and stay agile. It requires implementing scalable architecture from the outset and continuously monitoring performance data to predict when to scale.

Database

Database Scalability Best Practices Blockchain

MICRO 2019 Trip Report

ACM Sigarch

NOVEMBER 4, 2019

The technical program, put together by program chairs Tor Aamodt and Reetuparna Das , showcased key innovations across a wide range of computer architecture topics, from domain-specific accelerators to in/near-memory computing and from security to quantum computing. . This year’s MICRO had three inspiring keynote talks.

Hardware

Hardware Architecture Programming Innovation

What is a Distributed Storage System

Scalegrid

FEBRUARY 8, 2024

Distributed Storage Architecture Distributed storage systems are designed with a core framework that includes the main system controller, a data repository for the system, and a database. This makes adopting such sophisticated multi-node-based arrangements exceedingly advantageous from both operational efficiency and financial viewpoints.

Storage

Storage Systems Big Data Azure

ML Platform Meetup: Infra for Contextual Bandits and Reinforcement Learning

The Netflix TechBlog

OCTOBER 18, 2019

It featured three relevant talks from LinkedIn, Netflix and Facebook, and a platform architecture overview talk from first time participant Dropbox. In this talk, Kinjal used the example of the LinkedIn Feed, to demonstrate how they use bandit algorithms to solve for the optimal parameter selection problem efficiently.

Infrastructure

Infrastructure Metrics Architecture Efficiency

How a RESTful API server reacts to requests

O'Reilly Software

DECEMBER 28, 2017

Learn how to properly design RESTful APIs communication with clients, accounting for request structure, authentication, and caching. This series of articles shows you how to derive an easy-to-use, robust, efficient API to serve users on the web or on mobile devices. We are using the principles of RESTful architecture over HTTP.

Servers

Servers Cache Mobile Architecture

How to bring fast data access to microservice architecture with in-memory data grids

O'Reilly Software

DECEMBER 7, 2017

For stack scalability, elasticity at the business logic layer should be matched with elasticity at the caching layer. Continue reading How to bring fast data access to microservice architecture with in-memory data grids. This use comes with expectations of real-time response, even during peak access times. ?

Architecture

Architecture Cache Mobile Scalability

Redis® Monitoring Strategies for 2024

Scalegrid

DECEMBER 21, 2023

With its widespread use in modern application architectures, understanding the ins and outs of Redis® monitoring is essential for any tech professional. To monitor Redis® instances effectively, collect Redis metrics focusing on cache hit ratio, memory allocated, and latency threshold.

Strategy

Strategy Monitoring Latency DevOps

Improving The Performance Of Wix Websites (Case Study)

Smashing Magazine

NOVEMBER 22, 2021

In order to create change across our entire organization, we needed to get all the relevant employees, partners, and even customers up to speed about performance quickly and efficiently. The results of some of these APIs are also cached in a CDN as appropriate. Creating A Performance Culture. Measuring And Monitoring. Large preview ).

Website

Website Performance Google Website Performance

Architectural Insights: Designing Efficient Multi-Layered Caching With Instagram Example

Netflix’s Distributed Counter Abstraction

Trending Sources

Distance-Based ISA for Efficient Register Management

Foundation Model for Personalized Recommendation

Sustainable IT: Optimize your hybrid-cloud carbon footprint

Dynatrace accelerates business transformation with new AI observability solution

Introducing Netflix’s Key-Value Data Abstraction Layer

AWS serverless services: Exploring your options

Netflix Cloud Packaging in the Terabyte Era

Managing the Dynatrace API across multiple thousand environments

Key Advantages of DBMS for Efficient Data Management

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

How to Optimize Digital Experience and Operations with Dynatrace

Re-Architecting the Video Gatekeeper

Predictive CPU isolation of containers at Netflix

Implementing AWS well-architected pillars with automated workflows

Azure Well-Architected Framework: What it is and how to tame it with AI and automation

Introducing Netflix TimeSeries Data Abstraction Layer

ABAC on SpiceDB: Enabling Netflix’s Complex Identity Types

Seamlessly Swapping the API backend of the Netflix Android app

Redis vs Memcached in 2024

Elasticsearch Indexing Strategy in Asset Management Platform (AMP)

Expanding the Cloud: More memory, more caching and more performance for your data

How Data Inspires Building a Scalable, Resilient and Secure Cloud Infrastructure At Netflix

Breaking Down Bulky Builds With Netlify And Next.js

Comparisons of Proxies for MySQL

Building an elastic query engine on disaggregated storage

The case for a learned sorting algorithm

The Winds of Architecture Changes at the USENIX ATC 2019

Optimizing CDN Architecture: Enhancing Performance and User Experience

Optimizing CDN Architecture: Enhancing Performance and User Experience

Handling user-initiated actions in an asynchronous, message-based architecture

Embrace event-driven computing: Amazon expands DynamoDB with streams, cross-region replication, and database triggers

Hierarchical Navigation and Faceted Search on Top of Oracle Coherence

Redis® Monitoring Strategies for 2025

Cloudburst: stateful functions-as-a-service

Mastering Distributed SQL™ Databases in 2025

MICRO 2019 Trip Report

What is a Distributed Storage System

ML Platform Meetup: Infra for Contextual Bandits and Reinforcement Learning

How a RESTful API server reacts to requests

How to bring fast data access to microservice architecture with in-memory data grids

Redis® Monitoring Strategies for 2024

Improving The Performance Of Wix Websites (Case Study)

Stay Connected