Cache, Design and Storage - Technology Performance Pulse

Designing Instagram

High Scalability

JANUARY 11, 2022

Design a photo-sharing platform similar to Instagram where users can upload their photos and share it with their followers. High Level Design. Component Design. API Design. We have provided the API design of posting an image on Instagram below. Problem Statement. Sending and receiving messages from other users.

Design

Design Media Storage Logistics

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

By: Rajiv Shringi , Oleksii Tkachuk , Kartik Sathyanarayanan Introduction In our previous blog post, we introduced Netflix’s TimeSeries Abstraction , a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.

Latency

Latency Cache Infrastructure Strategy

How To Design For High-Traffic Events And Prevent Your Website From Crashing

Smashing Magazine

JANUARY 7, 2025

How To Design For High-Traffic Events And Prevent Your Website From Crashing How To Design For High-Traffic Events And Prevent Your Website From Crashing Saad Khan 2025-01-07T14:00:00+00:00 2025-01-07T22:04:48+00:00 This article is sponsored by Cloudways Product launches and sales typically attract large volumes of traffic.

Traffic

Traffic Website Design Cache

Consistent caching mechanism in Titus Gateway

The Netflix TechBlog

NOVEMBER 3, 2022

We introduce a caching mechanism in the API gateway layer, allowing us to offload processing from singleton leader elected controllers without giving up strict data consistency and guarantees clients observe. When a new leader is elected it loads all data from external storage. The cache is kept in sync with the current leader process.

Cache

Cache Latency Traffic Systems

MezzFS?—?Mounting object storage in Netflix’s media processing platform

The Netflix TechBlog

MARCH 6, 2019

Mounting object storage in Netflix’s media processing platform By Barak Alon (on behalf of Netflix’s Media Cloud Engineering team) MezzFS (short for “Mezzanine File System”) is a tool we’ve developed at Netflix that mounts cloud objects as local files via FUSE. Our object storage service splits objects into many parts and stores them in S3.

Media

Media Storage Processing Cache

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

In this post, we dive deep into how Netflix’s KV abstraction works, the architectural principles guiding its design, the challenges we faced in scaling diverse use cases, and the technical innovations that have allowed us to achieve the performance and reliability required by Netflix’s global operations.

Latency

Latency Storage Cache Efficiency

AWS serverless services: Exploring your options

Dynatrace

OCTOBER 7, 2021

This means you no longer have to provision, scale, and maintain servers to run your applications, databases, and storage systems. Instead of worrying about infrastructure management functions, such as capacity provisioning and hardware maintenance, teams can focus on application design, deployment, and delivery. Reliability.

Serverless

Serverless AWS Lambda Storage

Netflix Cloud Packaging in the Terabyte Era

The Netflix TechBlog

SEPTEMBER 24, 2021

From chunk encoding to assembly and packaging, the result of each previous processing step must be uploaded to cloud storage and then downloaded by the next processing step. Since not all projects are terabytes projects, allocating the largest cloud storage to all packager instances is not an efficient use of cloud resources.

Cloud

Cloud Media Storage Cache

Building an elastic query engine on disaggregated storage

The Morning Paper

MARCH 8, 2020

Building an elastic query engine on disaggregated storage , Vuppalapati, NSDI’20. This paper describes the design decisions behind the Snowflake cloud-based data warehouse. have altered the many assumptions that guided the design and optimization of the Snowflake system. From shared-nothing to disaggregation.

Storage

Storage Engineering Cache Serverless

What is a Distributed Storage System

Scalegrid

FEBRUARY 8, 2024

A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. Understanding distributed storage is imperative as data volumes and the need for robust storage solutions rise.

Storage

Storage Systems Big Data Azure

Dynatrace Kubernetes Observability for Persistent Volume Claims

Dynatrace

AUGUST 1, 2022

Kubernetes was initially designed with a strong focus on stateless workloads, meaning these workloads do not need to store any persistent data. Interestingly, our partner RedHat reported in 2021 that around 80% of deployed workloads are databases or data caches, storing data in persistent volume claims (PVCs). Dynatrace news.

Storage

Storage Database Network Metrics

Geek Reading - Week of June 5, 2013

DZone

OCTOBER 11, 2022

Simpler UI Testing with CasperJS ( Architects Zone – Architectural Design Patterns & Best Practices). Using MongoDB as a cache store ( Architects Zone – Architectural Design Patterns & Best Practices). Why haven’t cash-strapped American schools embraced open source? Hacker News). Thoughts, Insights and Further Pointers.

Java

Java Best Practices Google Analytics

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

Building on these foundational abstractions, we developed the TimeSeries Abstraction — a versatile and scalable solution designed to efficiently store and query large volumes of temporal event data with low millisecond latencies, all in a cost-effective manner across various use cases.

Latency

Latency Storage Traffic Tuning

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Dynatrace

OCTOBER 4, 2022

Data warehouses offer a single storage repository for structured data and provide a source of truth for organizations. Unlike data warehouses, however, data is not transformed before landing in storage. A data lakehouse provides a cost-effective storage layer for both structured and unstructured data. Data management.

Artificial Intelligence

Artificial Intelligence Storage Analytics Government

Elasticsearch Indexing Strategy in Asset Management Platform (AMP)

The Netflix TechBlog

MARCH 10, 2023

are stored in secure storage layers. Amsterdam is built on top of three storage layers. One of the first decisions when integrating with Elasticsearch is designing the indices, their settings and mappings. For every asset indexing request, we look at the cache to determine the corresponding time bucket index for the asset.

Strategy

Strategy Cache Storage Analytics

Helping VFX studios pave a path to the cloud

The Netflix TechBlog

NOVEMBER 15, 2022

But it’s not easy: to pull this off, VFX studios need to build and operate serious technical infrastructure (compute, storage, networking, and software licensing), otherwise known as a “ render farm.” Netflix production teams work with a global roster of VFX studios (both large and small) and their artists to create this amazing imagery.

Cloud

Cloud Entertainment AWS Infrastructure

What is session replay? Discover user pain points with session recordings

Dynatrace

DECEMBER 20, 2021

Conversely, if users encounter functional issues or poor UI design that frustrate common actions, replays provide clear evidence. Streamlined asset caching: Asset caching is critical for creating accurate replays. Tools that feature client-side compression can help reduce total data transfer volumes and storage footprints.

Mobile

Mobile Website Analytics Cache

Pushy to the Limit: Evolving Netflix’s WebSocket proxy for the future

The Netflix TechBlog

SEPTEMBER 10, 2024

To support this growth, we’ve revisited Pushy’s past assumptions and design decisions with an eye towards both Pushy’s future role and future stability. KeyValue is an abstraction over the storage engine itself, which allows us to choose the best storage engine that meets our SLO needs.

Latency

Latency Cache Tuning Efficiency

Redis vs Memcached in 2024

Scalegrid

MARCH 28, 2024

Key Takeaways Redis offers complex data structures and additional features for versatile data handling, while Memcached excels in simplicity with a fast, multi-threaded architecture for basic caching needs. Redis is better suited for complex data models, and Memcached is better suited for high-throughput, string-based caching scenarios.

Cache

Cache Storage Scalability Architecture

Data ingestion pipeline with Operation Management

The Netflix TechBlog

MARCH 7, 2023

We designed a unique concept called Annotation Operations which allows teams to create data pipelines and easily write annotations without worrying about access patterns of their data from different applications. We store all OperationIDs which are in STARTED state in a distributed cache (EVCache) for fast access during searches.

Media

Media Latency Architecture Database

Expanding the Cloud - Introducing Amazon ElastiCache - All Things.

All Things Distributed

AUGUST 22, 2011

Today AWS has launched Amazon ElastiCache , a new service that makes it easy to add distributed in-memory caching to any application. Amazon ElastiCache handles the complexity of creating, scaling and managing an in-memory cache to free up brainpower for more differentiating activities. Driving Storage Costs Down for AWS Customers.

Cloud

Cloud Cache AWS Storage

Implementing AWS well-architected pillars with automated workflows

Dynatrace

SEPTEMBER 13, 2023

This is a set of best practices and guidelines that help you design and operate reliable, secure, efficient, cost-effective, and sustainable systems in the cloud. Storing frequently accessed data in faster storage, usually in-memory caching, improves data retrieval speed and overall system performance. Beyond

AWS

AWS Efficiency Azure Cloud

Cloudburst: stateful functions-as-a-service

The Morning Paper

FEBRUARY 6, 2020

.’ Stateless is fine until you need state, at which point the coarse-grained solutions offered by current platforms limit the kinds of application designs that work well. On the Cloudburst design teams’ wish list: A running function’s ‘hot’ data should be kept physically nearby for low-latency access.

Serverless

Serverless Lambda Cache Latency

Amazon DynamoDB ? a Fast and Scalable NoSQL Database.

All Things Distributed

JANUARY 18, 2012

a Fast and Scalable NoSQL Database Service Designed for Internet Scale Applications. Today is a very exciting day as we release Amazon DynamoDB , a fast, highly reliable and cost-effective NoSQL database service designed for internet scale applications. Amazon DynamoDB â?? By Werner Vogels on 18 January 2012 07:00 AM. Comments ().

Scalability

Scalability Database Ecommerce Latency

File systems unfit as distributed storage backends: lessons from ten years of Ceph evolution

The Morning Paper

NOVEMBER 5, 2019

File systems unfit as distributed storage backends: lessons from 10 years of Ceph evolution Aghayev et al., In this case, the assumption that a distributed storage backend should clearly be layered on top of a local file system. What is a distributed storage backend? SOSP’19. This is not surprising in hindsight.

Storage

Storage Systems Hardware Efficiency

Crucial Redis Monitoring Metrics You Must Watch

Scalegrid

JANUARY 25, 2024

Effective management of memory stores with policies like LRU/LFU proactive monitoring of the replication process and advanced metrics such as cache hit ratio and persistence indicators are crucial for ensuring data integrity and optimizing Redis’s performance. offers the Software Watchdog specifically designed for this purpose.

Metrics

Metrics Monitoring Latency Cache

No Server Required - Jekyll & Amazon S3 - All Things Distributed

All Things Distributed

AUGUST 17, 2011

As some of you may remember I was pretty excited when Amazon Simple Storage Service (S3) released its website feature such that I could serve this weblog completely from S3. It is simple and elegant, as you would expect from someone who has won several design awards. Driving Storage Costs Down for AWS Customers. or rss feed.

Servers

Servers Social Media AWS Website

WiredTiger Logging and Checkpoint Mechanism

Percona

MARCH 28, 2023

The same data, in the form of pages inside the Wiredtiger cache, are also marked dirty. At every checkpoint interval (Default 60 seconds), MongoDB flushes the modified pages that are marked as dirty in the cache to their respective data files (both collection-*.wt This happens at every journalCommitIntervalMs. wt and index-*.wt).

Hardware

Hardware C++ Storage Cache

Expanding the Cloud with DNS - Introducing Amazon Route 53 - All.

All Things Distributed

DECEMBER 5, 2010

There are two main types of DNS servers: authoritative servers and caching resolvers. But the real robustness of the DNS system comes through the way lookups are handled, which is what caching resolvers do. Caching techniques ensure that the DNS system doesnt get overloaded with queries. Subscribe to this weblogs. or rss feed.

Cloud

Cloud Internet Internet AWS

Percona Monitoring and Management 2 Scaling and Capacity Planning

Percona

MARCH 17, 2023

These updates are designed to keep databases running at peak performance and simplify database operations. PMM2 uses VictoriaMetrics (VM) as its metrics storage engine. VictoriaMetrics maintains an in-memory cache for mapping active time series into internal series IDs. Virtual Memory utilization was averaging 48 GB of RAM.

Monitoring

Monitoring Scalability Database Cache

How To Optimize Progressive Web Apps: Going Beyond The Basics

Smashing Magazine

DECEMBER 23, 2020

The service workers enable the offline usage of the PWA by fetching cached data or informing the user about the absence of an Internet connection. When developing a PWA, you can cache the application shell’s resources and assets in the browser. Cached content with IndexedDB. Cache first, then network. Service Workers.

Cache

Cache Internet Internet Google

The Amazon.com 2010 Shareholder Letter Focusses on Technology.

All Things Distributed

APRIL 27, 2011

We use high-performance transactions systems, complex rendering and object caching, workflow and queuing systems, business intelligence and data analytics, machine learning and pattern recognition, neural networks and probabilistic decision making, and a wide variety of other techniques. Driving Storage Costs Down for AWS Customers.

Technology

Technology Technology AWS Storage

Hierarchical Navigation and Faceted Search on Top of Oracle Coherence

Highly Scalable

APRIL 2, 2012

Some time ago I participated in design of a backend for one large online retailer company. In particular, we built this system on top of Oracle Coherence and designed our own data structures and indexes. In particular, we built this system on top of Oracle Coherence and designed our own data structures and indexes.

Ecommerce

Ecommerce Cache Storage Architecture

Amazon DynamoDB Accelerator (DAX): Speed Up DynamoDB Response Times from Milliseconds to Microseconds without Application Rewrite.

All Things Distributed

JUNE 21, 2017

Today, I'm excited to announce the general availability of Amazon DynamoDB Accelerator (DAX) , a fully managed, highly available, in-memory cache that can speed up DynamoDB response times from milliseconds to microseconds, even at millions of requests per second. DynamoDB was the first service at AWS to use SSD storage.

Speed

Speed Cache Latency AWS

20X Faster Backup Preparation With Percona XtraBackup 8.0.33-28!

Percona

JULY 25, 2023

After the “data dictionary” (DD) engine and DD cache are initialized on a server, the Storage Engines can ask for a table definition. Initializing a DD engine and the cache adds complexity and other server dependencies. Old design (until Percona XtraBackup 8.0.33-27): ibd2sdi data/test/t1.ibd ibd > t1.sdi

Cache

Cache Servers Benchmarking Design

Key Advantages of DBMS for Efficient Data Management

Scalegrid

JANUARY 5, 2024

It was initially designed to mitigate the limitations of file management systems, including slow operations, inadequate security, and substantial data redundancy. It comprises a collection of interrelated data and a set of software tools that aid in the access, processing, and management of data.

Efficiency

Efficiency Storage Database Scalability

PostgreSQL Upgrade: Tricks With OID Columns and Extensions

Percona

APRIL 3, 2023

Percona Distribution for PostgreSQL provides the best and most critical enterprise components from the open-source community in a single distribution, designed and tested to work together. Upgrade Complete - Optimizer statistics are not transferred by pg_upgrade so, once you start the new server, consider running: /analyze_new_cluster.sh

C++

C++ Database Cache Storage

MICRO 2019 Trip Report

ACM Sigarch

NOVEMBER 4, 2019

With Moore’s Law becoming irrelevant, Asanovic made a strong case for the new vertical semiconductor business model where custom chip designs are needed for vertically integrated markets. In particular, she highlighted her transformative MIT’78 VLSI System Design Course she designed and taught as a Visiting Professor of EECS at MIT.

Hardware

Hardware Architecture Programming Innovation

AppFabric Caching: Retry Later

ScaleOut Software

MAY 15, 2014

For example, the IMDG must be able to efficiently create millions of objects in each server to make use of its huge storage capacity. Given all this, we thought it would be a good opportunity to see how we are doing relative to the competition, and in particular, relative to Microsoft’s AppFabric caching for Windows on-premise servers.

Cache

Cache Servers Network Design

Choosing a cloud DBMS: architectures and tradeoffs

The Morning Paper

AUGUST 29, 2019

The design space. We group the DBMS design choices and tradeoffs into three broad categories, which result from the need for dealing with (A) external storage; (B) query executors that are spun on demand; and (C) DBMS-as-a-service offerings. Query performance is measured from both warm and cold caches. Key findings.

Architecture

Architecture Cloud Storage Serverless

Optimizing Next.js Applications With Nx

Smashing Magazine

OCTOBER 26, 2021

Nx is an open-source build framework that helps you architect, test, and build at any scale — integrating seamlessly with modern technologies and libraries, while providing a robust command-line interface (CLI), caching, and dependency management. Nx uses distributed graph-based task execution and computation caching to speed up tasks.

Cache

Cache Servers Code Testing

Solving Common Cross-Platform Issues When Working With Flutter

Smashing Magazine

JUNE 18, 2020

More specifically, we’re going to talk about storage and UI differences, which are the ones that most often cause confusion to developers when writing Flutter code that they want to be cross-platform. Example 1: Storage. Secure Storage On Mobile. The situation when it comes to mobile apps is completely different.

Storage

Storage Mobile Website Java

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Brendan Gregg

FEBRUARY 28, 2023

Titus, the Netflix container management platform, is now open source,” [link] Apr 2018 - [Cutress 19] Dr. DDR6: Here's What to Expect in RAM Modules,” [link] Nov 2020 - [Salter 20] Jim Salter, “Western Digital releases new 18TB, 20TB EAMR drives,” [link] Jul 2020 - [Spier 20] Martin Spier, Brendan Gregg, et al.,

Performance

Performance Latency Cache Virtualization

A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

All Things Distributed

OCTOBER 2, 2017

With these requirements in mind, and a willingness to question the status quo, a small group of distributed systems experts came together and designed a horizontally scalable distributed database that would scale out for both reads and writes to meet the long-term needs of our business. This was the genesis of the Amazon Dynamo database.

Internet

Internet Internet AWS Performance

Designing Instagram

Netflix’s Distributed Counter Abstraction

Trending Sources

How To Design For High-Traffic Events And Prevent Your Website From Crashing

Consistent caching mechanism in Titus Gateway

MezzFS?—?Mounting object storage in Netflix’s media processing platform

Introducing Netflix’s Key-Value Data Abstraction Layer

AWS serverless services: Exploring your options

Netflix Cloud Packaging in the Terabyte Era

Building an elastic query engine on disaggregated storage

What is a Distributed Storage System

Dynatrace Kubernetes Observability for Persistent Volume Claims

Geek Reading - Week of June 5, 2013

Introducing Netflix TimeSeries Data Abstraction Layer

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Elasticsearch Indexing Strategy in Asset Management Platform (AMP)

Helping VFX studios pave a path to the cloud

What is session replay? Discover user pain points with session recordings

Pushy to the Limit: Evolving Netflix’s WebSocket proxy for the future

Redis vs Memcached in 2024

Data ingestion pipeline with Operation Management

Expanding the Cloud - Introducing Amazon ElastiCache - All Things.

Implementing AWS well-architected pillars with automated workflows

Cloudburst: stateful functions-as-a-service

Amazon DynamoDB ? a Fast and Scalable NoSQL Database.

File systems unfit as distributed storage backends: lessons from ten years of Ceph evolution

Crucial Redis Monitoring Metrics You Must Watch

No Server Required - Jekyll & Amazon S3 - All Things Distributed

WiredTiger Logging and Checkpoint Mechanism

Expanding the Cloud with DNS - Introducing Amazon Route 53 - All.

Percona Monitoring and Management 2 Scaling and Capacity Planning

How To Optimize Progressive Web Apps: Going Beyond The Basics

The Amazon.com 2010 Shareholder Letter Focusses on Technology.

Hierarchical Navigation and Faceted Search on Top of Oracle Coherence

Amazon DynamoDB Accelerator (DAX): Speed Up DynamoDB Response Times from Milliseconds to Microseconds without Application Rewrite.

20X Faster Backup Preparation With Percona XtraBackup 8.0.33-28!

Key Advantages of DBMS for Efficient Data Management

PostgreSQL Upgrade: Tricks With OID Columns and Extensions

MICRO 2019 Trip Report

AppFabric Caching: Retry Later

Choosing a cloud DBMS: architectures and tradeoffs

Optimizing Next.js Applications With Nx

Solving Common Cross-Platform Issues When Working With Flutter

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

A Decade of Dynamo: Powering the next wave of high-performance, internet-scale applications

Stay Connected