This decoupling simplifies system architecture and supports scalability in distributed environments. Message brokers handle validation, routing, storage, and delivery, ensuring efficient and reliable communication. Scalability and Redundancy: Both Kafka and RabbitMQ are built for scalability and redundancy but take different approaches.
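To make the decoupling concrete, here is a minimal producer/consumer sketch against RabbitMQ using the pika Python client; the queue name and message payload are illustrative, not taken from the article.

```python
# Minimal sketch: RabbitMQ decouples this producer from its consumers.
# Queue name and payload are illustrative. Requires: pip install pika
import pika

connection = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
channel = connection.channel()
channel.queue_declare(queue="events", durable=True)  # broker stores messages

# Producer side: publish and move on; no consumer needs to be online.
channel.basic_publish(
    exchange="",
    routing_key="events",
    body=b"order-created:42",
    properties=pika.BasicProperties(delivery_mode=2),  # persist to disk
)

# Consumer side: the broker handles routing, storage, and delivery.
def handle(ch, method, properties, body):
    print("received", body)
    ch.basic_ack(delivery_tag=method.delivery_tag)

channel.basic_consume(queue="events", on_message_callback=handle)
channel.start_consuming()  # blocks; run producer and consumer separately
```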
The complexity of these operational demands underscored the urgent need for a scalable solution. Additionally, the time-sensitive nature of these investigations precludes the use of cold storage, which cannot meet the stringent SLAs required. Stay tuned for a closer look at the innovation behind the scenes!
At this scale, we can gain a significant amount of performance and cost benefits by optimizing the storage layout (records, objects, partitions) as the data lands into our warehouse. We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits.
An open-source distributed SQL query engine, Trino is widely used for data analytics on distributed data storage. Optimizing Trino to make it faster can help organizations achieve quicker insights and better user experiences, as well as cut costs and improve infrastructure efficiency and scalability. But how do we do that?
Training: We built easy-to-use feedback mechanisms with a fully integrated fine-tuning loop that lets end users effectively teach LORE new domains and the questions around them. LORE also provides end users with a confidence score based on its grounding in the domain space.
After selecting a mode, users can interact with APIs without needing to worry about the underlying storage mechanisms and counting methods. Let’s examine some of the drawbacks of this approach: Lack of Idempotency: There is no idempotency key baked into the storage data model, which prevents users from safely retrying requests.
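As a hedged sketch of why that matters, the following Python snippet records an idempotency key alongside each applied counter update so a retried request is detected and skipped; all names here are hypothetical, not the API described in the excerpt.

```python
# Hypothetical sketch: an idempotency key recorded with each applied
# update lets a client retry the same request without double counting.
import uuid

applied: dict[str, int] = {}   # idempotency_key -> delta already applied
counters: dict[str, int] = {}  # counter name -> current value

def increment(name: str, delta: int, idempotency_key: str) -> None:
    if idempotency_key in applied:
        return  # retry of an already-applied request: safely ignored
    counters[name] = counters.get(name, 0) + delta
    applied[idempotency_key] = delta

key = str(uuid.uuid4())        # client generates one key per request
increment("views", 1, key)
increment("views", 1, key)     # network retry with the same key
assert counters["views"] == 1  # no double count
```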
Data storage and distribution through Hollow Feeds: Netflix Hollow is an open-source Java library and toolset for disseminating in-memory datasets from a single producer to many consumers for high-performance read-only access. Conclusion: Throughout this series, we've explored the journey of enhancing title launch observability at Netflix.
The Key-Value Abstraction offers a flexible, scalable solution for storing and accessing structured key-value data, while the Data Gateway Platform provides essential infrastructure for protecting, configuring, and deploying the data tier.
Central to this infrastructure is our use of multiple online distributed databases such as Apache Cassandra , a NoSQL database known for its high availability and scalability. This flexibility allows our Data Platform to route different use cases to the most suitable storage system based on performance, durability, and consistency needs.
Key Takeaways RabbitMQ improves scalability and fault tolerance in distributed systems by decoupling applications, enabling reliable message exchanges. This decoupling is crucial in modern architectures where scalability and fault tolerance are paramount.
Compare ease of use across compatibility, extensions, tuning, operating systems, languages, and support providers. Scalability: PostgreSQL offers scalability at no licensing cost and can scale up to millions of transactions per second. Oracle Enterprise is recommended for highly scalable, high-workload deployments, but it is costly.
Our distributed tracing infrastructure is grouped into three sections: tracer library instrumentation, stream processing, and storage. An additional implication of a lenient sampling policy is the need for scalable stream processing and storage infrastructure fleets to handle increased data volume. Storage: don’t break the bank!
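As a rough illustration, a head-based sampling decision can be made deterministically from the trace ID, as in this sketch; the 10% rate and the modulo scheme are assumptions for illustration, not the tracer library's actual policy.

```python
# Illustrative head-based sampling: decide from the trace id so every
# span in a trace gets the same verdict, whichever service emits it.
SAMPLE_RATE = 0.10  # a lenient (high) rate multiplies downstream volume

def should_sample(trace_id: int, rate: float = SAMPLE_RATE) -> bool:
    return (trace_id % 10_000) < rate * 10_000

kept = sum(should_sample(t) for t in range(100_000))
print(f"kept {kept} of 100000 traces")  # ~10% with this scheme
```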
Migrating Critical Traffic At Scale with No Downtime — Part 1 Shyam Gala , Javier Fernandez-Ivern , Anup Rokkam Pratap , Devang Shah Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This technique facilitates validation on multiple fronts.
“Logs magnify these issues by far due to their volatile structure, the massive storage needed to process them, and due to potential gold hidden in their content,” Pawlowski said, highlighting the importance of log analysis. Business leaders can decide which logs they want to use and tune storage to their data needs.
As the paved path for moving data to key-value stores, Bulldozer provides a scalable and efficient no-code solution. The KV DAL allows applications to use a well-defined and storage engine agnostic HTTP/gRPC key-value data interface that in turn decouples applications from hard to maintain and backwards-incompatible datastore APIs.
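A minimal in-process sketch of what a storage-engine-agnostic key-value interface can look like, assuming hypothetical method names (the real KV DAL exposes HTTP/gRPC, which this only approximates):

```python
# Hypothetical in-process analogue of a storage-engine-agnostic
# key-value interface; method names are assumptions for illustration.
from abc import ABC, abstractmethod

class KeyValueStore(ABC):
    """Applications code against this, never a datastore's native API."""

    @abstractmethod
    def put(self, namespace: str, key: bytes, value: bytes) -> None: ...

    @abstractmethod
    def get(self, namespace: str, key: bytes) -> bytes | None: ...

class InMemoryStore(KeyValueStore):
    """One pluggable backend; a Cassandra-backed one could be swapped in."""

    def __init__(self) -> None:
        self._data: dict[tuple[str, bytes], bytes] = {}

    def put(self, namespace: str, key: bytes, value: bytes) -> None:
        self._data[(namespace, key)] = value

    def get(self, namespace: str, key: bytes) -> bytes | None:
        return self._data.get((namespace, key))

store: KeyValueStore = InMemoryStore()
store.put("titles", b"tt0111161", b'{"name": "example"}')
print(store.get("titles", b"tt0111161"))
```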
Managing storage and performance efficiently in your MySQL database is crucial, and general tablespaces offer flexibility in achieving this. In contrast to the single system tablespace that holds system tables by default, general tablespaces are user-defined storage containers for multiple InnoDB tables.
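As a hedged example, creating a general tablespace and placing two InnoDB tables in it might look like this from Python; the connection parameters, tablespace name, and the demo schema are assumptions.

```python
# Hedged sketch: a general tablespace shared by two InnoDB tables.
# Credentials, names, and the existing `demo` schema are assumptions.
# Requires: pip install mysql-connector-python
import mysql.connector

conn = mysql.connector.connect(host="localhost", user="root", password="...")
cur = conn.cursor()

# A user-defined container that can hold multiple InnoDB tables.
cur.execute("CREATE TABLESPACE app_ts ADD DATAFILE 'app_ts.ibd' ENGINE=InnoDB")

# Two tables placed in the same general tablespace.
cur.execute("CREATE TABLE demo.orders (id INT PRIMARY KEY) TABLESPACE app_ts")
cur.execute("CREATE TABLE demo.items (id INT PRIMARY KEY) TABLESPACE app_ts")
conn.close()
```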
AWS offers a broad set of global, cloud-based services including computing, storage, networking, Internet of Things (IoT), and many others. You can use these services in combinations that are tailored to help your business move faster, lower IT costs, and support scalability. Amazon Simple Storage Service (S3). Amazon Redshift.
You’re no longer required to use a single offering or choose from a few instance families; Graviton includes general-purpose and accelerated-computing offerings, plus compute-, memory-, and storage-optimized instances.
For busy site reliability engineers, ensuring system reliability, scalability, and overall health is an imperative that’s getting harder to achieve in ever-expanding, cloud-native, container-based environments. But often, we use additional services and solutions within our environment for backups, storage, networking, and more.
Many AWS services and third party solutions use AWS S3 for log storage. Centralized log management for scalable ingestion into Grail As AWS S3 proves to be the preferred way of storing cloud logs, enterprise customers face mounting challenges in putting S3 log data to use. Or explore the recently introduced support for AWS Lambda logs.
At ScaleGrid, we’re always pushing the boundaries to offer more flexibility and scalability to our customers. Customer-Requested Features: We’re always listening to your feedback, so we added the ability to access additional storage without upgrading to a larger plan. Stay tuned for more exciting updates in the months to come!
Tuning: Two parameters can be tuned: the size of the bitmap and the number of bits set by every value. LSM storage engines like MyRocks are very different from the more common B-Tree-based storage engines like InnoDB.
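To make the two knobs concrete, here is a small Bloom-filter sketch where m is the bitmap size and k is the number of bits set per value; the SHA-256-based hashing is an arbitrary illustrative choice, not MyRocks' implementation.

```python
# Illustrative Bloom filter exposing the two tuning knobs from the text:
# m (bitmap size in bits) and k (bits set per value).
import hashlib

class BloomFilter:
    def __init__(self, m: int, k: int) -> None:
        self.m, self.k = m, k
        self.bits = bytearray(m // 8 + 1)

    def _positions(self, value: bytes):
        for i in range(self.k):
            digest = hashlib.sha256(i.to_bytes(4, "big") + value).digest()
            yield int.from_bytes(digest[:8], "big") % self.m

    def add(self, value: bytes) -> None:
        for p in self._positions(value):
            self.bits[p // 8] |= 1 << (p % 8)

    def might_contain(self, value: bytes) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8))
                   for p in self._positions(value))

bf = BloomFilter(m=8192, k=6)       # bigger m -> fewer false positives
bf.add(b"row:42")
assert bf.might_contain(b"row:42")  # Bloom filters have no false negatives
print(bf.might_contain(b"row:99"))  # usually False; may be a false positive
```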
This talk will delve into the creative solutions Netflix deploys to manage this high-volume, real-time data requirement while balancing scalability and cost. If you are interested in attending a future Data Engineering Open Forum, we highly recommend you join our Google Group to stay tuned to event announcements. Until next time!
In addition, we were able to perform a handful of A/B tests to validate or negate our hypotheses for tuning the search experience. The primary searcher used in the current implementation is called Marken, a scalable annotation service built at Netflix. This service leverages Cassandra and Elasticsearch for data storage and retrieval.
Storage: The type of storage and disk used for database servers can have a significant impact on performance and reliability. Cloud: Different cloud providers offer a range of instance types and sizes, each with varying amounts of CPU, memory, and storage. If you see concurrency issues, you can tune this variable.
Werner Vogels' weblog on building scalable and robust distributed systems. They contain large amounts of locally attached storage on multiple spindles and are connected by a minimally oversubscribed 10 Gigabit Ethernet network. Until now, these levels of performance and scalability were prohibitively expensive.
In order to train the model on internal training data (video clips with aligned text descriptions), we implemented a scalable version on Ray Train and switched to a more performant video decoding library. We also found that extending contrastive learning to videos and text provided a substantial improvement over frame-level models.
One of the promises of container orchestration platforms is to make it easier for developers to accelerate the deployment of their applications without having to worry about scalability and infrastructure dependencies. How do you find the right quota, and what should be used as a CPU or memory request and limit?
Werner Vogels' weblog on building scalable and robust distributed systems. DynamoDB's fast and easy scalability can be quickly applied to building high-scale applications. Indexed Storage costs: We are lowering the price of indexed storage by 75%. All Things Distributed. DynamoDB One Year Later: Bigger, Better, and 85% Cheaper…
The results will help database administrators and decision-makers choose the right platform for their performance, scalability, and cost-efficiency needs. Introduction Purpose and Scope: Cloud-hosted PostgreSQL solutions are increasingly popular among organizations seeking scalable, high-performance databases.
Werner Vogels' weblog on building scalable and robust distributed systems. We see that with our Amazon customers; when they hear a great tune on the radio, they may identify it using the Shazam or SoundHound apps on their mobile phone and buy that song instantly from the Amazon MP3 store. Driving Storage Costs Down for AWS Customers.
The best part is that we are also significantly expanding the free tier many of you already enjoy by increasing the storage to 25 GB and throughput to 200 million requests per month. More than a decade ago, Amazon embarked on a mission to build a distributed system that challenged conventional methods of data storage and querying.
The watermarking functionality, at the start, was a simple offering with various Google Drive integrations for storage and links. We wanted a scalable service that was near real-time. Our team was responsible for Google integrations, watermarking, bursty traffic management, and on-call support for this application.
As VMAF evolves and is integrated with more encoding and streaming workflows within Netflix, we need scalable ways of fostering video quality innovations. The Reloaded system is a well-matured and scalable system, but its monolithic architecture can slow down rapid innovation.
This is not a general rule, but as databases are responsible for a core layer of any IT system, data storage and processing, they require reliability. Stay tuned for more news about MongoDB offerings. Databases are different from a lot of software. For one, they often favor stability over innovation.
While there is no magic bullet for MySQL performance tuning, there are a few areas that can be focused on upfront that can dramatically improve the performance of your MySQL installation. What are the Benefits of MySQL Performance Tuning? A finely tuned database processes queries more efficiently, leading to swifter results.
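One such upfront area is checking how individual queries execute. A hedged sketch using EXPLAIN from Python follows; the schema, table, and credentials are placeholders.

```python
# Hedged sketch: EXPLAIN is a common first step when tuning MySQL.
# Schema, table, and credentials are placeholders.
import mysql.connector

conn = mysql.connector.connect(host="localhost", user="root",
                               password="...", database="demo")
cur = conn.cursor()
cur.execute("EXPLAIN SELECT * FROM orders WHERE customer_id = 42")
for row in cur.fetchall():
    # type=ALL with key=NULL signals a full table scan; an index on
    # customer_id would let this query read far fewer rows.
    print(row)
conn.close()
```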
We were pushing the limits of what was a leading commercial database at the time and were unable to sustain the availability, scalability and performance needs that our growing Amazon business demanded. We had an advanced team of database administrators and access to top experts within Oracle. million requests per second.
The Amazon ML console and API provide data and model visualization tools, as well as wizards to guide you through the process of creating machine learning models, measuring their quality and fine-tuning the predictions to match your application requirements. Details on the AWS Blog. The Amazon Elastic File System. for a while already.
Acquiring shared access requires that only the local partition be acquired (lightweight scalability). I recall that when we were tuning the sp_reset_connection command (which releases the database lock and acquires it again), we tested rates in excess of 250,000/sec to ensure the partitioned database lock scaled: [link].
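A minimal sketch of the idea, assuming a simplified in-process model rather than SQL Server's actual lock manager: shared access grabs one local partition, while exclusive access must sweep all of them.

```python
# Simplified in-process model of a partitioned lock: shared access
# touches one local partition, exclusive access must sweep them all.
import threading

class PartitionedLock:
    def __init__(self, partitions: int = 16) -> None:
        self._locks = [threading.Lock() for _ in range(partitions)]

    def acquire_shared(self, worker_id: int) -> threading.Lock:
        # Cheap: one (usually uncontended) partition per worker.
        lock = self._locks[worker_id % len(self._locks)]
        lock.acquire()
        return lock

    def acquire_exclusive(self) -> None:
        # Expensive: every partition, blocking all shared acquirers.
        for lock in self._locks:
            lock.acquire()

    def release_exclusive(self) -> None:
        for lock in self._locks:
            lock.release()

plock = PartitionedLock()
held = plock.acquire_shared(worker_id=7)  # the cheap, scalable path
held.release()
```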
Out of the box, the default PostgreSQL configuration is not tuned for any particular workload. It is primarily the responsibility of the database administrator or developer to tune PostgreSQL according to their system’s workload. What is PostgreSQL performance tuning? Why is PostgreSQL performance tuning important?
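As a hedged example of workload-specific tuning, the sketch below inspects a few common settings and persists a new work_mem via ALTER SYSTEM; the values are placeholders to size against your own workload, not recommendations from the article.

```python
# Hedged sketch: inspect a few common knobs, then persist a new value.
# Settings and values are placeholders, not recommendations.
# Requires: pip install psycopg2-binary
import psycopg2

conn = psycopg2.connect("dbname=postgres user=postgres")
conn.autocommit = True  # ALTER SYSTEM cannot run inside a transaction
cur = conn.cursor()

for name in ("shared_buffers", "work_mem", "effective_cache_size"):
    cur.execute(f"SHOW {name}")
    print(name, "=", cur.fetchone()[0])

# Writes to postgresql.auto.conf; work_mem takes effect after a reload,
# while settings like shared_buffers need a full server restart.
cur.execute("ALTER SYSTEM SET work_mem = '64MB'")
cur.execute("SELECT pg_reload_conf()")
conn.close()
```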
Flexibility and scalability Open source databases provide much greater flexibility regarding customization and configuration. Are you looking to enhance performance, improve scalability, cut expenses, or gain access to specific features you don’t currently have? Start by identifying the reasons driving the migration.
The goal is to collaboratively develop tools and programs facilitating open development and run scalable and distributed training jobs for popular frameworks such as PyTorch, TensorFlow, MPI, MXNet, PaddlePaddle, and XGBoost. This fully automated scaling and tuning will enable a serverless-like experience in our Operators and Everest.
Though the AWS Cloud gives you access to the storage and processing power required for ML, the process for building, training, and deploying ML models has unique challenges that often block successful use of this powerful new technology. An elastic, secure, and scalable environment to host your models, with one-click deployment.