Architecture and Storage - Technology Performance Pulse

Dynatrace elevates data security with separated storage and unique encryption keys for each tenant

Dynatrace

NOVEMBER 29, 2024

Enhancing data separation by partitioning each customer’s data on the storage level and encrypting it with a unique encryption key adds an additional layer of protection against unauthorized data access. A unique encryption key is applied to each tenant’s storage and automatically rotated every 365 days.

Storage

Storage AWS Azure Architecture
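A minimal sketch of the per-tenant key idea described in the teaser above, assuming an in-memory key store. It is not Dynatrace's implementation: `TenantKeyStore`, `TenantKey`, and `ROTATION_PERIOD` are hypothetical names, and only the "one key per tenant, rotated every 365 days" behavior comes from the text.

```python
import secrets
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Rotation interval taken from the teaser; everything else is illustrative.
ROTATION_PERIOD = timedelta(days=365)


@dataclass
class TenantKey:
    key: bytes            # 256-bit random key material
    created_at: datetime  # used to decide when rotation is due


class TenantKeyStore:
    """Hypothetical store that hands out one key per tenant."""

    def __init__(self):
        self._keys = {}

    def get_key(self, tenant_id, now=None):
        now = now or datetime.now(timezone.utc)
        current = self._keys.get(tenant_id)
        # Create a key on first use, or replace it once it is 365 days old.
        if current is None or now - current.created_at >= ROTATION_PERIOD:
            current = TenantKey(secrets.token_bytes(32), now)
            self._keys[tenant_id] = current
        return current
```

A production system would typically also retain or re-wrap older keys so that previously written data stays readable after rotation; that part is omitted here.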

Ready for changes with Hexagonal Architecture

The Netflix TechBlog

MARCH 10, 2020

Leveraging Hexagonal Architecture: We needed to support the ability to swap data sources without impacting business logic, so we knew we needed to keep them decoupled. We decided to build our app based on principles behind Hexagonal Architecture and Uncle Bob’s Clean Architecture. Entities are the domain objects (e.g.,

Architecture

Architecture Transportation Java Strategy
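The decoupling described in the Hexagonal Architecture teaser above can be sketched roughly as a port with interchangeable adapters. This is an illustrative sketch, not the Netflix codebase: `Movie`, `MovieRepository`, both adapter classes, and `display_title` are hypothetical names chosen for the example.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Protocol


@dataclass(frozen=True)
class Movie:
    """Domain entity (hypothetical example)."""
    movie_id: str
    title: str


class MovieRepository(Protocol):
    """Port: the only data-access contract the business logic sees."""

    def find_by_id(self, movie_id: str) -> Optional[Movie]: ...


class InMemoryMovieRepository:
    """Adapter backed by a local dictionary."""

    def __init__(self, movies: List[Movie]):
        self._movies = {m.movie_id: m for m in movies}

    def find_by_id(self, movie_id: str) -> Optional[Movie]:
        return self._movies.get(movie_id)


class JsonPayloadMovieRepository:
    """Adapter backed by rows shaped like a remote API response."""

    def __init__(self, payload: List[Dict[str, str]]):
        self._payload = payload

    def find_by_id(self, movie_id: str) -> Optional[Movie]:
        for row in self._payload:
            if row["id"] == movie_id:
                # Translate the external shape into the domain entity.
                return Movie(movie_id=row["id"], title=row["name"])
        return None


def display_title(repo: MovieRepository, movie_id: str) -> str:
    """Business logic: depends on the port, never on a concrete source."""
    movie = repo.find_by_id(movie_id)
    return movie.title.upper() if movie else "UNKNOWN"
```

Swapping the data source means passing a different adapter to `display_title`; the function itself does not change.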

Why and How We Built a Primary-Replica Architecture of ClickHouse

DZone

AUGUST 13, 2024

But this also caused storage challenges like disk failures and data recovery. We innovatively use its snapshot feature to implement a primary-replica architecture for ClickHouse. This architecture ensures high availability and stability of the data while significantly enhancing system performance and data recovery capabilities.

Artificial Intelligence

Artificial Intelligence Architecture AWS Storage

Efficient Multimodal Data Processing: A Technical Deep Dive

DZone

FEBRUARY 27, 2025

Handling multimodal data spanning text, images, videos, and sensor inputs requires resilient architecture to manage the diversity of formats and scale.

Efficiency

Efficiency Processing Latency Storage

Dynatrace log collection for ARM unlocks power-efficient architecture for your enterprise

Dynatrace

DECEMBER 5, 2023

Without observability, the benefits of ARM are lost. Over the last decade and a half, a new wave of computer architecture has overtaken the world. ARM architecture, based on a processor type optimized for cloud and hyperscale computing, has become the most prevalent on the planet, with billions of ARM devices currently in use.

Efficiency

Efficiency Architecture Energy Monitoring

Optimizing data warehouse storage

The Netflix TechBlog

DECEMBER 21, 2020

At this scale, we can gain a significant amount of performance and cost benefits by optimizing the storage layout (records, objects, partitions) as the data lands into our warehouse. We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits.

Storage

Storage Latency Efficiency Data Engineering

Medallion Architecture: Efficient Batch and Stream Processing Data Pipelines With Azure Databricks and Delta Lake

DZone

JULY 13, 2023

Medallion Architecture provides a framework for organizing data processing workflows into different zones, enabling optimized batch and stream processing. This article explores the concepts of Medallion Architecture and demonstrates how to implement batch and stream processing pipelines using Azure Databricks and Delta Lake.

Azure

Azure Architecture Efficiency Processing

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

This article outlines the key differences in architecture, performance, and use cases to help determine the best fit for your workload. RabbitMQ follows a message broker model with advanced routing, while Kafka's event streaming architecture uses partitioned logs for distributed processing. What is RabbitMQ? What is Apache Kafka?

Latency

Latency Analytics Architecture Storage

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

In this blog post, we explain what Greenplum is, and break down the Greenplum architecture, advantages, major use cases, and how to get started. Its architecture was specially designed to manage large-scale data warehouses and business intelligence workloads by giving you the ability to spread your data out across a multitude of servers.

Big Data

Big Data Database Artificial Intelligence Open Source

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

Architecture Overview: The first pivotal step in managing impressions begins with the creation of a Source-of-Truth (SOT) dataset. The enriched data is seamlessly accessible for both real-time applications via Kafka and historical analysis through storage in an Apache Iceberg table.

Tuning

Tuning Latency Efficiency Analytics

AWS serverless services: Exploring your options

Dynatrace

OCTOBER 7, 2021

To get a better understanding of AWS serverless, we’ll first explore the basics of serverless architectures, review AWS serverless offerings, and explore common use cases. Serverless architecture: A primer. Serverless architecture shifts application hosting functions away from local servers onto those managed by providers.

Serverless

Serverless AWS Lambda Storage

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

Part 3: System Strategies and Architecture. By: Varun Khaitan. With special thanks to my stunning colleagues: Mallika Rao, Esmir Mesic, Hugo Marques. This blog post is a continuation of Part 2, where we cleared the ambiguity around title launch observability at Netflix. The response schema for the observability endpoint.

Traffic

Traffic Strategy Entertainment Innovation

The history of Grail: Why you need a data lakehouse

Dynatrace

OCTOBER 4, 2022

Grail architectural basics. The aforementioned principles have, of course, a major impact on the overall architecture. A data lakehouse addresses these limitations and introduces an entirely new architectural design. This decoupling ensures the openness of data and storage formats, while also preserving data in context.

Artificial Intelligence

Artificial Intelligence Analytics Storage Architecture

What is a Distributed Storage System

Scalegrid

FEBRUARY 8, 2024

A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. Understanding distributed storage is imperative as data volumes and the need for robust storage solutions rise.

Storage

Storage Systems Big Data Azure

Time Series Analysis: VAR-Model-As-A-Service Using Flask and MinIO

DZone

SEPTEMBER 25, 2023

It is the second of a series of articles that is built on top of that project, representing experiments with various statistical and machine learning models, data pipelines implemented using existing DAG tools, and storage services, both cloud-based and alternative on-premises solutions.

Storage

Storage AWS Architecture Cloud

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

After selecting a mode, users can interact with APIs without needing to worry about the underlying storage mechanisms and counting methods. Let’s examine some of the drawbacks of this approach: Lack of Idempotency: There is no idempotency key baked into the storage data-model, preventing users from safely retrying requests.

Latency

Latency Cache Infrastructure Strategy
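To illustrate why the missing idempotency key noted in the teaser above matters, here is a minimal in-memory sketch of a counter that deduplicates retries. It is not Netflix's Distributed Counter; `IdempotentCounter` and its `add` method are hypothetical names used only for this example.

```python
class IdempotentCounter:
    """Hypothetical counter that ignores repeated requests with the same key."""

    def __init__(self):
        self._value = 0
        self._seen_keys = set()

    def add(self, delta, idempotency_key):
        # A retried request carries the same key, so it is applied only once.
        if idempotency_key in self._seen_keys:
            return self._value
        self._seen_keys.add(idempotency_key)
        self._value += delta
        return self._value


counter = IdempotentCounter()
counter.add(5, idempotency_key="req-1")
counter.add(5, idempotency_key="req-1")  # client retry after a timeout
counter.add(2, idempotency_key="req-2")
print(counter.add(0, idempotency_key="req-3"))  # prints 7
```

Without the key, the retried request would be indistinguishable from a new one and the count would be inflated to 12.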

Battle of the RabbitMQ Queues: Performance Insights on Classic and Quorum

DZone

SEPTEMBER 19, 2024

RabbitMQ is a powerful and widely used message broker that facilitates communication between distributed applications by handling the transmission, storage, and delivery of messages. Queues play a critical role in RabbitMQ’s architecture, enabling asynchronous communication and decoupling the producers and consumers.

Storage

Storage Performance Scalability Architecture

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Dynatrace

OCTOBER 4, 2022

While data lakes and data warehousing architectures are commonly used modes for storing and analyzing data, a data lakehouse is an efficient third way to store and analyze data that unifies the two architectures while preserving the benefits of both. Unlike data warehouses, however, data is not transformed before landing in storage.

Artificial Intelligence

Artificial Intelligence Storage Analytics Government

Netflix Cloud Packaging in the Terabyte Era

The Netflix TechBlog

SEPTEMBER 24, 2021

[Table 1: Movie and File Size Examples] Initial Architecture: A simplified view of our initial cloud video processing pipeline is illustrated in the following diagram. [Figure 1: A Simplified Video Processing Pipeline] With this architecture, chunk encoding is very efficient and processed in distributed cloud computing instances.

Cloud

Cloud Media Storage Cache

Any analysis, any time: Dynatrace Log Management and Analytics powered by Grail

Dynatrace

OCTOBER 4, 2022

Modern IT environments — whether multicloud, on-premises, or hybrid-cloud architectures — generate exponentially increasing data volumes. Teams have introduced workarounds to reduce storage costs. Stop worrying about log data ingest and storage — start creating value instead. Limited data availability constrains value creation.

Analytics

Analytics Artificial Intelligence Storage Serverless

What is function as a service? App development gets FaaS and furious

Dynatrace

AUGUST 11, 2022

FaaS vs. monolithic architectures. Monolithic architectures were commonplace with legacy, on-premises software solutions. Infrastructure as a service (IaaS) handles compute, storage, and network resources. Because a third party manages part of the infrastructure, IT teams give up a measure of control over system architecture.

Development

Development Serverless Best Practices Lambda

Improved Alerting with Atlas Streaming Eval

The Netflix TechBlog

APRIL 27, 2023

While Atlas is architected around compute & storage separation, and we could theoretically just scale the query layer to meet the increased query demand, every query, regardless of its type, has a data component that needs to be pushed down to the storage layer.

Storage

Storage Cache Metrics Database

Designing Instagram

High Scalability

JANUARY 11, 2022

Architecture. Firstly, the synchronous process which is responsible for uploading image content on file storage, persisting the media metadata in graph data-storage, returning the confirmation message to the user and triggering the process to update the user activity. Sending and receiving messages from other users.

Design

Design Media Storage Logistics

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

In this post, we dive deep into how Netflix’s KV abstraction works, the architectural principles guiding its design, the challenges we faced in scaling diverse use cases, and the technical innovations that have allowed us to achieve the performance and reliability required by Netflix’s global operations.

Latency

Latency Storage Cache Efficiency

What is hyperconverged infrastructure? Realizing the benefits of HCI

Dynatrace

NOVEMBER 11, 2022

Therefore, they need an environment that offers scalable computing, storage, and networking. Hyperconverged infrastructure (HCI) is an IT architecture that combines servers, storage, and networking functions into a unified, software-centric platform to streamline resource management. What is hyperconverged infrastructure?

Infrastructure

Infrastructure Storage Virtualization Network

IT automation central to navigating cloud complexity and data explosion

Dynatrace

SEPTEMBER 23, 2022

Organizations continue to turn to multicloud architecture to deliver better, more secure software faster. But IT teams need to embrace IT automation and new data storage models to benefit from modern clouds. Moreover, IT pros say that cloud architecture and data repositories thwart achieving better data insight.

Cloud

Cloud Artificial Intelligence Innovation Architecture

Building an elastic query engine on disaggregated storage

The Morning Paper

MARCH 8, 2020

Building an elastic query engine on disaggregated storage, Vuppalapati, NSDI’20. For such workloads, shared-nothing architectures beget high cost, inflexibility, poor performance, and inefficiency, which hurts production applications and cluster deployments. joins) during query processing. Disaggregation (or not).

Storage

Storage Engineering Cache Serverless

Jellyfish: Cost-Effective Data Tiering for Uber’s Largest Storage System

Uber Engineering

SEPTEMBER 9, 2021

Uber deploys a few storage technologies to store business data based on their application model.

Storage

Storage Systems Engineering Technology

How a data lakehouse brings data insights to life

Dynatrace

OCTOBER 4, 2022

Further, these resources support countless Kubernetes clusters and Java-based architectures. In most data storage models, indexing engines enable faster access to query logs. But indexing requires schema management and additional storage to be effective, which adds cost and overhead. Cost-effective architecture.

Analytics

Analytics Storage Infrastructure Metrics

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

This decoupling is crucial in modern architectures where scalability and fault tolerance are paramount. The architecture of RabbitMQ is meticulously designed for complex message routing, enabling dynamic and flexible interactions between producers and consumers.

Best Practices

Best Practices Traffic Strategy Efficiency

The state of observability in 2024: Accelerating transformation with AI, analytics, and automation

Dynatrace

MARCH 6, 2024

Kubernetes adds to the complexity of technology stacks. Alongside the challenges of managing multicloud environments, IT and security teams struggle to maintain visibility into cloud-native architectures as Kubernetes continues to become the platform of choice for modern applications.

Analytics

Analytics Innovation Strategy Storage

Geek Reading - Week of June 5, 2013

DZone

OCTOBER 11, 2022

Simpler UI Testing with CasperJS (Architects Zone – Architectural Design Patterns & Best Practices). Using MongoDB as a cache store (Architects Zone – Architectural Design Patterns & Best Practices). Why haven’t cash-strapped American schools embraced open source? (Hacker News). Thoughts, Insights and Further Pointers.

Java

Java Best Practices Google Analytics

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

Dynatrace

OCTOBER 3, 2024

Trace your application: Imagine a microservices architecture with hundreds of dependencies. There is no need to think about schema and indexes, re-hydration, or hot/cold storage. This architecture also means you’re not required to determine your log data use cases beforehand or while analyzing logs within the new logs app.

Performance

Performance Architecture Innovation Latency

Keep track of thousands of environments from 20,000 feet

Dynatrace

SEPTEMBER 28, 2020

Before we dive into the technical implementation, let me explain the visual concept of this “Global Status Page”: Another requirement for this status page was that it has to be lightweight, with no data storage at all. Lightweight architecture. This is where the consolidated API, which I presented in my last post, comes into play.

Storage

Storage Architecture Tuning Efficiency

Pioneering customer-centric pricing models: Decoding ingest-centric vs. answer-centric pricing

Dynatrace

OCTOBER 17, 2023

The rise of cloud-native microservice architectures further exacerbates this change. Dynatrace has developed the purpose-built data lakehouse, Grail, eliminating the need for separate management of indexes and storage. All data is readily accessible without storage tiers, such as costly solid-state drives (SSDs).

Retail

Retail Storage Best Practices Architecture

CDNs: Speed Up Performance by Reducing Latency

DZone

MAY 3, 2023

In the previous posts, we covered things we had to do to upload files on the front end, things we had to do on the back end, and optimizing costs by moving file uploads to object storage.

Latency

Latency Speed Performance Storage

Boost DevOps maturity with observability and a data lakehouse

Dynatrace

JUNE 9, 2023

Research has found that 99% of organizations have embraced a multicloud architecture. When data storage strategies become problematic to DevOps maturity: Data warehouse-based approaches add cost and time to analytics projects. There’s also a potential scalability challenge with metrics in the context of microservices architectures.

DevOps

DevOps Analytics Storage Metrics

Conducting log analysis with an observability platform and full data context

Dynatrace

APRIL 20, 2023

Logs highlight observability challenges: Ingesting, storing, and processing the unprecedented explosion of data from sources such as software as a service, multicloud environments, containers, and serverless architectures can be overwhelming for today’s organizations. Seamless integration.

Analytics

Analytics Infrastructure Storage Architecture

Measuring the importance of data quality to causal AI success

Dynatrace

JANUARY 4, 2024

It starts with implementing data governance practices, which set standards and policies for data use and management in areas such as quality, security, compliance, storage, stewardship, and integration. Modern, cloud-native architectures have many moving parts, and identifying them all is a daunting task with human effort alone.

Government

Government Analytics Benchmarking Storage

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

In previous blog posts, we introduced the Key-Value Data Abstraction Layer and the Data Gateway Platform, both of which are integral to Netflix’s data architecture. Storage Layer: The storage layer for TimeSeries comprises a primary data store and an optional index data store.

Latency

Latency Storage Traffic Infrastructure

Tailored access management, Part 2: Onboard users to Grail and AppEngine

Dynatrace

APRIL 19, 2023

The new architecture enables more granularity in permission management and provides the dynamics necessary to serve modern access management use cases. ALLOW storage:system:read; The Storage All System Data Read policy grants access to Dynatrace internal data such as auditing events and query execution events.

Storage

Storage Analytics Systems Architecture

Article: Magic Pocket: Dropbox’s Exabyte-Scale Blob Storage System

InfoQ

MAY 15, 2023

A horizontally scalable exabyte-scale blob storage system which operates out of multiple regions, Magic Pocket is used to store all of Dropbox’s data. Adopting SMR technology and erasure codes, the system has extremely high durability guarantees but is cheaper than operating in the cloud. By Facundo Agriel

Storage

Storage Systems Scalability Cloud

What is cloud monitoring? How to improve your full-stack visibility

Dynatrace

JANUARY 11, 2023

Cloud storage monitoring. Teams can keep track of storage resources and processes that are provisioned to virtual machines, services, databases, and applications. Multicloud architectures, on the other hand, blend services from two or more private or public clouds — or from a combination of public, private, and edge clouds.

Cloud

Cloud Monitoring Best Practices Infrastructure

What is security analytics?

Dynatrace

JUNE 10, 2024

Security analytics must also contend with the multicomponent architecture of modern IT infrastructure. Dehydrated data has been compressed or otherwise altered for storage in a data warehouse. Observability starts with the collection, storage, and accessibility of multiple sources.

Analytics

Analytics Network Open Source Hardware
