Dynatrace continues to deliver on its commitment to keeping your data secure in the cloud. Enhancing data separation by partitioning each customer’s data at the storage level and encrypting it with a unique encryption key adds a further layer of protection against unauthorized data access.
AI transformation, modernization, managing intelligent apps, safeguarding data, and accelerating productivity are all key themes at Microsoft Ignite 2024. Adopting AI to enhance efficiency and boost productivity is critical in a time of exploding data, cloud complexities, and disparate technologies.
Multimodal data processing is an evolving requirement for modern data platforms, which power applications like recommendation systems, autonomous vehicles, and medical diagnostics. Handling multimodal data spanning text, images, videos, and sensor inputs requires a resilient architecture that can manage this diversity of formats at scale.
Cloud computing platforms have fundamentally altered how organizations access and manage data. With the emergence of cloud services, a broad range of storage options is now readily available to meet the differing demands of organizations and individuals.
By Anupom Syam. At Netflix, our current data warehouse contains hundreds of petabytes of data stored in AWS S3, and each day we ingest and create additional petabytes. We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits.
In today's data-driven world, businesses face numerous challenges when it comes to storing, securing, and analyzing vast amounts of information. Enter StoneFly , a leading provider of storage area network (SAN) and network-attached storage (NAS) solutions that aim to simplify your life and tackle complex business problems head-on.
It scales to multi-petabyte workloads without issue and provides access to a cluster of powerful servers that work together behind a single SQL interface where you can view all of the data. This feature-packed database delivers powerful, rapid analytics at petabyte volumes.
RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. Both serve distinct purposes, from managing message queues to ingesting large data volumes. What is RabbitMQ?
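As a rough illustration of the routing model that distinguishes RabbitMQ, here is a minimal sketch using the pika client; the broker address, exchange name, and routing key are hypothetical.

```python
# Minimal sketch: topic-based routing with RabbitMQ via the pika client.
# Assumes a broker on localhost; exchange and routing key are hypothetical.
import pika

connection = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
channel = connection.channel()

# A topic exchange routes messages to queues by pattern-matching the routing key.
channel.exchange_declare(exchange="events", exchange_type="topic")
channel.basic_publish(
    exchange="events",
    routing_key="orders.created",        # consumers can bind with "orders.*"
    body=b'{"order_id": 42}',
)
connection.close()
```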
With this new DPS pricing model option, customers can retain data at a fixed low cost with no additional cost to query for up to 35 days. This model provides a predictable way for customers to manage and analyze logs, drive log management tool consolidation, and reduce costs while gaining maximum value from their log data.
Central to this infrastructure is our use of multiple online distributed databases such as Apache Cassandra, a NoSQL database known for its high availability and scalability. At the same time, developers had to constantly re-learn new data modeling practices and common yet critical data access patterns.
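To make the data modeling point concrete, here is a minimal sketch with the DataStax cassandra-driver showing a table designed query-first around a single access pattern; the keyspace, table, and columns are hypothetical.

```python
# Minimal sketch, assuming a local Cassandra node; keyspace and table are
# hypothetical. Cassandra tables are modeled query-first: the partition key
# matches the read pattern ("actions for a user, newest first").
from cassandra.cluster import Cluster

cluster = Cluster(["127.0.0.1"])
session = cluster.connect()

session.execute("""
    CREATE KEYSPACE IF NOT EXISTS app
    WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1}
""")
session.execute("""
    CREATE TABLE IF NOT EXISTS app.user_actions (
        user_id   uuid,
        action_ts timestamp,
        action    text,
        PRIMARY KEY ((user_id), action_ts)
    ) WITH CLUSTERING ORDER BY (action_ts DESC)
""")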
By Rajiv Shringi, Vinay Chella, Kaidan Fullerton, Oleksii Tkachuk, and Joey Lynch. As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming, the ability to ingest and store vast amounts of temporal data, often reaching petabytes, with millisecond access latency has become increasingly vital.
Data processing in the cloud has become increasingly popular due to its scalability, flexibility, and cost-effectiveness. This article will explore how these technologies can be used together to create an optimized data pipeline for data processing in the cloud.
Some time ago, at a restaurant near Boston, three Dynatrace colleagues dined and discussed the growing data challenge for enterprises. At its core, this challenge involves a rapid increase in the amount, and complexity, of data collected within a company, along with the need to work with disparate and independent data types. Thus, Grail was born.
Caching is the process of storing frequently accessed data or resources in a temporary storage location, such as memory or disk, to improve retrieval speed and reduce the need for repetitive processing.
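To ground the definition, here is a minimal sketch of in-memory caching in Python; the lookup function and its cost are hypothetical stand-ins.

```python
# Minimal sketch of in-memory caching: memoize an expensive lookup so
# repeated calls with the same key skip the slow path entirely.
from functools import lru_cache
import time

@lru_cache(maxsize=1024)
def fetch_profile(user_id: int) -> dict:
    time.sleep(0.5)                      # stand-in for a slow DB or API call
    return {"id": user_id, "name": f"user-{user_id}"}

fetch_profile(7)   # slow: does the work and stores the result
fetch_profile(7)   # fast: served from the cache
```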
Having a distributed and scalable graph database system is highly sought after in many enterprise scenarios. But do not be misled: designing and implementing a scalable graph database system has never been a trivial task.
Organizations choose data-driven approaches to maximize the value of their data, achieve better business outcomes, and realize cost savings by improving their products, services, and processes. However, there are many obstacles and limitations on the way to becoming a data-driven organization, and the first of them is understanding the context.
By Tianlong Chen and Ioannis Papapanagiotou. Netflix has more than 195 million subscribers that generate petabytes of data every day. Data scientists and engineers collect this data from our subscribers and videos, and implement data analytics models to discover customer behaviour with the goal of maximizing user joy.
NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. At the same time, NoSQL data modeling is not so well studied and lacks the systematic theory found in relational databases. Many techniques that are described below are perfectly applicable to this model.
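As a small illustration of one such technique, denormalization through embedding, here is a hypothetical document sketched in Python; all fields are invented for the example.

```python
# Minimal sketch of a common NoSQL modeling technique: denormalization.
# Instead of joining a separate comments table at read time (relational
# style), the document embeds comments so one read serves the access pattern.
post_document = {
    "_id": "post-123",
    "title": "NoSQL data modeling",
    "author": {"id": "u-9", "name": "Ada"},      # embedded, not referenced
    "comments": [                                # duplicated data, cheap reads
        {"user": "u-4", "text": "Great overview"},
        {"user": "u-7", "text": "Very useful"},
    ],
}
```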
By Rajiv Shringi, Oleksii Tkachuk, and Kartik Sathyanarayanan. In our previous blog post, we introduced Netflix’s TimeSeries Abstraction, a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.
Metadata and assets must be correctly configured, data must flow seamlessly, microservices must process titles without error, and algorithms must function as intended. This could lead to an exponential increase in logged data, and the complexity of these operational demands underscored the urgent need for a scalable solution.
In a world driven by macroeconomic uncertainty, businesses increasingly turn to data-driven decision-making to stay agile. They’re unleashing the power of cloud-based analytics on large data sets to unlock the insights they and the business need to make smarter decisions. All of these factors challenge DevOps maturity.
In today's data-driven world, organizations need efficient and scalable data pipelines to process and analyze large volumes of data. Medallion Architecture provides a framework for organizing data processing workflows into different zones, enabling optimized batch and stream processing.
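As a rough sketch of the zone idea, here is a bronze/silver/gold flow in pandas; the file names and columns are hypothetical, and a production pipeline would more likely use Spark or a lakehouse table format.

```python
# Minimal sketch of Medallion-style zones using pandas; paths and columns
# are hypothetical stand-ins for real ingestion sources.
import pandas as pd

bronze = pd.read_json("raw_events.jsonl", lines=True)   # raw, as ingested

silver = (                                               # cleaned, conformed
    bronze.dropna(subset=["user_id"])
          .assign(ts=lambda d: pd.to_datetime(d["ts"]))
)

gold = (                                                 # business-level aggregate
    silver.groupby(silver["ts"].dt.date)["user_id"]
          .nunique()
          .rename("daily_active_users")
)
gold.to_csv("daily_active_users.csv")
```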
Modern organizations ingest petabytes of data daily, but legacy approaches to log analysis and management cannot accommodate this volume of data. A speaker from a financial services group discussed how the bank uses log monitoring on the Dynatrace platform, with an emphasis on observability and security data.
As more organizations move their PostgreSQL databases onto Kubernetes, a common question arises: Which storage solution best handles its demands? Picking the right option is critical, directly impacting performance, reliability, and scalability. To address these concerns, […]
This means you no longer have to provision, scale, and maintain servers to run your applications, databases, and storage systems. Speed is next: serverless solutions are quick to spin up or down as needed, with no delays due to limited storage or resource access. Finally, there’s scalability.
Track real-time title impressions from the Netflix UI. This data is processed from a real-time impressions stream into a Kafka queue, which our title health system regularly polls; additionally, some collectors poll the Kafka queue directly for impressions data. Store the data in an optimized, highly distributed datastore.
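To illustrate the polling step described above, here is a minimal consumer sketch with confluent-kafka; the broker address, group id, and topic name are hypothetical.

```python
# Minimal sketch, assuming a local broker; topic and group id are hypothetical.
from confluent_kafka import Consumer

consumer = Consumer({
    "bootstrap.servers": "localhost:9092",
    "group.id": "title-health",
    "auto.offset.reset": "earliest",
})
consumer.subscribe(["impressions"])

while True:
    msg = consumer.poll(1.0)        # wait up to 1s for the next impression
    if msg is None:
        continue                    # no record this interval; poll again
    if msg.error():
        print(msg.error())
        continue
    print(msg.value())              # hand the impression to the health checks
```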
Grail: Enterprise-ready data lakehouse. Grail, the Dynatrace causational data lakehouse, was explicitly designed for observability and security data, with artificial intelligence integrated into its foundation. Tables are a physical data model, essentially the type of observability data that you can store.
When an action arrives from a client, the service performs two parallel operations: i) persisting the action in the data store, and ii) publishing the action to a streaming data store for a pub-sub model. Downstream services (e.g., the User Feed Service and the Media Counter Service) read the actions from the streaming data store and perform their specific tasks.
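Here is a minimal sketch of that persist-and-publish step; the datastore and stream writers are hypothetical stand-ins, with threads standing in for whatever concurrency the real service uses.

```python
# Minimal sketch of the persist-and-publish pattern described above; the
# persist/publish bodies are placeholders for real datastore/stream clients.
from concurrent.futures import ThreadPoolExecutor

def persist(action: dict) -> None:
    ...  # write the action to the primary data store

def publish(action: dict) -> None:
    ...  # append the action to the stream for pub-sub consumers

def handle_action(action: dict) -> None:
    # Run both operations in parallel, as the design describes.
    with ThreadPoolExecutor(max_workers=2) as pool:
        futures = [pool.submit(persist, action), pool.submit(publish, action)]
        for f in futures:
            f.result()   # surface any failure from either branch
```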
Incremental backups: speed up recovery and make data management more efficient for active databases. Faster write operations: enhancements to write-ahead log (WAL) processing double PostgreSQL's ability to handle concurrent transactions, improving uptime and data accessibility.
With Azure Event Hubs, the big-data streaming platform and event ingestion service, millions of events can be received and processed in a single second. Any real-time analytics provider or batching/storage adapter can transform and store data supplied to an event hub.
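As a minimal illustration of feeding an event hub, here is a producer sketch with the azure-eventhub Python SDK; the connection string, hub name, and payload are placeholders.

```python
# Minimal sketch of sending events with the azure-eventhub SDK; the
# connection string and hub name below are placeholders, not real values.
from azure.eventhub import EventHubProducerClient, EventData

producer = EventHubProducerClient.from_connection_string(
    conn_str="<EVENT_HUBS_CONNECTION_STRING>",
    eventhub_name="telemetry",
)
with producer:
    batch = producer.create_batch()      # batches respect the hub's size limit
    batch.add(EventData(b'{"sensor": 1, "reading": 20.5}'))
    producer.send_batch(batch)
```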
trillion suns: weight of the Milky Way; 300+: backdoored apps on GitHub; 10%: hacked self-driving cars needed to bring traffic to a halt; $3 million: Marriott data breach cost after insurance. Quotable Quotes: @kelseyhightower: Platform in a box solutions that are attempting to turn Kubernetes into a PaaS are missing the "as a service" part.
A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. Understanding distributed storage is imperative as data volumes and the need for robust storage solutions rise.
A summary of sessions at the first Data Engineering Open Forum, held at Netflix on April 18th, 2024. At Netflix, we aspire to entertain the world, and our data engineering teams play a crucial role in this mission by enabling data-driven decision-making at scale.
You'll also learn strategies for maintaining data safety and managing node failures so your RabbitMQ setup is always up to the task. Key takeaway: RabbitMQ improves scalability and fault tolerance in distributed systems by decoupling applications, enabling reliable message exchanges.
More organizations are adopting a hybrid IT environment, with data center and virtualized components. However, today’s IT teams are stretched thin, with little time to firefight issues with deployment, integration, and data center management. But in an HCI framework, purchasing more storage means purchasing more compute.
While we were able to put out the immediate fire by disabling the newly created alerts, this incident raised some critical concerns around the scalability of our alerting system. Atlas is an in-memory time-series database that ingests multiple billions of time-series per day and retains the last two weeks of data.
MongoDB offers several storage engines that cater to various use cases. The default storage engine in earlier versions was MMAPv1, which utilized memory-mapped files and collection-level locking. The newer, pluggable storage engine, WiredTiger, addresses those limitations with prefix compression, document-level locking, and row-based storage.
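A quick way to see which engine a deployment runs, sketched with pymongo and assuming a local mongod:

```python
# Minimal sketch: confirm the active MongoDB storage engine via pymongo.
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
status = client.admin.command("serverStatus")
print(status["storageEngine"]["name"])   # "wiredTiger" on modern deployments
```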
Werner Vogels' weblog on building scalable and robust distributed systems. Amazon DynamoDB: a fast and scalable NoSQL database service designed for internet-scale applications. The original Dynamo design was based on a core set of strong distributed systems principles, resulting in an ultra-scalable and highly reliable database system.
According to data provided by Sandvine in their 2022 Global Internet Phenomena Report , video traffic accounted for 53.72% of the total volume of internet traffic in 2021, and the closest trailing category (social) came in at just 12.69%.
Netflix applies data science to hundreds of use cases across the company, including optimizing content delivery and video encoding. Data scientists at Netflix relish our culture that empowers them to work autonomously and use their judgment to solve problems independently. How could we improve the quality of life for data scientists?
The ELK stack is an acronym for Elasticsearch, Logstash, and Kibana, which offers the following capabilities. Elasticsearch: a scalable search and analytics engine with log analytics support and a document-oriented datastore, well suited to data-driven applications.
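To make the Elasticsearch role concrete, here is a minimal index-and-search sketch with the elasticsearch Python client; the index name and document fields are hypothetical.

```python
# Minimal sketch, assuming a local Elasticsearch node; index and fields
# are hypothetical examples, not a prescribed schema.
from elasticsearch import Elasticsearch

es = Elasticsearch("http://localhost:9200")

es.index(index="app-logs", document={"level": "ERROR", "msg": "disk full"})
es.indices.refresh(index="app-logs")     # make the doc searchable immediately

hits = es.search(index="app-logs", query={"match": {"level": "ERROR"}})
print(hits["hits"]["total"])
```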
Managing storage and performance efficiently in your MySQL database is crucial, and general tablespaces offer flexibility in achieving this. In contrast to the single system tablespace that holds system tables by default, general tablespaces are user-defined storage containers for multiple InnoDB tables.
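As a brief sketch of the feature, the following creates a general tablespace and places a table in it via mysql-connector-python; the connection details, tablespace, and table names are placeholders.

```python
# Minimal sketch: create a general tablespace and assign a table to it.
# Credentials and names are placeholders for illustration only.
import mysql.connector

cnx = mysql.connector.connect(host="localhost", user="root", password="...")
cur = cnx.cursor()

# A user-defined general tablespace can hold multiple InnoDB tables.
cur.execute("CREATE TABLESPACE ts_app ADD DATAFILE 'ts_app.ibd' ENGINE=InnoDB")
cur.execute("CREATE DATABASE IF NOT EXISTS demo")
cur.execute("CREATE TABLE demo.orders (id INT PRIMARY KEY) TABLESPACE ts_app")

cnx.close()
```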
Data with context can improve your ability to deliver on your goals, modernize your organization, and accelerate business transformation. These outcomes are made easy through the platform’s unique ability to turn data into answers and action, in contextual, real-time, and cost-effective ways that were previously impossible.
Fluent Bit is a telemetry agent designed to receive data (logs, traces, and metrics), process or modify it, and export it to a destination. Fluent Bit can serve as a proxy before you send data to Dynatrace or a similar backend. You can also use Fluent Bit as a processor, performing various actions on the data in flight.