Data, Processing and Storage - Technology Performance Pulse

Dynatrace elevates data security with separated storage and unique encryption keys for each tenant

Dynatrace

NOVEMBER 29, 2024

Dynatrace continues to deliver on its commitment to keeping your data secure in the cloud. Enhancing data separation by partitioning each customer’s data on the storage level and encrypting it with a unique encryption key adds an additional layer of protection against unauthorized data access.

Storage

Storage AWS Azure Architecture

OpenPipeline: Simplify access to critical business data

Dynatrace

NOVEMBER 4, 2024

There’s a goldmine of business data traversing your IT systems, yet most of it remains untapped. To unlock business value, the data must be: Accessible from anywhere. Data has value only when you can access it, no matter where it lies. Agile business decisions rely on fresh data. Easy to access. Contextualized.

Analytics

Analytics Airlines Metrics Monitoring

Efficient Multimodal Data Processing: A Technical Deep Dive

DZone

FEBRUARY 27, 2025

Multimodal data processing is the evolving need of the latest data platforms powering applications like recommendation systems, autonomous vehicles, and medical diagnostics. Handling multimodal data spanning text, images, videos, and sensor inputs requires resilient architecture to manage the diversity of formats and scale.

Efficiency

Efficiency Processing Latency Storage

Navigate end-to-end data compliance through effective log management

Dynatrace

NOVEMBER 29, 2024

With an increasing number of regulations and standards governing how businesses handle data, an end-to-end compliance strategy is crucial. As the volume and complexity of data increase, understanding and managing logs effectively to reach compliance is essential. These logs contain sensitive healthcare data.

Healthcare

Healthcare Analytics Storage Government

Data Storage Formats for Big Data Analytics: Performance and Cost Implications of Parquet, Avro, and ORC

DZone

SEPTEMBER 9, 2024

Efficient data processing is crucial for businesses and organizations that rely on big data analytics to make informed decisions. One key factor that significantly affects the performance of data processing is the storage format of the data.

Big Data

Big Data Storage Analytics Benchmarking

Cutting Big Data Costs: Effective Data Processing With Apache Spark

DZone

SEPTEMBER 14, 2023

In today's data-driven world, efficient data processing plays a pivotal role in the success of any project. Apache Spark , a robust open-source data processing framework, has emerged as a game-changer in this domain.

Big Data

Big Data Processing Open Source Games

Optimizing data warehouse storage

The Netflix TechBlog

DECEMBER 21, 2020

By Anupom Syam Background At Netflix, our current data warehouse contains hundreds of Petabytes of data stored in AWS S3 , and each day we ingest and create additional Petabytes. On the other hand, these optimizations themselves need to be sufficiently inexpensive to justify their own processing cost over the gains they bring.

Storage

Storage Latency Efficiency Data Engineering

Business process observability: An IT solution to a business challenge

Dynatrace

FEBRUARY 24, 2025

Business processes support virtually all aspects of an organizations operations. Theyre often categorized by their function; core processes directly create customer value, support processes increase departmental efficiency, and management processes drive strategic goals and compliance.

Processing

Processing Analytics Infrastructure Monitoring

Catching up with OpenTelemetry in 2025

Dynatrace

FEBRUARY 27, 2025

To understand whats happening in todays complex software ecosystems, you need comprehensive telemetry data to make it all observable. With so many types of technologies in software stacks around the globe, OpenTelemetry has emerged as the de facto standard for gathering telemetry data. But, generating telemetry data is the easy part.

Tuning

Tuning Open Source Innovation Monitoring

Storage handling improvements increase retention of transaction data for Dynatrace Managed

Dynatrace

JULY 29, 2021

Using existing storage resources optimally is key to being able to capture the right data over time. In this blog post, we announce: Compression of transaction data that’s older than three days. Improvements to Adaptive Data Retention. Transaction-data compression for Dynatrace Managed environments.

Storage

Storage Virtualization Infrastructure Availability

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

Dynatrace

JANUARY 31, 2024

Organizations choose data-driven approaches to maximize the value of their data, achieve better business outcomes, and realize cost savings by improving their products, services, and processes. However, there are many obstacles and limitations along the way to becoming a data-driven organization.

Analytics

Analytics Processing Transportation Storage

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

Greenplum Database is a massively parallel processing (MPP) SQL database that is built and based on PostgreSQL. It can scale towards a multi-petabyte level data workload without a single issue, and it allows access to a cluster of powerful servers that will work together within a single SQL interface where you can view all of the data.

Big Data

Big Data Database Artificial Intelligence Open Source

New continuous compliance requirements drive the need to converge observability and security

Dynatrace

DECEMBER 12, 2024

Move beyond logs-only security: Embrace a comprehensive, end-to-end approach that integrates all data from observability and security. Carefully planning and integrating new processes and tools is critical to ensuring compliance without disrupting daily operations. Dynatrace unifies all the different data types at scale and in context.

Analytics

Analytics Government Efficiency Innovation

Medallion Architecture: Efficient Batch and Stream Processing Data Pipelines With Azure Databricks and Delta Lake

DZone

JULY 13, 2023

In today's data-driven world, organizations need efficient and scalable data pipelines to process and analyze large volumes of data. Medallion Architecture provides a framework for organizing data processing workflows into different zones, enabling optimized batch and stream processing.

Azure

Azure Architecture Efficiency Processing

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

By: Rajiv Shringi , Oleksii Tkachuk , Kartik Sathyanarayanan Introduction In our previous blog post, we introduced Netflix’s TimeSeries Abstraction , a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.

Latency

Latency Cache Infrastructure Strategy

Microsoft Ignite 2024 guide: Cloud observability for AI transformation

Dynatrace

NOVEMBER 18, 2024

AI transformation, modernization, managing intelligent apps, safeguarding data, and accelerating productivity are all key themes at Microsoft Ignite 2024. Adopting AI to enhance efficiency and boost productivity is critical in a time of exploding data, cloud complexities, and disparate technologies.

Cloud

Cloud Azure Artificial Intelligence Innovation

How a data lakehouse brings data insights to life

Dynatrace

OCTOBER 4, 2022

For IT infrastructure managers and site reliability engineers, or SREs , logs provide a treasure trove of data. But on their own, logs present just another data silo as IT professionals attempt to troubleshoot and remediate problems. Data volume explosion in multicloud environments poses log issues.

Analytics

Analytics Storage Infrastructure Metrics

Using JSONB in PostgreSQL: How to Effectively Store & Index JSON Data in PostgreSQL

Scalegrid

JULY 17, 2020

It is an open standard format which organizes data into key/value pairs and arrays detailed in RFC 7159. JSON is the most common format used by web services to exchange data, store documents, unstructured data, etc. You can also check out our Working with JSON Data in PostgreSQL vs. JSONB Patterns & Antipatterns.

Storage

Storage Database Efficiency Processing

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

Second, developers had to constantly re-learn new data modeling practices and common yet critical data access patterns. To overcome these challenges, we developed a holistic approach that builds upon our Data Gateway Platform. Data Model At its core, the KV abstraction is built around a two-level map architecture.

Latency

Latency Storage Cache Servers

Master the Art of Querying Data on Amazon S3

DZone

JUNE 3, 2024

In an era where data is the new oil, effectively utilizing data is crucial for the growth of every organization. It is not enough to store these data durably, but also to effectively query and analyze them. Without a querying capability, the data stored in S3 would not be of any benefit.

Big Data

Big Data AWS Storage Analytics

Overcoming Challenges and Best Practices for Data Migration From On-Premise to Cloud

DZone

MARCH 29, 2023

Data migration is the process of moving data from one location to another, which is an essential aspect of cloud migration. Data migration involves transferring data from on-premise storage to the cloud. With the rapid adoption of cloud computing , businesses are moving their IT infrastructure to the cloud.

Best Practices

Best Practices Cloud Data Engineering Storage

The history of Grail: Why you need a data lakehouse

Dynatrace

OCTOBER 4, 2022

Some time ago, at a restaurant near Boston, three Dynatrace colleagues dined and discussed the growing data challenge for enterprises. At its core, this challenge involves a rapid increase in the amount—and complexity—of data collected within a company. Work with different and independent data types. Thus, Grail was born.

Artificial Intelligence

Artificial Intelligence Analytics Storage Architecture

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Dynatrace

OCTOBER 4, 2022

While data lakes and data warehousing architectures are commonly used modes for storing and analyzing data, a data lakehouse is an efficient third way to store and analyze data that unifies the two architectures while preserving the benefits of both. What is a data lakehouse? How does a data lakehouse work?

Artificial Intelligence

Artificial Intelligence Storage Analytics Government

Privacy spotlight: Control compliance in Dynatrace with multiple layers of sensitive data masking

Dynatrace

MAY 21, 2024

Observing complex environments involves handling regulatory, compliance, and data governance requirements. This continuously evolving landscape requires careful management and clarity regarding how sensitive data is used. This is particularly important when dealing with large volumes of data.

Government

Government Storage Tuning Monitoring

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

Every image you hover over isnt just a visual placeholder; its a critical data point that fuels our sophisticated personalization engine. It requires a state-of-the-art system that can track and process these impressions while maintaining a detailed history of each profiles exposure.

Tuning

Tuning Latency Efficiency Storage

Bulldozer: Batch Data Moving from Data Warehouse to Online Key-Value Stores

The Netflix TechBlog

OCTOBER 27, 2020

By Tianlong Chen and Ioannis Papapanagiotou Netflix has more than 195 million subscribers that generate petabytes of data everyday. Data scientists and engineers collect this data from our subscribers and videos, and implement data analytics models to discover customer behaviour with the goal of maximizing user joy.

Latency

Latency Storage Big Data Tuning

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

Dynatrace

MARCH 5, 2025

Cloud service providers (CSPs) share carbon footprint data with their customers, but the focus of these tools is on reporting and trending, effectively targeting sustainability officers and business leaders. Actions resulting from the evaluation The certification process surfaced a few recommendations for improving the app.

Energy

Energy Analytics Traffic Cloud

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. Both serve distinct purposes, from managing message queues to ingesting large data volumes. What is RabbitMQ? What is Apache Kafka?

Latency

Latency Analytics Architecture Storage

Building an Optimized Data Pipeline on Azure Using Spark, Data Factory, Databricks, and Synapse Analytics

DZone

APRIL 11, 2023

Data processing in the cloud has become increasingly popular due to its scalability, flexibility, and cost-effectiveness. This article will explore how these technologies can be used together to create an optimized data pipeline for data processing in the cloud.

Azure

Azure Analytics Storage Cloud

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

Rajiv Shringi Vinay Chella Kaidan Fullerton Oleksii Tkachuk Joey Lynch Introduction As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming , the ability to ingest and store vast amounts of temporal data — often reaching petabytes — with millisecond access latency has become increasingly vital.

Latency

Latency Storage Traffic Tuning

Getting answers from data starts with automated log acquisition, at any scale

Dynatrace

OCTOBER 14, 2022

Log data provides a unique source of truth for debugging applications, optimizing infrastructure, and investigating security incidents. This contextualization of log data enables AI-powered problem detection and root cause analysis at scale. Dynamic landscape and data handling requirements result in manual work.

Storage

Storage Government Network DevOps

Measuring the importance of data quality to causal AI success

Dynatrace

JANUARY 4, 2024

While this approach can be effective if the model is trained with a large amount of data, even in the best-case scenarios, it amounts to an informed guess, rather than a certainty. But to be successful, data quality is critical. Teams need to ensure the data is accurate and correctly represents real-world scenarios. Consistency.

Government

Government Analytics Benchmarking Storage

Data privacy by design: How an observability platform protects data security

Dynatrace

APRIL 19, 2023

Creating an ecosystem that facilitates data security and data privacy by design can be difficult, but it’s critical to securing information. When organizations focus on data privacy by design, they build security considerations into cloud systems upfront rather than as a bolt-on consideration.

Design

Design Storage Programming Analytics

Boost DevOps maturity with observability and a data lakehouse

Dynatrace

JUNE 9, 2023

ln a world driven by macroeconomic uncertainty, businesses increasingly turn to data-driven decision-making to stay agile. They’re unleashing the power of cloud-based analytics on large data sets to unlock the insights they and the business need to make smarter decisions. All of these factors challenge DevOps maturity.

DevOps

DevOps Analytics Storage Metrics

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

Dynatrace

OCTOBER 3, 2024

Considering the latest State of Observability 2024 report, it’s evident that multicloud environments not only come with an explosion of data beyond humans’ ability to manage it. It’s increasingly difficult to ingest, manage, store, and sort through this amount of data. You can find the list of use cases here.

Performance

Performance Architecture Innovation Latency

Real-time business analytics with Dynatrace: Unleashing the treasure trove of insights from your observability data

Dynatrace

AUGUST 20, 2024

Driven by that value, Dynatrace brings real-time observability, security, and business data into context and makes sense of it so our customers can get answers, automate, predict, and prevent. Executives are sitting on a goldmine of data, and they don’t know it.

Analytics

Analytics Latency Processing Systems

Process more with less using smarter cluster overload prevention for Dynatrace Managed

Dynatrace

MAY 14, 2020

Dynatrace just makes this easy—it comes out-of-the-box, no silos of data, no DIY stitching together tools, no wasted time, and no wasted resources. . A Dynatrace Managed cluster may lack the necessary hardware to process all the additional incoming data. Please watch disk space usage and extend it if needed.

Processing

Processing Hardware Traffic Storage

Conducting log analysis with an observability platform and full data context

Dynatrace

APRIL 20, 2023

Modern organizations ingest petabytes of data daily, but legacy approaches to log analysis and management cannot accommodate this volume of data. based financial services group, discussed how the bank uses log monitoring on the Dynatrace platform with an emphasis on observability and security data.

Analytics

Analytics Infrastructure Storage Architecture

Block Size and Its Impact on Storage Performance

DZone

JUNE 21, 2024

This article analyzes the correlation between block sizes and their impact on storage performance. This paper deals with definitions and understanding of structured data vs unstructured data, how various storage segments react to block size changes, and differences between I/O-driven and throughput-driven workloads.

Storage

Storage Performance Benchmarking Processing

How unified data and analytics offers a new approach to software intelligence

Dynatrace

OCTOBER 4, 2022

Software and data are a company’s competitive advantage. But for software to work perfectly, organizations need to use data to optimize every phase of the software lifecycle. The only way to address these challenges is through observability data — logs, metrics, and traces. Teams interact with myriad data types.

Analytics

Analytics Software Software Storage

Faster time to value with enhanced handling of OneAgent runtime data

Dynatrace

SEPTEMBER 23, 2020

Recent improvements in OneAgent runtime-data handling. Storage mount points in a system might be larger or smaller, local or remote, with high or low latency, and various speeds. Starting with OneAgent version 1.199, the runtime folder is configurable and consequently you can retain your storage mount point setup as-is.

Storage

Storage Latency Operating System Network

Privacy spotlight: Retain data in Grail with 1-day precision, for up to 10 years

Dynatrace

DECEMBER 5, 2023

Streamline privacy requirements with flexible retention periods Data retention is a critical aspect of data handling, and it’s not just about privacy compliance—it’s about having the flexibility to optimize data storage times in Grail for your Dynatrace use cases.

Storage

Storage Healthcare Best Practices Speed

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

As Netflix expanded globally and the volume of title launches skyrocketed, the operational challenges of maintaining this manual process became undeniable. Metadata and assets must be correctly configured, data must flow seamlessly, microservices must process titles without error, and algorithms must function as intended.

Traffic

Traffic Scalability Strategy Monitoring

Enhance data management with Grail: Ultimate guide to custom buckets and security policies

Dynatrace

OCTOBER 6, 2023

Grail: Enterprise-ready data lakehouse Grail, the Dynatrace causational data lakehouse, was explicitly designed for observability and security data, with artificial intelligence integrated into its foundation. Tables are a physical data model, essentially the type of observability data that you can store.

Artificial Intelligence

Artificial Intelligence Metrics Analytics Storage

Dynatrace elevates data security with separated storage and unique encryption keys for each tenant

OpenPipeline: Simplify access to critical business data

Trending Sources

Efficient Multimodal Data Processing: A Technical Deep Dive

Navigate end-to-end data compliance through effective log management

Data Storage Formats for Big Data Analytics: Performance and Cost Implications of Parquet, Avro, and ORC

Cutting Big Data Costs: Effective Data Processing With Apache Spark

Optimizing data warehouse storage

Business process observability: An IT solution to a business challenge

Catching up with OpenTelemetry in 2025

Storage handling improvements increase retention of transaction data for Dynatrace Managed

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

What is Greenplum Database? Intro to the Big Data Database

New continuous compliance requirements drive the need to converge observability and security

Medallion Architecture: Efficient Batch and Stream Processing Data Pipelines With Azure Databricks and Delta Lake

Netflix’s Distributed Counter Abstraction

Microsoft Ignite 2024 guide: Cloud observability for AI transformation

How a data lakehouse brings data insights to life

Using JSONB in PostgreSQL: How to Effectively Store & Index JSON Data in PostgreSQL

Introducing Netflix’s Key-Value Data Abstraction Layer

Master the Art of Querying Data on Amazon S3

Overcoming Challenges and Best Practices for Data Migration From On-Premise to Cloud

The history of Grail: Why you need a data lakehouse

What is a data lakehouse? Combining data lakes and warehouses for the best of both worlds

Privacy spotlight: Control compliance in Dynatrace with multiple layers of sensitive data masking

Introducing Impressions at Netflix

Bulldozer: Batch Data Moving from Data Warehouse to Online Key-Value Stores

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

RabbitMQ vs. Kafka: Key Differences

Building an Optimized Data Pipeline on Azure Using Spark, Data Factory, Databricks, and Synapse Analytics

Introducing Netflix TimeSeries Data Abstraction Layer

Getting answers from data starts with automated log acquisition, at any scale

Measuring the importance of data quality to causal AI success

Data privacy by design: How an observability platform protects data security

Boost DevOps maturity with observability and a data lakehouse

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

Real-time business analytics with Dynatrace: Unleashing the treasure trove of insights from your observability data

Process more with less using smarter cluster overload prevention for Dynatrace Managed

Conducting log analysis with an observability platform and full data context

Block Size and Its Impact on Storage Performance

How unified data and analytics offers a new approach to software intelligence

Faster time to value with enhanced handling of OneAgent runtime data

Privacy spotlight: Retain data in Grail with 1-day precision, for up to 10 years

Title Launch Observability at Netflix Scale

Enhance data management with Grail: Ultimate guide to custom buckets and security policies

Stay Connected