There’s a goldmine of business data traversing your IT systems, yet most of it remains untapped. To unlock business value, the data must be accessible from anywhere (data has value only when you can access it, no matter where it lies), fresh enough to support agile business decisions, easy to access, and contextualized.
Built on Azure Blob Storage, Azure Data Lake Storage Gen2 is a suite of features for big data analytics that combines the capabilities of Azure Data Lake Storage Gen1 and Azure Blob Storage.
Using existing storage resources optimally is key to being able to capture the right data over time. In this blog post, we announce compression of transaction data that’s older than three days, improvements to Adaptive Data Retention, and transaction-data compression for Dynatrace Managed environments.
By Anupom Syam. Background: At Netflix, our current data warehouse contains hundreds of petabytes of data stored in AWS S3, and each day we ingest and create additional petabytes. We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits.
Cloud computing platforms have fundamentally altered how organizations access and manage data. With the emergence of cloud services, a broad range of storage choices is now readily available to meet the differing demands of organizations and individuals.
It can scale to multi-petabyte data workloads without issue, and it provides access to a cluster of powerful servers that work together behind a single SQL interface through which you can view all of the data. This feature-packed database delivers powerful, rapid analytics on data that scales up to petabyte volumes.
Dynatrace Managed is intrinsically highly available as it stores three copies of all events, user sessions, and metrics across its cluster nodes. Our Premium High Availability comes with the following features: Active-active deployment model for optimum hardware utilization. Minimized cross-data center network traffic.
AI transformation, modernization, managing intelligent apps, safeguarding data, and accelerating productivity are all key themes at Microsoft Ignite 2024. Adopting AI to enhance efficiency and boost productivity is critical in a time of exploding data, cloud complexities, and disparate technologies.
By: Rajiv Shringi, Oleksii Tkachuk, Kartik Sathyanarayanan. Introduction: In our previous blog post, we introduced Netflix’s TimeSeries Abstraction, a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.
Cloud service providers (CSPs) share carbon footprint data with their customers, but the focus of these tools is on reporting and trending, effectively targeting sustainability officers and business leaders. The certification results are now publicly available.
JSON is an open standard format that organizes data into key/value pairs and arrays, as detailed in RFC 7159. It is the most common format used by web services to exchange data and to store documents and unstructured data. You can also check out our Working with JSON Data in PostgreSQL vs. JSONB Patterns & Antipatterns post.
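To make the JSON-vs-JSONB distinction concrete, here is a minimal sketch (not from the article) that stores and queries a JSONB column from Python with psycopg2; the connection string, table, and field names are hypothetical, and JSONB (unlike plain JSON) stores a parsed binary form that supports operators and indexing.

```python
# Minimal JSONB example with psycopg2; DSN and table name are hypothetical.
import psycopg2
from psycopg2.extras import Json

conn = psycopg2.connect("dbname=appdb user=app")  # hypothetical DSN
with conn, conn.cursor() as cur:
    # A jsonb column stores a parsed, indexable representation;
    # a plain json column would keep the original text verbatim.
    cur.execute("""
        CREATE TABLE IF NOT EXISTS events (
            id      serial PRIMARY KEY,
            payload jsonb
        )
    """)
    cur.execute("INSERT INTO events (payload) VALUES (%s)",
                [Json({"user": "alice", "action": "login"})])
    # ->> extracts a field as text, so it can be filtered and indexed.
    cur.execute(
        "SELECT payload->>'user' FROM events WHERE payload->>'action' = %s",
        ("login",),
    )
    print(cur.fetchall())
conn.close()
```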
For IT infrastructure managers and site reliability engineers (SREs), logs provide a treasure trove of data. But on their own, logs present just another data silo as IT professionals attempt to troubleshoot and remediate problems. Data volume explosion in multicloud environments poses log issues.
Some time ago, at a restaurant near Boston, three Dynatrace colleagues dined and discussed the growing data challenge for enterprises. At its core, this challenge involves a rapid increase in the amount—and complexity—of data collected within a company. Work with different and independent data types. Thus, Grail was born.
Central to this infrastructure is our use of multiple online distributed databases such as Apache Cassandra, a NoSQL database known for its high availability and scalability. Second, developers had to constantly re-learn new data modeling practices and common yet critical data access patterns.
Observing complex environments involves handling regulatory, compliance, and data governance requirements. This continuously evolving landscape requires careful management and clarity regarding how sensitive data is used. This is particularly important when dealing with large volumes of data.
Organizations choose data-driven approaches to maximize the value of their data, achieve better business outcomes, and realize cost savings by improving their products, services, and processes. However, there are many obstacles and limitations along the way to becoming a data-driven organization. Understanding the context.
Log data provides a unique source of truth for debugging applications, optimizing infrastructure, and investigating security incidents. This contextualization of log data enables AI-powered problem detection and root cause analysis at scale. Dynamic landscape and data handling requirements result in manual work.
With this new DPS pricing model option, customers can retain data at a fixed low cost with no additional cost to query for up to 35 days. This model provides a predictable way for customers to manage and analyze logs, drive log management tool consolidation, and reduce costs while gaining maximum value from their log data.
Cloud-based solutions typically aren’t a viable option for enterprises that have strict security or privacy policies that require their data to be maintained on-premises. Dynatrace Managed now available on the Google Cloud Platform. Dynatrace news. How to set up Dynatrace Managed on Google Cloud Platform. Prerequisites.
Every image you hover over isn’t just a visual placeholder; it’s a critical data point that fuels our sophisticated personalization engine. This nuanced integration of data and technology empowers us to offer bespoke content recommendations. This queue ensures we are consistently capturing raw events from our global userbase.
By Rajiv Shringi, Vinay Chella, Kaidan Fullerton, Oleksii Tkachuk, and Joey Lynch. Introduction: As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming, the ability to ingest and store vast amounts of temporal data, often reaching petabytes, with millisecond access latency has become increasingly vital.
Creating an ecosystem that facilitates data security and data privacy by design can be difficult, but it’s critical to securing information. When organizations focus on data privacy by design, they build security considerations into cloud systems upfront rather than as a bolt-on consideration.
By Tianlong Chen and Ioannis Papapanagiotou. Netflix has more than 195 million subscribers who generate petabytes of data every day. Data scientists and engineers collect this data from our subscribers and videos, and implement data analytics models to discover customer behaviour with the goal of maximizing user joy.
While this approach can be effective if the model is trained with a large amount of data, even in the best-case scenarios, it amounts to an informed guess, rather than a certainty. But to be successful, data quality is critical. Teams need to ensure the data is accurate and correctly represents real-world scenarios. Consistency.
Optimize cost and availability while staying compliant. Observability data like logs and metrics provide automated answers, root cause detection, and security insights. Customer decisions about data retention are often determined by important security, privacy, and legal issues.
Streamline privacy requirements with flexible retention periods. Data retention is a critical aspect of data handling, and it’s not just about privacy compliance; it’s about having the flexibility to optimize data storage times in Grail for your Dynatrace use cases.
Store the data in an optimized, highly distributed datastore. Additionally, some collectors will instead poll our Kafka queue for impressions data. This data is processed from a real-time impressions stream into a Kafka queue, which our title health system regularly polls. Track real-time title impressions from the Netflix UI.
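As a rough illustration of the polling pattern (this is a sketch, not Netflix’s actual collector), the snippet below consumes impression events from a Kafka topic with the kafka-python client; the broker address, topic name, consumer group, and field names are all assumptions.

```python
# Hypothetical collector polling a Kafka topic for raw impression events.
import json
from kafka import KafkaConsumer

consumer = KafkaConsumer(
    "title-impressions",                  # hypothetical topic name
    bootstrap_servers=["localhost:9092"],
    group_id="title-health-collector",    # hypothetical consumer group
    value_deserializer=lambda raw: json.loads(raw.decode("utf-8")),
    auto_offset_reset="earliest",
)

for message in consumer:
    event = message.value
    # Each record is one raw impression captured from the UI.
    print(event.get("title_id"), event.get("impression_ts"))
```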
Grail: Enterprise-ready data lakehouse. Grail, the Dynatrace causational data lakehouse, was explicitly designed for observability and security data, with artificial intelligence integrated into its foundation. Tables are a physical data model, essentially the type of observability data that you can store.
RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. Both serve distinct purposes, from managing message queues to ingesting large data volumes. What is RabbitMQ?
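To make the contrast concrete, here is a minimal RabbitMQ publishing sketch using the pika client (a Kafka producer would instead append records to a partitioned, replayable log); the broker address, queue name, and message body are hypothetical.

```python
# Hypothetical RabbitMQ producer: durable queue, persistent message.
import pika

connection = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
channel = connection.channel()

# Durable queue so the queue definition survives a broker restart.
channel.queue_declare(queue="task_queue", durable=True)

channel.basic_publish(
    exchange="",                  # default exchange routes by queue name
    routing_key="task_queue",
    body=b"resize-image:42",
    properties=pika.BasicProperties(delivery_mode=2),  # persistent message
)
connection.close()
```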
Considering the latest State of Observability 2024 report, it’s evident that multicloud environments come with an explosion of data beyond humans’ ability to manage it. It’s increasingly difficult to ingest, manage, store, and sort through this amount of data. You can find the list of use cases here.
These media-focused machine learning algorithms, as well as other teams, generate a lot of data from the media files; that data, which we described in our previous blog, is stored as annotations in Marken. Similarly, client teams don’t have to worry about when or how the data is written.
They handle complex infrastructure, maintain service availability, and respond swiftly to incidents. Predictive AI uses machine learning, data analysis, statistical models, and AI methods to predict anomalies, identify patterns, and create forecasts. This data-driven approach fosters continuous refinement of processes and systems.
As more organizations move their PostgreSQL databases onto Kubernetes, a common question arises: Which storage solution best handles its demands? For stateful workloads like PostgreSQL, storage must offer high availability and safeguard data integrity, even under intense, high-volume conditions.
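For concreteness, here is a minimal sketch (not tied to any particular storage vendor) that requests a PersistentVolumeClaim for a PostgreSQL pod with the official kubernetes Python client; the namespace, claim name, and storage class are hypothetical, and the availability and integrity guarantees ultimately depend on the storage provider behind that class.

```python
# Hypothetical PVC request for a PostgreSQL data volume on Kubernetes.
from kubernetes import client, config

config.load_kube_config()  # or config.load_incluster_config()

pvc = client.V1PersistentVolumeClaim(
    metadata=client.V1ObjectMeta(name="postgres-data"),
    spec=client.V1PersistentVolumeClaimSpec(
        access_modes=["ReadWriteOnce"],        # one node mounts it read-write
        storage_class_name="fast-replicated",  # hypothetical HA-capable class
        resources=client.V1ResourceRequirements(
            requests={"storage": "50Gi"}
        ),
    ),
)

client.CoreV1Api().create_namespaced_persistent_volume_claim(
    namespace="databases", body=pvc
)
```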
Recent improvements in OneAgent runtime-data handling. Storage mount points in a system might be larger or smaller, local or remote, with high or low latency, and various speeds. Starting with OneAgent version 1.199, the runtime folder is configurable and consequently you can retain your storage mount point setup as-is.
Driven by that value, Dynatrace brings real-time observability, security, and business data into context and makes sense of it so our customers can get answers, automate, predict, and prevent. Executives are sitting on a goldmine of data, and they don’t know it.
Incremental Backups: Speeds up recovery and makes data management more efficient for active databases. Faster Write Operations: Enhancements to write-ahead log (WAL) processing double PostgreSQL’s ability to handle concurrent transactions, improving uptime and data accessibility. Start your free trial today!
Business logic is packaged (for example, in JAR form) to be executed as part of a user-defined data pipeline, i.e., a DAG for the purpose of transforming data using some business logic. Pipelines are managed with Netflix’s homegrown CLI tool for data pipeline management, and each task is an atomic unit of data transformation logic, a non-separable execution block in the workflow chain.
“I have ingested important custom data into Dynatrace, critical to running my applications and making accurate business decisions… but can I trust the accuracy and reliability?” Welcome to the world of data observability. At its core, data observability is about ensuring the availability, reliability, and quality of data.
Netflix applies data science to hundreds of use cases across the company, including optimizing content delivery and video encoding. Data scientists at Netflix relish our culture that empowers them to work autonomously and use their judgment to solve problems independently. How could we improve the quality of life for data scientists?
A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. Understanding distributed storage is imperative as data volumes and the need for robust storage solutions rise.
Dynatrace and the Dynatrace Intelligent Observability Platform have added support for the newly introduced delivery of Amazon VPC Flow Logs to Amazon Kinesis Data Firehose. This support enables customers to define specific endpoint delivery of real-time streaming data to platforms such as Dynatrace. What is VPC Flow Logs? Why Dynatrace?
MongoDB offers several storage engines that cater to various use cases. The default storage engine in earlier versions was MMAPv1, which utilized memory-mapped files and collection-level locking. The newer, pluggable storage engine, WiredTiger, addresses this by using document-level concurrency control, compression (including prefix compression for indexes), and row-based storage.
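As a small illustration (a sketch assuming a local mongod and pymongo, not part of the original post), the snippet below checks which storage engine the server is running and creates a collection with an explicit WiredTiger block compressor; the database and collection names are hypothetical.

```python
# Inspect the active storage engine and set WiredTiger options per collection.
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
status = client.admin.command("serverStatus")
print("storage engine:", status["storageEngine"]["name"])  # e.g. "wiredTiger"

db = client["inventory"]  # hypothetical database
# Collection-level WiredTiger options (such as the block compressor) can be
# supplied at creation time via a configString.
db.create_collection(
    "products",
    storageEngine={"wiredTiger": {"configString": "block_compressor=zlib"}},
)
print(db.list_collection_names())
```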
There is a wealth of options for how you can approach storage configuration in Percona Operator for PostgreSQL, and in this blog post, we review various storage strategies, from basics to more sophisticated use cases. For example, you can choose the public cloud storage type (gp3, io2, etc.) or set the file system.
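As one hedged example of what such a choice looks like, the sketch below renders the storage portion of a PerconaPGCluster custom resource as YAML from Python; the field layout (instances, dataVolumeClaimSpec) reflects my reading of the v2 CRD and should be verified against the operator reference, and the gp3 storage class and sizes are just illustrative.

```python
# Render a hypothetical PerconaPGCluster storage spec as YAML.
import yaml

cluster = {
    "apiVersion": "pgv2.percona.com/v2",
    "kind": "PerconaPGCluster",
    "metadata": {"name": "cluster1"},
    "spec": {
        "instances": [
            {
                "name": "instance1",
                "replicas": 3,
                "dataVolumeClaimSpec": {
                    "storageClassName": "gp3",   # e.g. gp3, io2, ...
                    "accessModes": ["ReadWriteOnce"],
                    "resources": {"requests": {"storage": "100Gi"}},
                },
            }
        ]
    },
}

print(yaml.safe_dump(cluster, sort_keys=False))
```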
Several pain points have made it difficult for organizations to manage their data efficiently and create actual value. Limited data availability constrains value creation. Modern IT environments, whether multicloud, on-premises, or hybrid-cloud architectures, generate exponentially increasing data volumes.