Data, Open Source and Storage - Technology Performance Pulse

Catching up with OpenTelemetry in 2025

Dynatrace

FEBRUARY 27, 2025

To understand whats happening in todays complex software ecosystems, you need comprehensive telemetry data to make it all observable. With so many types of technologies in software stacks around the globe, OpenTelemetry has emerged as the de facto standard for gathering telemetry data. But, generating telemetry data is the easy part.

Tuning

Tuning Open Source Innovation Monitoring

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

DECEMBER 3, 2019

by David Berg , Ravi Kiran Chirravuri , Romain Cledat , Savin Goyal , Ferras Hamad , Ville Tuulos tl;dr Metaflow is now open-source! Netflix applies data science to hundreds of use cases across the company, including optimizing content delivery and video encoding. How could we improve the quality of life for data scientists?

Open Source

Open Source AWS Infrastructure Energy

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

It can scale towards a multi-petabyte level data workload without a single issue, and it allows access to a cluster of powerful servers that will work together within a single SQL interface where you can view all of the data. This feature-packed database provides powerful and rapid analytics on data that scales up to petabyte volumes.

Big Data

Big Data Database Artificial Intelligence Open Source

What is? OpenTelemetry??An open-source standard for logs, metrics, and traces

Dynatrace

OCTOBER 15, 2021

These are just a few of the open-source technologies you may encounter as you research observability solutions for managing complex multicloud IT environments and the services that run on them. Of these open-source observability tools, one stands out. Source: OpenTelemetry Documentation. What is telemetry data?

Open Source

Open Source Metrics Cloud Transportation

Bring syslog into Dynatrace using OpenTelemetry to get open source value with enterprise support

Dynatrace

MARCH 15, 2024

For example, a supported syslog component must support the masking of sensitive data at capture to avoid transmitting personally identifiable information or other confidential data over the network. Log batching, enrichment, transformation, log source distinction, and application offloading are also regular requirements.

Open Source

Open Source Infrastructure Network Government

Cutting Big Data Costs: Effective Data Processing With Apache Spark

DZone

SEPTEMBER 14, 2023

In today's data-driven world, efficient data processing plays a pivotal role in the success of any project. Apache Spark , a robust open-source data processing framework, has emerged as a game-changer in this domain.

Big Data

Big Data Processing Open Source Games

Part 1: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

DECEMBER 17, 2024

Metric definitions are often scattered across various databases, documentation sites, and code repositories, making it difficult for analysts and data scientists to find reliable information quickly. DJ stands out as an open source solution that is actively developed and stress-tested at Netflix.

Analytics

Analytics Engineering Entertainment Metrics

Using JSONB in PostgreSQL: How to Effectively Store & Index JSON Data in PostgreSQL

Scalegrid

JULY 17, 2020

It is an open standard format which organizes data into key/value pairs and arrays detailed in RFC 7159. JSON is the most common format used by web services to exchange data, store documents, unstructured data, etc. You can also check out our Working with JSON Data in PostgreSQL vs. JSONB Patterns & Antipatterns.

Storage

Storage Database Efficiency Availability

The history of Grail: Why you need a data lakehouse

Dynatrace

OCTOBER 4, 2022

Some time ago, at a restaurant near Boston, three Dynatrace colleagues dined and discussed the growing data challenge for enterprises. At its core, this challenge involves a rapid increase in the amount—and complexity—of data collected within a company. Work with different and independent data types. Thus, Grail was born.

Artificial Intelligence

Artificial Intelligence Analytics Storage Architecture

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

DECEMBER 3, 2019

by David Berg , Ravi Kiran Chirravuri , Romain Cledat , Savin Goyal , Ferras Hamad , Ville Tuulos tl;dr Metaflow is now open-source! Netflix applies data science to hundreds of use cases across the company, including optimizing content delivery and video encoding. How could we improve the quality of life for data scientists?

Open Source

Open Source AWS Infrastructure Energy

Escrow Buddy: An open-source tool from Netflix for remediation of missing FileVault keys in MDM

The Netflix TechBlog

JUNE 12, 2023

Netflix has open-sourced Escrow Buddy, which helps Security and IT teams ensure they have valid FileVault recovery keys for all their Macs in MDM. The agent also enables rotation of recovery keys after use, local storage and validation of recovery keys, and other features.

Open Source

Open Source Database Servers Storage

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

Dynatrace

JANUARY 31, 2024

Organizations choose data-driven approaches to maximize the value of their data, achieve better business outcomes, and realize cost savings by improving their products, services, and processes. However, there are many obstacles and limitations along the way to becoming a data-driven organization. Understanding the context.

Analytics

Analytics Processing Transportation Storage

Distributed tracing with Dynatrace just got even better

Dynatrace

MARCH 11, 2025

With our latest enhancements, were transforming the way you work with trace data. For deeper exploration, our Distributed Tracing app empowers you to analyze raw trace data and uncover insights, whether troubleshooting errors, optimizing performance, or discovering the unknown unknowns. But why stop there?

Games

Games Analytics Innovation Metrics

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. Both serve distinct purposes, from managing message queues to ingesting large data volumes. What is RabbitMQ? What is Apache Kafka?

Latency

Latency Analytics Architecture Storage

How unified data and analytics offers a new approach to software intelligence

Dynatrace

OCTOBER 4, 2022

Software and data are a company’s competitive advantage. But for software to work perfectly, organizations need to use data to optimize every phase of the software lifecycle. The only way to address these challenges is through observability data — logs, metrics, and traces. Teams interact with myriad data types.

Analytics

Analytics Software Software Storage

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

Store the data in an optimized, highly distributed datastore. Additionally, some collectors will instead poll our kafka queue for impressions data. This data is processed from a real-time impressions stream into a Kafka queue, which our title health system regularly polls. Track real-time title impressions from the NetflixUI.

Traffic

Traffic Entertainment Strategy Innovation

Kubernetes in the wild report 2023

Dynatrace

JANUARY 16, 2023

The study analyzes factual Kubernetes production data from thousands of organizations worldwide that are using the Dynatrace Software Intelligence Platform to keep their Kubernetes clusters secure, healthy, and high performing. Open-source software drives a vibrant Kubernetes ecosystem. Java, Go, and Node.js

Open Source

Open Source Java Operating System Programming

Configuring OpenTelemetry Agents to Enrich Data and Reduce Observability Costs

DZone

JANUARY 16, 2023

BindPlane OP is a powerful open-source tool that makes it easy to build and manage telemetry pipelines to ship data from IT environments of any kind and size to any analysis tool or storage destination. The vendor-agnostic toolset is excellent for reducing data costs and getting the most out of your data.

Open Source

Open Source Storage Processing

The Ultimate Guide to Open Source Databases

Percona

MARCH 30, 2023

The use of open source databases has increased steadily in recent years. Past trepidation — about perceived vulnerabilities and performance issues — has faded as decision makers realize what an “open source database” really is and what it offers. What is an open source database?

Open Source

Open Source Database Storage Scalability

Data Engineers of Netflix?—?Interview with Pallavi Phadnis

The Netflix TechBlog

OCTOBER 28, 2021

Data Engineers of Netflix?—?Interview Interview with Pallavi Phadnis This post is part of our “ Data Engineers of Netflix ” series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. Pallavi Phadnis is a Senior Software Engineer on the Product Data Science and Engineering team.

Data Engineering

Data Engineering Engineering Big Data Software Engineering

A Recap of the Data Engineering Open Forum at Netflix

The Netflix TechBlog

JUNE 20, 2024

A summary of sessions at the first Data Engineering Open Forum at Netflix on April 18th, 2024 The Data Engineering Open Forum at Netflix on April 18th, 2024. At Netflix, we aspire to entertain the world, and our data engineering teams play a crucial role in this mission by enabling data-driven decision-making at scale.

Data Engineering

Data Engineering Engineering Entertainment Software Engineering

From Proprietary to Open Source: The Complete Guide to Database Migration

Percona

OCTOBER 18, 2023

Migrating a proprietary database to open source is a major decision that can significantly affect your organization. Advantages of migrating to open source For many reasons mentioned earlier, organizations are increasingly shifting towards open source databases for their data management needs.

Open Source

Open Source Database Hardware Strategy

Which Is the Best PostgreSQL GUI? 2019 Comparison

Scalegrid

SEPTEMBER 16, 2019

PostgreSQL graphical user interface (GUI) tools help these open source database users to manage, manipulate, and visualize their data. Offers great visualization to help you interpret your data. The window-based interface makes it much easier to manage your PostgreSQL data. pgAdmin Cost: Free (open source).

Open Source

Open Source Database Azure Cloud

Data lakehouse innovations advance the three pillars of observability for more collaborative analytics

Dynatrace

FEBRUARY 16, 2023

How do you get more value from petabytes of exponentially exploding, increasingly heterogeneous data? The short answer: The three pillars of observability—logs, metrics, and traces—converging on a data lakehouse. To solve this problem, Dynatrace launched Grail, its causational data lakehouse , in 2022.

Analytics

Analytics Innovation Metrics Database

Weighing the top seven Kubernetes challenges and how to solve them

Dynatrace

JUNE 6, 2023

Kubernetes has become the leading container orchestration platform for organizations adopting open source solutions to manage, scale, and automate application deployment. Kubernetes is an open source container orchestration platform for managing, automating, and scaling containerized applications. What is Kubernetes?

Open Source

Open Source Storage Analytics Innovation

Apache Kafka + Apache Flink = Match Made in Heaven

DZone

MAY 5, 2023

This blog post explores the benefits of combining both open-source frameworks, shows unique differentiators of Flink versus Kafka, and discusses when to use a Kafka-native streaming engine like Kafka Streams instead of Flink.

Open Source

Open Source Storage Innovation Engineering

Connect Fluentd logs with Dynatrace traces, metrics, and topology data to enhance Kubernetes observability

Dynatrace

APRIL 8, 2022

Fluentd is an open-source data collector that unifies log collection, processing, and consumption. Output plugins deliver logs to storage solutions, analytics tools, and observability platforms like Dynatrace. All metrics, traces, and real user data are also surfaced in the context of specific events.

Metrics

Metrics Analytics Software Architecture Open Source

PostgreSQL vs. Oracle: Difference in Costs, Ease of Use & Functionality

Scalegrid

JULY 13, 2020

The unstoppable rise of open source databases. One database in particular is causing a huge dent in Oracle’s market share – open source PostgreSQL. See how open source PostgreSQL Community version costs compare to Oracle Standard Edition and Oracle Enterprise Edition. What’s causing this massive shift?

Open Source

Open Source Tuning C++ Database

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

Troubleshooting a session in Edgar When we started building Edgar four years ago, there were very few open-source distributed tracing systems that satisfied our needs. Our tactical approach was to use Netflix-specific libraries for collecting traces from Java-based streaming services until open source tracer libraries matured.

Infrastructure

Infrastructure Transportation Storage Open Source

Remote Workstations for the Discerning Artists

The Netflix TechBlog

MARCH 8, 2021

Historically artists had these machines built for them at their desks and only had access to the data and applications when they were in the office. Machine Configuration: Spinnaker Starting at the left of the chart, Spinnaker is an open-source platform that controls the creation of workstation pools.

Entertainment

Entertainment Storage Open Source Hardware

What is security analytics?

Dynatrace

JUNE 10, 2024

Security analytics combines data collection, aggregation, and analysis to search for and identify potential threats. Using a combination of historical data and information collected in real time, security teams can detect threats earlier in the SDLC. Here’s how. What is security analytics? Why is security analytics important?

Analytics

Analytics Network Open Source Hardware

MySQL General Tablespaces: A Powerful Storage Option for Your Data

Percona

JANUARY 4, 2024

Managing storage and performance efficiently in your MySQL database is crucial, and general tablespaces offer flexibility in achieving this. In contrast to the single system tablespace that holds system tables by default, general tablespaces are user-defined storage containers for multiple InnoDB tables.

Storage

Storage Engineering Database Open Source

10 open-source Kubernetes tools for highly effective SRE and Ops Teams

Abhishek Tiwari

JANUARY 6, 2018

Here we present a list of 10 open-source Kubernetes tools to make your SRE and Ops teams more effective to achieve their service level objectives. The backup files are stored in an object storage service (e.g. Ark server performs the actual backup, validates it and loads backup files in cloud object storage. Telepresence.

Open Source

Open Source DevOps Engineering Storage

Dynatrace extends contextual analytics and AIOps for open observability

Dynatrace

JULY 29, 2021

The complexity of such deployments has accelerated with the adoption of emerging, open-source technologies that generate telemetry data, which is exploding in terms of volume, speed, and cardinality. Entity tagging requires an enormous amount of manual effort and is always open to interpretation.

Analytics

Analytics Open Source Serverless Architecture

OpenShift vs. Kubernetes: Understanding the differences

Dynatrace

JUNE 7, 2023

Kubernetes is an open source container orchestration platform that enables organizations to automatically scale, manage, and deploy containerized applications in distributed environments. Like Kubernetes, OpenShift is an open source Kubernetes-based container platform. What is Kubernetes? What is OpenShift?

Open Source

Open Source Social Media Infrastructure Operating System

Monitoring Self-Destructing Apps Using Prometheus

DZone

JANUARY 8, 2020

Prometheus is an open-source system monitoring and alerting toolkit. Data related to monitoring is stored in RAM and LevelDB nevertheless data can be stored to other storage systems such as ElasticSearch, InfluxDb, and others, [link]. Watch out for your self-destructing apps!

Monitoring

Monitoring Open Source Storage Systems

Monitor your technology stack with Dynatrace and Amazon Managed Service for Prometheus

Dynatrace

SEPTEMBER 29, 2021

Prometheus is an open-source monitoring and alerting toolkit for services and applications that run in containers. It offers a flexible multidimensional data model that’s based on key-value pairs and a potent query language (PromQL). Dynatrace news. What is Prometheus and how does it work?

Technology

Technology Technology Monitoring AWS

Reducing Your Database Hosting Costs: DigitalOcean vs. AWS vs. Azure

Scalegrid

APRIL 28, 2020

Since database hosting is more dependent on memory (RAM) than storage, we are going to compare various instance sizes ranging from just 1GB of RAM up to 64GB of RAM so you can see how costs vary across different application workloads. ScaleGrid provides an Import wizard to migrate data from one cluster to another. EC2 instances.

Azure

Azure AWS Database Latency

Three smart log ingestion strategies in Dynatrace (without OneAgent)

Dynatrace

DECEMBER 15, 2022

One option is to install OneAgent on that syslog server, which automatically discovers, instruments and sends the log data to the Dynatrace platform. Yet observability into syslog data on Dynatrace would help you monitor and troubleshoot infrastructure. In Cribl’s configuration, open “Data/Destinations” and find “Webhook.”

Strategy

Strategy AWS Open Source Transportation

Exploring Data @ Netflix

The Netflix TechBlog

JUNE 25, 2021

By Gim Mahasintunan on behalf of Data Platform Engineering. Supporting a rapidly growing base of engineers of varied backgrounds using different data stores can be challenging in any organization. In this blog post, we are thrilled to share that we are open-sourcing one such tool: the Netflix Data Explorer.

Metrics

Metrics Best Practices Design Strategy

What is container orchestration?

Dynatrace

MARCH 24, 2023

Problems include provisioning and deployment; load balancing; securing interactions between containers; configuration and allocation of resources such as networking and storage; and deprovisioning containers that are no longer needed. Originally created by Google, Kubernetes was donated to the CNCF as an open source project.

Infrastructure

Infrastructure Open Source Operating System Cloud

OTel contributor Q&A: Dynatrace works to ensure enterprise readiness for OpenTelemetry

Dynatrace

NOVEMBER 17, 2020

With these release candidate APIs available, instrumentation for web frameworks, storage clients, and much more can be built. We at Dynatrace understand the importance of contributing our expertise in enterprise-grade intelligent observability to the open source community. Dynatrace fully embraces OpenTelemetry.

Open Source

Open Source Software Engineering Government Java

The Anna Key-Value Store Now Has 355x the Performance of DynamoDB for the Dollar

High Scalability

SEPTEMBER 8, 2018

They've posted about Anna's new superpowers in Going Fast and Cheap: How We Made Anna Autoscale : Using Anna v0 as an in-memory storage engine, we set out to address the cloud storage problems described above. Each storage server collects statistics about the requests it serves, the data it stores, etc.

Storage

Storage Performance AWS Media

Accelerate your cloud journey with Dynatrace observability for AWS S3 logs

Dynatrace

JUNE 27, 2023

Many AWS services and third party solutions use AWS S3 for log storage. Centralized log management for scalable ingestion into Grail As AWS S3 proves to be the preferred way of storing cloud logs, enterprise customers face mounting challenges in putting S3 log data to use.

AWS

AWS Cloud Lambda Analytics

Catching up with OpenTelemetry in 2025

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

Trending Sources

What is Greenplum Database? Intro to the Big Data Database

What is? OpenTelemetry??An open-source standard for logs, metrics, and traces

Bring syslog into Dynatrace using OpenTelemetry to get open source value with enterprise support

Cutting Big Data Costs: Effective Data Processing With Apache Spark

Part 1: A Survey of Analytics Engineering Work at Netflix

Using JSONB in PostgreSQL: How to Effectively Store & Index JSON Data in PostgreSQL

The history of Grail: Why you need a data lakehouse

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

Escrow Buddy: An open-source tool from Netflix for remediation of missing FileVault keys in MDM

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

Distributed tracing with Dynatrace just got even better

RabbitMQ vs. Kafka: Key Differences

How unified data and analytics offers a new approach to software intelligence

Title Launch Observability at Netflix Scale

Kubernetes in the wild report 2023

Configuring OpenTelemetry Agents to Enrich Data and Reduce Observability Costs

The Ultimate Guide to Open Source Databases

Data Engineers of Netflix?—?Interview with Pallavi Phadnis

A Recap of the Data Engineering Open Forum at Netflix

From Proprietary to Open Source: The Complete Guide to Database Migration

Which Is the Best PostgreSQL GUI? 2019 Comparison

Data lakehouse innovations advance the three pillars of observability for more collaborative analytics

Weighing the top seven Kubernetes challenges and how to solve them

Apache Kafka + Apache Flink = Match Made in Heaven

Connect Fluentd logs with Dynatrace traces, metrics, and topology data to enhance Kubernetes observability

PostgreSQL vs. Oracle: Difference in Costs, Ease of Use & Functionality

Building Netflix’s Distributed Tracing Infrastructure

Remote Workstations for the Discerning Artists

What is security analytics?

MySQL General Tablespaces: A Powerful Storage Option for Your Data

10 open-source Kubernetes tools for highly effective SRE and Ops Teams

Dynatrace extends contextual analytics and AIOps for open observability

OpenShift vs. Kubernetes: Understanding the differences

Monitoring Self-Destructing Apps Using Prometheus

Monitor your technology stack with Dynatrace and Amazon Managed Service for Prometheus

Reducing Your Database Hosting Costs: DigitalOcean vs. AWS vs. Azure

Three smart log ingestion strategies in Dynatrace (without OneAgent)

Exploring Data @ Netflix

What is container orchestration?

OTel contributor Q&A: Dynatrace works to ensure enterprise readiness for OpenTelemetry

The Anna Key-Value Store Now Has 355x the Performance of DynamoDB for the Dollar

Accelerate your cloud journey with Dynatrace observability for AWS S3 logs

Stay Connected