Metrics, Processing and Tuning - Technology Performance Pulse

Catching up with OpenTelemetry in 2025

Dynatrace

FEBRUARY 27, 2025

OpenTelemetry is enhancing GenAI observability: By defining semantic conventions for GenAI and implementing Python-based instrumentation for OpenAI, OpenTelemetry is moving towards addressing GenAI monitoring and performance tuning needs. Semantic Conventions, or semconv, are the standard that makes it all possible.

Tuning

Tuning Open Source Innovation Monitoring

Demo: Transform OpenTelemetry data into actionable insights with the Dynatrace Distributed Tracing app

Dynatrace

OCTOBER 29, 2024

A Dynatrace API token with the following permissions: Ingest OpenTelemetry traces (openTelemetryTrace.ingest), Ingest metrics (metrics.ingest), and Ingest logs (logs.ingest). To set up the token, see Dynatrace API – Tokens and authentication in Dynatrace documentation. So, stay tuned for more enhancements and features.

Metrics

Metrics Tuning Monitoring Availability

Support of OpenTelemetry metrics with enhanced AI capabilities

Dynatrace

MAY 18, 2022

With the most important components becoming release candidates, Dynatrace now supports the full OpenTelemetry specification on all runtimes and automatically adds intelligence to metrics at enterprise scale. So these metrics are immensely valuable to SRE and DevOps teams. Automation and intelligence for metrics at enterprise scale.

Metrics

Metrics DevOps Tuning Virtualization

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

It requires a state-of-the-art system that can track and process these impressions while maintaining a detailed history of each profile's exposure. In this multi-part blog series, we take you behind the scenes of our system that processes billions of impressions daily.

Tuning

Tuning Latency Efficiency Storage

9 key DevOps metrics for success

Dynatrace

SEPTEMBER 28, 2021

The emerging concepts of working with DevOps metrics and DevOps KPIs have really come a long way. DevOps metrics to help you meet your DevOps goals. Your next challenge is ensuring your DevOps processes, pipelines, and tooling meet the intended goal. Lead time for changes helps teams understand how effective their processes are.

DevOps

DevOps Metrics Traffic Efficiency

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

This challenge has given rise to the discipline of observability engineering, which concentrates on the details of telemetry data to fine-tune observability use cases. To get a more granular look into telemetry data, many analysts rely on custom metrics using Prometheus.

Metrics

Metrics Engineering Energy Tuning

Data Mesh — A Data Movement and Processing Platform @ Netflix

The Netflix TechBlog

AUGUST 1, 2022

A Data Movement and Processing Platform @ Netflix. By Bo Lei, Guilherme Pires, James Shao, Kasturi Chatterjee, Sujay Jain, Vlad Sydorenko. Background: Real-time processing technologies (a.k.a. stream processing) are one of the key factors that enable Netflix to maintain its leading position in the competition of entertaining our users.

Processing

Processing Transportation Entertainment Tuning

Dynatrace innovates again with the release of topology-driven auto-adaptive metric baselines

Dynatrace

JULY 23, 2020

With the advent and ingestion of thousands of custom metrics into Dynatrace, we’ve once again pushed the boundaries of automatic, AI-based root cause analysis with the introduction of auto-adaptive baselines as a foundational concept for Dynatrace topology-driven timeseries measurements. In many cases, metric behavior changes over time.

Metrics

Metrics Innovation Strategy Monitoring

Dynatrace partners with AWS to provide enterprise-grade, intelligent observability for custom OpenTelemetry metrics

Dynatrace

DECEMBER 15, 2020

OpenTelemetry metrics are useful for augmenting the fully automatic observability that can be achieved with Dynatrace OneAgent. OpenTelemetry metrics add domain-specific data such as business KPIs and license-relevant consumption details. Enterprise-grade observability for custom OpenTelemetry metrics from AWS.

AWS

AWS Metrics Analytics Government

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

The Challenge of Title Launch Observability: As engineers, we're wired to track system metrics like error rates, latencies, and CPU utilization, but what about metrics that matter to a title's success? Option 1: Log Processing. Log processing offers a straightforward solution for monitoring and analyzing title launches.

Traffic

Traffic Scalability Strategy Monitoring

Announcing enterprise-grade observability at scale for your OpenTelemetry custom metrics

Dynatrace

NOVEMBER 23, 2020

As the application owner of an e-commerce application, for example, you can enrich the source code of your application with domain-specific knowledge by adding actionable semantics to collected performance or business metrics. New OpenTelemetry metrics exporters provide the broadest language support on the market.

Metrics

Metrics Open Source Tuning Analytics

Running the Astronomy Shop OpenTelemetry demo application with Dynatrace

Dynatrace

MARCH 13, 2025

The configuration also includes an optional span metrics connector, which generates Request, Error, and Duration (R.E.D.) metrics from span data.
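To make the R.E.D. idea concrete, here is a minimal Python sketch of deriving request counts, error counts, and average duration from span records. It is not the connector's implementation: the `Span` class, its fields, and the service names are hypothetical and exist only for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Span:
    # Hypothetical, simplified span record (not the OpenTelemetry data model).
    service: str
    duration_ms: float
    is_error: bool


def red_metrics(spans):
    """Aggregate Request, Error, and Duration figures per service."""
    acc = defaultdict(lambda: {"requests": 0, "errors": 0, "total_ms": 0.0})
    for span in spans:
        bucket = acc[span.service]
        bucket["requests"] += 1
        bucket["errors"] += int(span.is_error)
        bucket["total_ms"] += span.duration_ms
    return {
        service: {
            "requests": bucket["requests"],
            "errors": bucket["errors"],
            "avg_duration_ms": bucket["total_ms"] / bucket["requests"],
        }
        for service, bucket in acc.items()
    }


spans = [
    Span("cart", 12.0, False),
    Span("cart", 30.0, True),
    Span("checkout", 50.0, False),
]
result = red_metrics(spans)
```

A real pipeline would typically record durations in histogram buckets rather than a single average, so that percentiles can be derived later.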

Open Source

Open Source Metrics Architecture Infrastructure

Multidimensional analysis 2.0: Analyze microservice-based metrics without code changes (Part 2)

Dynatrace

FEBRUARY 5, 2020

Select a Metric and Aggregation to get started. You can choose any standard Dynatrace metric and any request attribute. Simply switch the metric to Failure rate to find out if there was an error that might have impacted your platinum customers. You might guess that the relatively long booking time is caused by a failure.

Metrics

Metrics Code Tuning Monitoring

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. RabbitMQ follows a message broker model with advanced routing, while Kafka's event streaming architecture uses partitioned logs for distributed processing. What is Apache Kafka?

Latency

Latency Analytics Architecture Storage

Extend infrastructure observability with JMX Extensions and additional full-stack metrics

Dynatrace

APRIL 20, 2020

To provide you with more value when monitoring hosts in infrastructure mode, we’re extending our infrastructure mode with a range of metrics that have until now only been available in full-stack mode. Monitor additional metrics. All of these metrics are now part of infrastructure mode. How to get access.

Infrastructure

Infrastructure Metrics Java Virtualization

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Stream processing: One approach to such a challenging scenario is stream processing, a computing paradigm and software architectural style for data-intensive software systems that emerged to cope with requirements for near real-time processing of massive amounts of data. This significantly increases event latency.

Engineering

Engineering Tuning Latency Open Source

An approach to index tuning – Part 2

SQL Performance

APRIL 13, 2020

In my last post, I started to outline the process I go through when tuning queries – specifically when I discover that I need to add a new index, or modify an existing one. Once we have that data, we can move on to the next steps in the process.

Tuning

Tuning Cache Metrics Testing

Migrating Critical Traffic At Scale with No Downtime — Part 1

The Netflix TechBlog

MAY 4, 2023

Migrating Critical Traffic At Scale with No Downtime — Part 1. Shyam Gala, Javier Fernandez-Ivern, Anup Rokkam Pratap, Devang Shah. Hundreds of millions of customers tune into Netflix every day, expecting an uninterrupted and immersive streaming experience. This technique facilitates validation on multiple fronts.

Traffic

Traffic Latency Tuning Systems

Incremental Processing using Netflix Maestro and Apache Iceberg

The Netflix TechBlog

NOVEMBER 20, 2023

By Jun He, Yingyi Zhang, and Pawan Dixit. Incremental processing is an approach to process new or changed data in workflows. The key advantage is that it only incrementally processes data that are newly added or updated to a dataset, instead of re-processing the complete dataset.
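As a rough illustration of the idea (not how Maestro or Iceberg implement it), the sketch below uses a watermark so each run picks up only rows changed since the previous run. The `updated_at` field and the function name are assumptions made for this example.

```python
def incremental_run(rows, last_watermark):
    """Return rows changed since last_watermark and the advanced watermark.

    Assumes each row is a dict with a comparable 'updated_at' value
    (hypothetical field name).
    """
    changed = [row for row in rows if row["updated_at"] > last_watermark]
    # If nothing changed, the watermark stays where it was.
    new_watermark = max(
        (row["updated_at"] for row in changed), default=last_watermark
    )
    return changed, new_watermark


dataset = [
    {"id": 1, "updated_at": 10},
    {"id": 2, "updated_at": 25},
    {"id": 3, "updated_at": 40},
]

# First run: only rows newer than watermark 20 are processed.
batch, watermark = incremental_run(dataset, last_watermark=20)

# Second run with no new data: nothing is re-processed.
empty_batch, same_watermark = incremental_run(dataset, last_watermark=watermark)
```

The second call returns an empty batch, which is the point of the approach: unchanged data is never processed again.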

Processing

Processing Big Data Efficiency Engineering

Observability throughout the software development lifecycle increases delivery performance

Dynatrace

OCTOBER 4, 2024

Today, development teams suffer from a lack of automation for time-consuming tasks, the absence of standardization due to an overabundance of tool options, and insufficiently mature DevSecOps processes. This process begins when the developer merges a code change and ends when it is running in a production environment.

Software

Software Software Development Performance

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

Proper setup involves creating a configuration process that accounts for hostname changes, which could prevent nodes from rejoining the cluster. Message load balancing guarantees that messages are processed evenly across different queues and nodes within the RabbitMQ system. Erlang is the backbone of RabbitMQ clustering.

Best Practices

Best Practices Traffic Strategy Efficiency

Dynatrace simplifies StatsD, Telegraf, and Prometheus observability with Davis AI

Dynatrace

OCTOBER 7, 2020

Open-source metric sources automatically map to our Smartscape model for AI analytics. We’ve just enhanced Dynatrace OneAgent with an open metric API. Here’s a quick overview of what you can achieve now that the Dynatrace Software Intelligence Platform has been extended to ingest third-party metrics. Dynatrace news.

Open Source

Open Source Metrics Analytics Tuning

Toward a Better Quality Metric for the Video Community

The Netflix TechBlog

DECEMBER 7, 2020

VMAF is a video quality metric that Netflix jointly developed with a number of university collaborators and open-sourced on GitHub. One aspect that differentiates VMAF from other traditional metrics such as PSNR or SSIM is that VMAF is able to predict more consistently across spatial resolutions, across shots, and across genres (for example.

Metrics

Metrics Open Source Tuning Speed

User experience score—the one metric to rule them all

Dynatrace

AUGUST 6, 2019

Defining a comprehensive user-experience metric gives rise to questions such as: How do we compare the user experience of one session to another? Which metric can be used for the purpose of reporting user experience and tracking it over a period of time? A single metric for user experience segmentation. Error metrics.

Metrics

Metrics Tuning Monitoring Analytics

Auto-adaptive thresholds for AI-driven quality gating

Dynatrace

JUNE 4, 2024

This process, known as auto-adaptive thresholding, eliminates the need to define a static threshold upfront. The training times and other quality metrics, such as the RMSE (Root Mean Squared Error), SMAPE (Symmetric Mean Absolute Percentage Error), and coverage probability, are monitored using Dynatrace.
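For readers unfamiliar with the two error metrics, here is a minimal Python sketch of how they are commonly computed. SMAPE has several variants in the literature; the formula below is one common choice and is an assumption, since the excerpt does not say which definition is used.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error over paired observations."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def smape(actual, predicted):
    """Symmetric MAPE in percent: mean of 2|a - p| / (|a| + |p|).

    Pairs where both values are zero contribute 0 to the sum
    (an assumption; conventions differ).
    """
    total = 0.0
    for a, p in zip(actual, predicted):
        denom = abs(a) + abs(p)
        if denom:
            total += 2 * abs(a - p) / denom
    return 100 * total / len(actual)


rmse_value = rmse([1, 1], [4, 4])
smape_value = smape([100, 100], [100, 300])
```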

Metrics

Metrics Engineering Code Tuning

How Netflix uses eBPF flow logs at scale for network insight

The Netflix TechBlog

JUNE 7, 2021

Berkeley Packet Filter (BPF) is an in-kernel execution engine that processes a virtual instruction set, and has been extended as eBPF for providing a safe way to extend kernel functionality. The Flow Exporter also publishes various operational metrics to Atlas. What is BPF? So how do we ingest and enrich these flows at scale?

Network

Network Transportation AWS Cloud

OpenTelemetry 101: A nontechnical guide for IT leaders and enthusiasts

Dynatrace

JULY 22, 2024

Using OpenTelemetry, developers can collect and process telemetry data from applications, services, and systems. Observability: Observability is the ability to determine a system’s health by analyzing the data it generates, such as logs, metrics, and traces. There are three main types of telemetry data: Metrics.

Latency

Latency Best Practices Metrics Open Source

New Prometheus-based extensions enable intelligent observability for more than 200 additional technologies

Dynatrace

FEBRUARY 14, 2022

Building on its advanced analytics capabilities for Prometheus data, Dynatrace now enables you to create extensions based on Prometheus metrics. Many technologies expose their metrics in the Prometheus data format. Easily gain actionable insights with the Dynatrace Extension for Prometheus metrics. Prometheus in Kubernetes and

Technology

Technology Technology Metrics Infrastructure

Rapid Event Notification System at Netflix

The Netflix TechBlog

FEBRUARY 18, 2022

Event Prioritization: Considering the use cases were wide-ranging both in terms of their sources and their importance, we built segmentation into the event processing. We thus assigned a priority to each use case and sharded event traffic by routing to priority-specific queues and the corresponding event processing clusters.
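The routing step described here can be sketched in a few lines of Python. This is only an illustration of sharding by priority, not Netflix's system: the use-case names, the priority levels, and the in-memory queues are all hypothetical.

```python
from collections import deque

# Hypothetical mapping from use case to priority.
PRIORITY_BY_USE_CASE = {
    "profile_change": "high",
    "watch_activity": "medium",
    "diagnostic_signal": "low",
}


def shard_by_priority(events, default="low"):
    """Route each event to the queue that matches its use case's priority."""
    queues = {"high": deque(), "medium": deque(), "low": deque()}
    for event in events:
        priority = PRIORITY_BY_USE_CASE.get(event["use_case"], default)
        queues[priority].append(event)
    return queues


events = [
    {"id": 1, "use_case": "watch_activity"},
    {"id": 2, "use_case": "profile_change"},
    {"id": 3, "use_case": "unknown_source"},
]
queues = shard_by_priority(events)
```

Keeping a separate queue per priority means a burst of low-priority traffic cannot delay high-priority events, because each queue can be drained by its own processing cluster.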

Systems

Systems Traffic Architecture Mobile

Custom metrics for services enrich Dynatrace AI and dashboarding capabilities (EAP)

Dynatrace

MAY 16, 2019

Are you applying AI to the unique metrics and KPIs that matter most to the success of your digital business? Do you provide dashboards and analytics that combine technical and business metrics that are specific to your business? Dynatrace out-of-the-box metrics generally focus on availability, failure rate, and performance.

Metrics

Metrics Analytics Programming Database

Flexible, scalable, self-service Kubernetes native observability now in General Availability

Dynatrace

MAY 17, 2022

From a cost perspective, internal customers waste valuable time sending tickets to operations teams asking for metrics, logs, and traces to be enabled. A team looking for metrics, traces, and logs no longer needs to file a ticket to get their app monitored in their own environments. This approach is costly and error prone.

Availability

Availability Scalability Cloud Metrics

Migrating Critical Traffic At Scale with No Downtime — Part 2

The Netflix TechBlog

JUNE 13, 2023

Replay traffic testing gives us the initial foundation of validation, but as our migration process unfolds, we are met with the need for a carefully controlled migration process. A process that doesn’t just minimize risk, but also facilitates a continuous evaluation of the rollout’s impact.

Traffic

Traffic Metrics Systems Strategy

Dynatrace SaaS release notes version 1.241

Dynatrace

MAY 12, 2022

To stay tuned, keep an eye on our release notes. Remediation tracking now enables you to view the risk assessment for the process groups affected by a vulnerability. Reintroduced a limit of 100,000 process group instances (last 72h) running on hosts presented on the “Deployment status” page for OneAgents. (APM-370529).

Tuning

Tuning Metrics Cloud Mobile

Data lakehouse innovations advance the three pillars of observability for more collaborative analytics

Dynatrace

FEBRUARY 16, 2023

The short answer: The three pillars of observability—logs, metrics, and traces—converging on a data lakehouse. The goal is to turn more data into insights so the whole organization can make data-driven decisions and automate processes. “Grail can store and process 1,000 petabytes per day,” Greifeneder explains.

Analytics

Analytics Innovation Metrics Database

Site-Speed Topography

CSS Wizardry

NOVEMBER 3, 2020

Any time you run a test with WebPageTest, you’ll get this table of different milestones and metrics. Higher variance means a less stable metric across pages. I can see from the screenshot above that TTFB is my most stable metric—no one page appears to have particularly expensive database queries or API calls on the back-end.

Speed

Speed Ecommerce Metrics Analytics

Full support for Google’s Core Web Vitals improves your user experience and search rankings

Dynatrace

FEBRUARY 19, 2021

To provide “quality signals that are essential to delivering a great user experience on the web,” Google introduced their Core Web Vitals initiative last year, advocating the Largest contentful paint, Cumulative layout shift, and First input delay metrics. with: Aggregated field metrics rather than valuable details

Google

Google Metrics Monitoring Network

Machine Learning for Fraud Detection in Streaming Services

The Netflix TechBlog

NOVEMBER 11, 2022

Synthetic Minority Over-sampling Technique Evaluation Metrics For evaluating the performance of the anomaly detection models we consider a set of evaluation metrics and report their values. For the one-class as well as binary anomaly detection task, such metrics are accuracy, precision, recall, f0.5,
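As a quick reference for these metrics, the following Python sketch computes precision, recall, and the F-beta score from confusion-matrix counts. With beta = 0.5 the score weights precision more heavily than recall. The function name and the example counts are illustrative, not taken from the article.

```python
def precision_recall_fbeta(tp, fp, fn, beta=0.5):
    """Compute precision, recall, and F-beta from raw counts.

    tp, fp, fn are true positives, false positives, and false negatives.
    Undefined ratios (zero denominators) are reported as 0.0.
    """
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    beta_sq = beta ** 2
    denom = beta_sq * precision + recall
    fbeta = (1 + beta_sq) * precision * recall / denom if denom else 0.0
    return precision, recall, fbeta


# Illustrative counts: 8 anomalies caught, 2 false alarms, 8 anomalies missed.
precision, recall, f_half = precision_recall_fbeta(tp=8, fp=2, fn=8)
```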

C++

C++ Metrics Tuning Strategy

Tech Transforms podcast: Navigating complex cloud environments and improving efficiency

Dynatrace

APRIL 3, 2023

UK Home Office: Metrics meets service The UK Home Office is the lead government department for many essential, large-scale programs. In this episode, Dimitris discusses the many different tools and processes they use. Make sure to stay connected with our social media pages. Tag us with #TechTransforms to be featured on our pages!

Efficiency

Efficiency Social Media Artificial Intelligence Cloud

Telltale: Netflix Application Monitoring Simplified

The Netflix TechBlog

AUGUST 13, 2020

A metric crossed a threshold. You’re half awake and wondering, “Is there really a problem or is this just an alert that needs tuning?” Telltale learns what constitutes typical health for an application, no alert tuning required. Metrics are a key part of understanding application health. Client metrics and QoE changes.

Monitoring

Monitoring Tuning Traffic Metrics

Unlock log analytics: Seamless insights without writing queries

Dynatrace

MAY 28, 2024

What about correlated trace data, host metrics, real-time vulnerability scanning results, or log messages captured just before an incident occurs? Stay tuned for even wider support of log data embedded seamlessly into the context of Dynatrace Apps, and better ways to get answers from logs without writing queries.

Analytics

Analytics Infrastructure Database Cloud

Crucial Redis Monitoring Metrics You Must Watch

Scalegrid

JANUARY 25, 2024

You will need to know which monitoring metrics for Redis to watch and a tool to monitor these critical server metrics to ensure its health. Redis returns a big list of database metrics when you run the info command on the Redis shell. You can pick a smart selection of relevant metrics from these.

Metrics

Metrics Monitoring Latency Cache

OneAgent for Linux on IBM Z (General Availability)

Dynatrace

NOVEMBER 20, 2019

Host performance is tracked from high-level health metrics on the home dashboard down to details for each of the hosts. For details on available metrics, see our help page on host performance monitoring. Network measurements with per-interface and per-process resolution. Network metrics are also collected for detected processes.

Availability

Availability Hardware Java Tuning

Software intelligence as code enables tailored observability, AIOps, and application security at scale

Dynatrace

FEBRUARY 9, 2022

Modern infrastructure needs to be elastic and GitOps approaches are used to automate the provisioning of infrastructure and applications using Git, an open-source control system that provides the change processes including reviews and approvals. Key components of GitOps are declarative infrastructure as code, orchestration, and observability.

Code

Code Software Software DevOps

Using Dynatrace to master the 5 pillars of the AWS Well-Architected Framework (Part 1)

Dynatrace

NOVEMBER 24, 2020

Tracking changes to automated processes, including auditing impacts to the system, and reverting to the previous environment states seamlessly. The ultimate goal of each of these reviews is to identify gaps, quantify risk, and develop recommendations for improving the team, processes, and architecture with each of the five pillars.

AWS

AWS Artificial Intelligence Best Practices Lambda
