Infrastructure, Metrics and Tuning - Technology Performance Pulse

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Dynatrace

FEBRUARY 6, 2025

With Dashboards , you can monitor business performance, user interactions, security vulnerabilities, IT infrastructure health, and so much more, all in real time. Even if infrastructure metrics aren’t your thing, you’re welcome to join us on this creative journey simply swap out the suggested metrics for ones that interest you.

Metrics

Metrics Infrastructure Monitoring Best Practices

Extend infrastructure observability with JMX Extensions and additional full-stack metrics

Dynatrace

APRIL 20, 2020

Infrastructure exists to support the backing services that are collectively perceived by users to be your web application. Issues that manifest themselves as performance degradation on a user’s device can often be traced back to underlying infrastructure issues. Monitor additional metrics. Dynatrace news.

Infrastructure

Infrastructure Metrics Java Virtualization

Monitoring of Kubernetes Infrastructure for day 2 operations

Dynatrace

JULY 8, 2020

One of the promises of container orchestration platforms is to make i t easier for the developers to accelerate the deployment of their app lication s without having to worry about scalability and infrastructure dependencies. It is important to understand the impact infrastructure can have on the platform and the application it runs.

Infrastructure

Infrastructure Monitoring Cloud Metrics

Dynatrace innovates again with the release of topology-driven auto-adaptive metric baselines

Dynatrace

JULY 23, 2020

With the advent and ingestion of thousands of custom metrics into Dynatrace, we’ve once again pushed the boundaries of automatic, AI-based root cause analysis with the introduction of auto-adaptive baselines as a foundational concept for Dynatrace topology-driven timeseries measurements. In many cases, metric behavior changes over time.

Metrics

Metrics Innovation Strategy Monitoring

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

Now let’s look at how we designed the tracing infrastructure that powers Edgar. This insight led us to build Edgar: a distributed tracing infrastructure and user experience. Our distributed tracing infrastructure is grouped into three sections: tracer library instrumentation, stream processing, and storage.

Infrastructure

Infrastructure Transportation Storage Open Source

9 key DevOps metrics for success

Dynatrace

SEPTEMBER 28, 2021

The emerging concepts of working with DevOps metrics and DevOps KPIs have really come a long way. DevOps metrics to help you meet your DevOps goals. Like any IT or business project, you’ll need to track critical key metrics. Here are nine key DevOps metrics and DevOps KPIs that will help you be successful.

DevOps

DevOps Metrics Traffic Efficiency

Running the Astronomy Shop OpenTelemetry demo application with Dynatrace

Dynatrace

MARCH 13, 2025

OpenTelemetry provides a common set of tools, APIs, and SDKs to help collect observability signals from applications and infrastructure endpoints. The configuration also includes an optional span metrics connector, which generates Request, Error, and Duration (R.E.D.) metrics from span data. metrics from span data.

Open Source

Open Source Metrics Architecture Infrastructure

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

This challenge has given rise to the discipline of observability engineering, which concentrates on the details of telemetry data to fine-tune observability use cases. To get a more granular look into telemetry data, many analysts rely on custom metrics using Prometheus.

Metrics

Metrics Engineering Energy Tuning

New web performance insights with additional metrics and enhanced Visually complete for synthetic monitors

Dynatrace

AUGUST 5, 2020

Recently introduced improvements to Visually complete and new web performance metrics for Real User Monitoring are now available for Synthetic Monitoring as well. Ensure better user experience with paint-focused performance metrics. These metrics are tightly connected to the perceived load speed of your application.

Metrics

Metrics Monitoring Performance Benchmarking

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

Optimizing RabbitMQ requires clustering, queue management, and resource tuning to maintain stability and efficiency. By analyzing benchmark results, organizations can determine which system aligns best with their infrastructure needswhether its high-speed event processing or reliable message queuing for microservices.

Latency

Latency Analytics Architecture Storage

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

The Challenge of Title Launch Observability As engineers, were wired to track system metrics like error rates, latencies, and CPU utilizationbut what about metrics that matter to a titlessuccess? This approach provides a few advantages: Low burden on existing systems: Log processing imposes minimal changes to existing infrastructure.

Traffic

Traffic Scalability Strategy Monitoring

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

The Netflix TechBlog

MARCH 25, 2019

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and Efficiency By: Di Lin , Girish Lingappa , Jitender Aswani Imagine yourself in the role of a data-inspired decision maker staring at a metric on a dashboard about to make a critical business decision but pausing to ask a question?—?“Can

Infrastructure

Infrastructure Big Data Transportation Architecture

How Netflix uses eBPF flow logs at scale for network insight

The Netflix TechBlog

JUNE 7, 2021

Challenges The cloud network infrastructure that Netflix utilizes today consists of AWS services such as VPC, DirectConnect, VPC Peering, Transit Gateways, NAT Gateways, etc and Netflix owned devices. The Flow Exporter also publishes various operational metrics to Atlas. So how do we ingest and enrich these flows at scale ?

Network

Network Transportation AWS Cloud

Flexible, scalable, self-service Kubernetes native observability now in General Availability

Dynatrace

MAY 17, 2022

From a cost perspective, internal customers waste valuable time sending tickets to operations teams asking for metrics, logs, and traces to be enabled. A team looking for metrics, traces, and logs no longer needs to file a ticket to get their app monitored in their own environments.

Availability

Availability Scalability Cloud Metrics

New Prometheus-based extensions enable intelligent observability for more than 200 additional technologies

Dynatrace

FEBRUARY 14, 2022

Building on its advanced analytics capabilities for Prometheus data , Dynatrace now enables you to create extensions based on Prometheus metrics. Many technologies expose their metrics in the Prometheus data format. Many technologies expose their metrics in the Prometheus data format. Our monitoring coverage already includes ?

Technology

Technology Technology Metrics Infrastructure

Unlock log analytics: Seamless insights without writing queries

Dynatrace

MAY 28, 2024

What about correlated trace data, host metrics, real-time vulnerability scanning results, or log messages captured just before an incident occurs? Depending on which app is in use, one glance at a histogram provides invaluable insight into managing clouds, databases, Kubernetes environments, and infrastructure.

Analytics

Analytics Infrastructure Database Cloud

Dynatrace simplifies StatsD, Telegraf, and Prometheus observability with Davis AI

Dynatrace

OCTOBER 7, 2020

Open-source metric sources automatically map to our Smartscape model for AI analytics. We’ve just enhanced Dynatrace OneAgent with an open metric API. Here’s a quick overview of what you can achieve now that the Dynatrace Software Intelligence Platform has been extended to ingest third-party metrics. Dynatrace news.

Open Source

Open Source Metrics Analytics Tuning

Enable full observability for Linux on IBM Z mainframe now with logs

Dynatrace

APRIL 17, 2024

The challenge for hybrid cloud deployments is maintaining critical observability, which must include the full set of monitoring signals: logs, metrics, and traces. For example, on the Dynatrace platform, open the new Infrastructure & Operations app and navigate to any monitored host running on Linux on IBM Z (s390 architecture).

Operating System

Operating System Energy Infrastructure Architecture

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Failures can occur unpredictably across various levels, from physical infrastructure to software layers. Optimized fault recovery We’re also interested in exploring the potential of tuning configurations to improve recovery speed and performance after failures and avoid the demand for additional computing resources.

Engineering

Engineering Tuning Latency Open Source

Software intelligence as code enables tailored observability, AIOps, and application security at scale

Dynatrace

FEBRUARY 9, 2022

More recently, teams have begun to apply DevOps best practices to infrastructure automation, giving developers a more active role with GitOps as an operational framework. Key components of GitOps are declarative infrastructure as code, orchestration, and observability. Dynatrace enables software intelligence as code.

Code

Code Software Software DevOps

Migrating Netflix to GraphQL Safely

The Netflix TechBlog

JUNE 14, 2023

So, we relied on higher-level metrics-based testing: AB Testing and Sticky Canaries. To determine customer impact, we could compare various metrics such as error rates, latencies, and time to render. We spent the next few months diving into these high-level metrics and fixing issues such as cache TTLs, flawed client assumptions, etc.

Traffic

Traffic Latency Metrics Cache

Observability throughout the software development lifecycle increases delivery performance

Dynatrace

OCTOBER 4, 2024

A central element of platform engineering teams is a robust Internal Developer Platform (IDP), which encompasses a set of tools, services, and infrastructure that enables developers to build, test, and deploy software applications. Stay tuned Currently, the API allows for the configuration of an event processing pipeline.

Software

Software Software Development Performance

OneAgent for Linux on IBM Z (General Availability)

Dynatrace

NOVEMBER 20, 2019

At Dynatrace, where we provide a software intelligence platform for hybrid environments (from infrastructure to cloud) we see a growing need to measure how mainframe architecture and the services running on it contribute to the overall performance and availability of applications. Network metrics are also collected for detected processes.

Availability

Availability Hardware Java Tuning

Auto-adaptive thresholds for AI-driven quality gating

Dynatrace

JUNE 4, 2024

While platform engineers can build and prepare the necessary infrastructure and templates for self-adoption, developers must still provide some customization. Our data scientists utilize metrics and events to store these quality metrics. Ideally, this should be a self-service offering that enables individual adoption by teams.

Metrics

Metrics Engineering Code Tuning

OpenTelemetry 101: A nontechnical guide for IT leaders and enthusiasts

Dynatrace

JULY 22, 2024

Observability Observability is the ability to determine a system’s health by analyzing the data it generates, such as logs, metrics, and traces. There are three main types of telemetry data: Metrics. Metrics are typically aggregated and stored in time series databases for monitoring and alerting purposes.

Latency

Latency Best Practices Metrics Open Source

Kubernetes made simple? Kelsey Hightower and Andreas Grabner discuss the future of cloud-native technologies

Dynatrace

FEBRUARY 17, 2022

With automatic and intelligent observability of all their infrastructure, apps, services, and workloads and their dependencies, Dynatrace pinpoints exactly where something is going wrong. If you’re going to have an SLO, you should have a story in mind of why you’re setting up all these alerts and collecting all these metrics.

Technology

Technology Technology Cloud Infrastructure

Dynatrace extends automatic and intelligent observability to cloud and Kubernetes logs for smarter automation at scale

Dynatrace

FEBRUARY 8, 2021

Every service and component exposes observability data (metrics, logs, and traces) that contains crucial information to drive digital businesses. Some companies are still using different tools for application performance monitoring, infrastructure monitoring, and log monitoring. Any log event (JSON or plain text) via HTTP REST API.

Cloud

Cloud Azure Analytics Open Source

Data lakehouse innovations advance the three pillars of observability for more collaborative analytics

Dynatrace

FEBRUARY 16, 2023

The short answer: The three pillars of observability—logs, metrics, and traces—converging on a data lakehouse. You’re getting all the architectural benefits of Grail—the petabytes, the cardinality—with this implementation,” including the three pillars of observability: logs, metrics, and traces in context.

Analytics

Analytics Innovation Metrics Database

A look behind the scenes of AWS Lambda and our new Lambda monitoring extension

Dynatrace

FEBRUARY 25, 2021

Distributing accounts across the infrastructure is an architectural decision, as a given account often has similar usage patterns, languages, and sizes for their Lambda functions. This is another measure to evenly redistribute the load within the AWS Lambda infrastructure. file uploaded to AWS Lambda. The Lambda execution life cycle.

Lambda

Lambda AWS Monitoring Serverless

Large scale deployments are easy and cost-effective with network zones (Early Adopter)

Dynatrace

JULY 2, 2020

By minimizing bandwidth and preventing unrelated traffic between data centers, you can maintain healthy network infrastructure and save on costs. In combination with ActiveGates, network zones save bandwidth and infrastructure costs and by: compressing OneAgent traffic. This saves bandwidth and infrastructure costs.

Network

Network Traffic Infrastructure Tuning

Build and operate multicloud FaaS with enhanced, intelligent end-to-end observability

Dynatrace

APRIL 25, 2023

These functions are executed by a serverless platform or provider (such as AWS Lambda, Azure Functions or Google Cloud Functions) that manages the underlying infrastructure, scaling and billing. Enable faster development and deployment cycles by abstracting away the infrastructure complexity.

Serverless

Serverless Lambda Azure AWS

OpenTelemetry services analysis and endpoint detection made easier with Dynatrace unified services

Dynatrace

JANUARY 4, 2024

Great news: OpenTelemetry endpoint detection, analyzing OpenTelemetry services, and visualizing Istio service mesh metrics just got easier. As a CNCF open source incubating project, OpenTelemetry provides a standardized set of APIs, libraries, agents, instrumentation, and specifications for logging, metrics, and tracing.

Metrics

Metrics Open Source Tuning Infrastructure

Kubernetes vs Docker: What’s the difference?

Dynatrace

SEPTEMBER 29, 2021

Think of containers as the packaging for microservices that separate the content from its environment – the underlying operating system and infrastructure. For a deeper look into how to gain end-to-end observability into Kubernetes environments, tune into the on-demand webinar Harness the Power of Kubernetes Observability.

Open Source

Open Source DevOps Traffic Cloud

Conducting log analysis with an observability platform and full data context

Dynatrace

APRIL 20, 2023

With more automated approaches to log monitoring and log analysis, however, organizations can gain visibility into their applications and infrastructure efficiently and with greater precision—even as cloud environments grow. They enable IT teams to identify and address the precise cause of application and infrastructure issues.

Analytics

Analytics Infrastructure Storage Architecture

Telltale: Netflix Application Monitoring Simplified

The Netflix TechBlog

AUGUST 13, 2020

A metric crossed a threshold. You’re half awake and wondering, “Is there really a problem or is this just an alert that needs tuning? Telltale learns what constitutes typical health for an application, no alert tuning required. Metrics are a key part of understanding application health. Infrastructure change events.

Monitoring

Monitoring Tuning Traffic Metrics

Python at Netflix

The Netflix TechBlog

APRIL 29, 2019

An easy, though imprecise, way of thinking about Netflix infrastructure is that everything that happens before you press Play on your remote control (e.g., Various software systems are needed to design, build, and operate this CDN infrastructure, and a significant number of them are written in Python. are you logged in?

Open Source

Open Source Network Infrastructure Big Data

Using Dynatrace to master the 5 pillars of the AWS Well-Architected Framework (Part 1)

Dynatrace

NOVEMBER 24, 2020

Do we have the ability (process, frameworks, tooling) to quickly deploy new services and underlying IT infrastructure and if we do, do we know that we are not disrupting our end users? Automatic collection of the entire set of services that publish metrics to Amazon CloudWatch. Stay tuned. Dynatrace and AWS.

AWS

AWS Artificial Intelligence Best Practices Lambda

Resolving technical debt helps state and local agencies improve business impact

Dynatrace

APRIL 21, 2023

The agencies resisted adopting the tool because it required significant time to configure and tune collected metrics into valuable information. Further, the toolset had been in place for 20 years resulting in high annual software maintenance and infrastructure costs. over five years. Register to listen to the webinar.

Government

Government Infrastructure Innovation Monitoring

Why log monitoring and log analytics matter in a hyperscale world

Dynatrace

NOVEMBER 15, 2021

This includes troubleshooting issues with software, services, and applications, and any infrastructure they interact with, such as multicloud platforms, container environments, and data repositories. Log analytics also help identify ways to make infrastructure environments more predictable, efficient, and resilient. More automation.

Analytics

Analytics Monitoring DevOps Artificial Intelligence

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

With 24/7 expert support, ScaleGrid assists with troubleshooting, performance tuning, and migration processes. Metrics and Statistics Monitoring the performance of a RabbitMQ cluster is crucial for maintaining its efficiency and reliability. ScaleGrid ensures high availability through automatic failover and advanced monitoring tools.

Best Practices

Best Practices Traffic Strategy Scalability

Applying Netflix DevOps Patterns to Windows

The Netflix TechBlog

AUGUST 22, 2019

Artisan Crafted Images In the Netflix full cycle DevOps culture the team responsible for building a service is also responsible for deploying, testing, infrastructure, and operation of that service. Now each change in the infrastructure is tested, canaried, and deployed like any other code change.

DevOps

DevOps AWS Tuning Infrastructure

Auto-Diagnosis and Remediation in Netflix Data Platform

The Netflix TechBlog

JANUARY 13, 2022

Troubleshooting these problems is not a trivial task and requires collecting logs and metrics from several different systems and analyzing them to identify the root cause. Pensive infrastructure comprises two separate systems to support batch and streaming workloads. Expand Pensive with Machine Learning classifiers.

Big Data

Big Data Infrastructure Metrics Games

How digital experience monitoring helps deliver business observability

Dynatrace

APRIL 26, 2022

Fast, consistent application delivery creates a positive user experience that can ultimately drive customer loyalty and improve business metrics like conversion rate and user retention. With DEM solutions, organizations can operate over on-premise network infrastructure or private or public cloud SaaS or IaaS offerings.

Monitoring

Monitoring Social Media IoT Metrics

Running the OpenTelemetry demo application with Dynatrace

Dynatrace

OCTOBER 6, 2022

Jaeger and Prometheus backends for displaying the collected traces and metrics, but you can easily configure alternative backends. Both methods ingest data, but by using the Dynatrace OneAgent, users can automatically discover additional insights about their infrastructure, applications, processes, services and databases.

Open Source

Open Source Metrics Tuning Technology

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Extend infrastructure observability with JMX Extensions and additional full-stack metrics

Trending Sources

Monitoring of Kubernetes Infrastructure for day 2 operations

Dynatrace innovates again with the release of topology-driven auto-adaptive metric baselines

Building Netflix’s Distributed Tracing Infrastructure

9 key DevOps metrics for success

Running the Astronomy Shop OpenTelemetry demo application with Dynatrace

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

New web performance insights with additional metrics and enhanced Visually complete for synthetic monitors

RabbitMQ vs. Kafka: Key Differences

Title Launch Observability at Netflix Scale

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

How Netflix uses eBPF flow logs at scale for network insight

Flexible, scalable, self-service Kubernetes native observability now in General Availability

New Prometheus-based extensions enable intelligent observability for more than 200 additional technologies

Unlock log analytics: Seamless insights without writing queries

Dynatrace simplifies StatsD, Telegraf, and Prometheus observability with Davis AI

Enable full observability for Linux on IBM Z mainframe now with logs

Why applying chaos engineering to data-intensive applications matters

Software intelligence as code enables tailored observability, AIOps, and application security at scale

Migrating Netflix to GraphQL Safely

Observability throughout the software development lifecycle increases delivery performance

OneAgent for Linux on IBM Z (General Availability)

Auto-adaptive thresholds for AI-driven quality gating

OpenTelemetry 101: A nontechnical guide for IT leaders and enthusiasts

Kubernetes made simple? Kelsey Hightower and Andreas Grabner discuss the future of cloud-native technologies

Dynatrace extends automatic and intelligent observability to cloud and Kubernetes logs for smarter automation at scale

Data lakehouse innovations advance the three pillars of observability for more collaborative analytics

A look behind the scenes of AWS Lambda and our new Lambda monitoring extension

Large scale deployments are easy and cost-effective with network zones (Early Adopter)

Build and operate multicloud FaaS with enhanced, intelligent end-to-end observability

OpenTelemetry services analysis and endpoint detection made easier with Dynatrace unified services

Kubernetes vs Docker: What’s the difference?

Conducting log analysis with an observability platform and full data context

Telltale: Netflix Application Monitoring Simplified

Python at Netflix

Using Dynatrace to master the 5 pillars of the AWS Well-Architected Framework (Part 1)

Resolving technical debt helps state and local agencies improve business impact

Why log monitoring and log analytics matter in a hyperscale world

Best Practices for Scaling RabbitMQ

Applying Netflix DevOps Patterns to Windows

Auto-Diagnosis and Remediation in Netflix Data Platform

How digital experience monitoring helps deliver business observability

Running the OpenTelemetry demo application with Dynatrace

Stay Connected