Latency, Metrics and Monitoring - Technology Performance Pulse

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

Dynatrace

APRIL 10, 2025

Break data silos and add context for faster, more strategic decisions : Unifying metrics, logs, traces, and user behavior within a single platform enables real-time decisions rooted in full context, not guesswork. To save on storage and query costs, teams transition older data to cold storage, trimming out valuable details to save space.

Strategy

Strategy Storage Network Architecture

Next-level interaction and customization of data visualizations in Dynatrace Dashboards and Notebooks

Dynatrace

OCTOBER 10, 2024

Take your monitoring, data exploration, and storytelling to the next level with outstanding data visualization All your applications and underlying infrastructure produce vast volumes of data that you need to monitor or analyze for insights. Based on the color, you immediately see if any SLOs are off track.

Latency

Latency Infrastructure Monitoring Metrics

What is observability? Not just logs, metrics and traces

Dynatrace

OCTOBER 1, 2021

In IT and cloud computing, observability is the ability to measure a system’s current state based on the data it generates, such as logs, metrics, and traces. What is the difference between monitoring and observability? Is observability really monitoring by another name? What is observability? In short, no.

Metrics

Metrics Open Source Monitoring Cloud

Best practices and key metrics for improving mobile app performance

Dynatrace

DECEMBER 13, 2023

As a result, organizations need to monitor mobile app performance metrics that are meaningful and actionable by gaining adequate observability of mobile app performance. There are many common mobile app performance metrics that are used to measure key performance indicators (KPIs) related to user experience and satisfaction.

Best Practices

Best Practices Mobile Metrics Performance

Observability vs. monitoring: What’s the difference?

Dynatrace

NOVEMBER 3, 2021

This trend is prompting advances in both observability and monitoring. But exactly what are the differences between observability vs. monitoring? Monitoring and observability provide a two-pronged approach. To get a better understanding of observability vs monitoring, we’ll explore the differences between the two.

Monitoring

Monitoring Metrics DevOps Scalability

Cloud infrastructure monitoring in action: Dynatrace on Dynatrace

Dynatrace

SEPTEMBER 29, 2020

As of September 2020, we run 51 clusters on 1100 EC2 instances distributed across six AWS Regions ensuring that all our users can leverage the Dynatrace Software Intelligence Platform to monitor their hybrid-multi cloud environments. Sydney, we have a disk write latency problem!

Infrastructure

Infrastructure Cloud Monitoring AWS

AI-driven analysis of Spring Micrometer metrics in context, with typology at scale

Dynatrace

APRIL 7, 2022

Micrometer is used for instrumenting both out-of-the-box and custom metrics from Spring Boot applications. Davis topology-aware anomaly detection and alerting for your Micrometer metrics. Topology-related custom metrics for seamless reports and alerts. Micrometer uses a registry to export metrics to monitoring systems.

Metrics

Metrics Java Latency Cache

What is API monitoring?

Dynatrace

OCTOBER 4, 2021

As a result, API monitoring has become a must for DevOps teams. So what is API monitoring? What is API Monitoring? API monitoring is the process of collecting and analyzing data about the performance of an API in order to identify problems that impact users. The need for API monitoring. Ways to monitor APIs.

Monitoring

Monitoring Latency Metrics Availability

What is real user monitoring (RUM)?

Dynatrace

JANUARY 13, 2022

Real user monitoring can help you catch these issues before they impact the bottom line. What is real user monitoring? Real user monitoring (RUM) is a performance monitoring process that collects detailed data about a user’s interaction with an application. Real user monitoring collects data on a variety of metrics.

Monitoring

Monitoring Mobile Latency Best Practices

Real user monitoring vs. synthetic monitoring: Understanding best practices

Dynatrace

JUNE 27, 2022

As businesses compete for customer loyalty, it’s critical to understand the difference between real-user monitoring and synthetic user monitoring. However, not all user monitoring systems are created equal. What is real user monitoring? RUM gathers information on a variety of performance metrics.

Best Practices

Best Practices Monitoring Wireless Traffic

How to Configure Istio, Prometheus and Grafana for Monitoring

DZone

AUGUST 29, 2023

One of the primary responsibilities of Site reliability engineers (SREs) in large organizations is to monitor the golden metrics of their applications, such as CPU utilization, memory utilization, latency, and throughput.

Monitoring

Monitoring Latency Network Infrastructure

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

RabbitMQ can be deployed in distributed environments and includes monitoring tools through a built-in dashboard and CLI. Its partitioned log architecture supports both queuing and publish-subscribe models, allowing it to handle large-scale event processing with minimal latency.

Latency

Latency Analytics Architecture Storage

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

The Challenge of Title Launch Observability As engineers, were wired to track system metrics like error rates, latencies, and CPU utilizationbut what about metrics that matter to a titlessuccess? Option 1: Log Processing Log processing offers a straightforward solution for monitoring and analyzing title launches.

Traffic

Traffic Scalability Strategy Monitoring

How to Install Pixie for Kubernetes Monitoring: The Complete Guide

DZone

SEPTEMBER 4, 2022

It is important to highlight that most older monitoring systems were considered inefficient due to their operational overhead. Pixie offers monitoring, telemetry, metrics, and more with less than 5% CPU overhead and latency degradation during data collection.

Monitoring

Monitoring Latency Metrics Cloud

Dynatrace automatically monitors OpenAI ChatGPT for companies that deliver reliable, cost-effective services powered by generative AI

Dynatrace

JUNE 7, 2023

One of the crucial success factors for delivering cost-efficient and high-quality AI-agent services, following the approach described above, is to closely observe their cost, latency, and reliability. With these latency, reliability, and cost measurements in place, your operations team can now define their own OpenAI dashboards and SLOs.

Monitoring

Monitoring Latency Metrics Azure

How digital experience monitoring helps deliver business observability

Dynatrace

APRIL 26, 2022

Fast, consistent application delivery creates a positive user experience that can ultimately drive customer loyalty and improve business metrics like conversion rate and user retention. What is digital experience monitoring? Primary digital experience monitoring tools.

Monitoring

Monitoring Social Media IoT Metrics

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

Optimizing RabbitMQ performance through strategies such as keeping queues short, enabling lazy queues, and monitoring health checks is essential for maintaining system efficiency and effectively managing high traffic loads. Monitoring the cluster nodes preemptively addresses potential issues, ensuring the system operates smoothly.

Best Practices

Best Practices Traffic Strategy Scalability

Noisy Neighbor Detection with eBPF

The Netflix TechBlog

SEPTEMBER 10, 2024

In this blog post, we'll reveal how we leveraged eBPF to achieve continuous, low-overhead instrumentation of the Linux scheduler, enabling effective self-serve monitoring of noisy neighbor issues. Learn how Linux kernel instrumentation can improve your infrastructure observability with deeper insights and enhanced monitoring.

Latency

Latency Metrics Programming Monitoring

Crucial Redis Monitoring Metrics You Must Watch

Scalegrid

JANUARY 25, 2024

You will need to know which monitoring metrics for Redis to watch and a tool to monitor these critical server metrics to ensure its health. Redis returns a big list of database metrics when you run the info command on the Redis shell. You can pick a smart selection of relevant metrics from these.

Metrics

Metrics Monitoring Latency Cache

Telltale: Netflix Application Monitoring Simplified

The Netflix TechBlog

AUGUST 13, 2020

A metric crossed a threshold. Over the years we’ve learned from on-call engineers about the pain points of application monitoring: too many alerts, too many dashboards to scroll through, and too much configuration and maintenance. Metrics are a key part of understanding application health. Client metrics and QoE changes.

Monitoring

Monitoring Tuning Traffic Metrics

Implementing service-level objectives to improve software quality

Dynatrace

DECEMBER 27, 2022

By implementing service-level objectives, teams can avoid collecting and checking a huge amount of metrics for each service. In what follows, we explore some of these best practices and guidance for implementing service-level objectives in your monitored environment. Latency is the time that it takes a request to be served.

Software

Software Software Benchmarking Latency

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

Highlighting NewReleases For new content, impression history helps us monitor initial user interactions and adjust our merchandising efforts accordingly. We accomplish this by gathering detailed column-level metrics that offer insights into the state and quality of each impression.

Tuning

Tuning Latency Efficiency Storage

Who will watch the watchers? Extended infrastructure observability for WSO2 API Manager

Dynatrace

SEPTEMBER 18, 2020

Save hours of bug hunting with out-of-the-box WSO2 API Manager monitoring. The Dynatrace Software Intelligence Platform gives you a complete Infrastructure Monitoring solution for monitoring cloud platforms and virtual infrastructure, along with log monitoring and AIOps. High latency or lack of responses.

Infrastructure

Infrastructure Latency Metrics Cloud

Optimize Citrix platform performance and user experience with Dynatrace (GA)

Dynatrace

JANUARY 15, 2020

Having released this functionality in an Preview Release back in September 2019, we’re now happy to announce the General Availability of our Citrix monitoring extension. Synthetic monitoring: Citrix login availability and performance. OneAgent: Citrix StoreFront services discovered and monitored by Dynatrace. Dynatrace news.

Latency

Latency Performance Virtualization Infrastructure

OpenTelemetry 101: A nontechnical guide for IT leaders and enthusiasts

Dynatrace

JULY 22, 2024

Observability Observability is the ability to determine a system’s health by analyzing the data it generates, such as logs, metrics, and traces. There are three main types of telemetry data: Metrics. Metrics are typically aggregated and stored in time series databases for monitoring and alerting purposes.

Latency

Latency Best Practices Metrics Open Source

Get up to 300 new metrics out of the box with AWS supporting services (GA)

Dynatrace

DECEMBER 22, 2019

At Dynatrace, we’re constantly improving our AWS monitoring capabilities. Monitor and understand additional AWS services. Supporting services include every service that isn’t available with out-of-the-box Dynatrace monitoring. Get up to 300 new AWS metrics out of the box. Updated AWS monitoring policy.

AWS

AWS Metrics IoT Storage

Maximize user experience with out-of-the-box service-performance SLOs

Dynatrace

AUGUST 25, 2023

SLOs cover a wide range of monitoring options for different applications. According to the Google Site Reliability Engineering (SRE) handbook, monitoring the four golden signals is crucial in delivering high-performing software solutions. Service-performance template Latency is often described as the time a request takes to be served.

Performance

Performance Latency Traffic Metrics

Migrating Netflix to GraphQL Safely

The Netflix TechBlog

JUNE 14, 2023

So, we relied on higher-level metrics-based testing: AB Testing and Sticky Canaries. To determine customer impact, we could compare various metrics such as error rates, latencies, and time to render. Wins High-Level Health Metrics: AB Testing provided the assurance we needed in our overall client-side GraphQL implementation.

Traffic

Traffic Latency Metrics Cache

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

This approach enhances key DORA metrics and enables early detection of failures in the release process, allowing SREs more time for innovation. This blog post explores the Reliability metric , which measures modern operational practices. Why reliability?

Engineering

Engineering Systems Latency Metrics

Get up to 300 new metrics out of the box with AWS supporting services (GA)

Dynatrace

DECEMBER 22, 2019

At Dynatrace, we’re constantly improving our AWS monitoring capabilities. Monitor and understand additional AWS services. Supporting services include every service that isn’t available with out-of-the-box Dynatrace monitoring. Get up to 300 new AWS metrics out of the box. Updated AWS monitoring policy.

AWS

AWS Metrics IoT Storage

Dynatrace supports Azure Managed Instance for Apache Cassandra

Dynatrace

MAY 13, 2022

This extension provides fully app-centric Cassandra performance monitoring for Azure Managed Instance for Apache Cassandra. Cassandra is also essential to Dynatrace because it is integral to our monitoring solution. Provide a foundation for calculating metrics in dashboard charts.

Azure

Azure Latency Metrics Infrastructure

Optimize your observability pipeline for AWS Lambda serverless functions

Dynatrace

NOVEMBER 10, 2022

For AWS Lambda, Dynatrace provides Lambda Layers for adding distributed tracing to your serverless functions and for capturing metrics and logs from Amazon CloudWatch. This makes it easier to apply/enforce monitoring policies as fewer teams are involved (e.g. The need for a simplified approach to capture telemetry. How to get started.

Lambda

Lambda Serverless AWS Latency

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

MAY 4, 2023

The second phase involves migrating the traffic over to the new systems in a manner that mitigates the risk of incidents while continually monitoring and confirming that we are meeting crucial metrics tracked at multiple levels. It provides a good read on the availability and latency ranges under different production conditions.

Traffic

Traffic Latency Tuning Systems

What are quality gates? How to use quality gates to deliver better software at speed and scale

Dynatrace

FEBRUARY 21, 2024

Automating quality gates is ideal, as it minimizes manually checking and validating key metrics throughout the SDLC. By actively monitoring metrics such as error rate, success rate, and CPU load, quality gates instill confidence in teams during software releases. Fewer expensive fixes.

Speed

Speed Software Software Latency

SLOs done right: how DevOps teams can build better service-level objectives

Dynatrace

MARCH 16, 2023

Enterprises now have access to myriad metrics they can track and measure, but an abundance of choice doesn’t equal actionable insight. Indeed, 54% of SREs say they handle too many metrics, making it increasingly difficult to find the most relevant ones for a particular service, according to the Dynatrace State of SRE Report.

DevOps

DevOps Latency Metrics Traffic

Dynatrace supports SnapStart for Lambda as an AWS launch partner

Dynatrace

NOVEMBER 28, 2022

The new Amazon capability enables customers to improve the startup latency of their functions from several seconds to as low as sub-second (up to 10 times faster) at P99 (the 99th latency percentile). This can cause latency outliers and may lead to a poor end-user experience for latency-sensitive applications.

Lambda

Lambda AWS Serverless Latency

Redis® Monitoring Strategies for 2025

Scalegrid

JANUARY 21, 2025

In todays data-driven world, the ability to effectively monitor and manage data is of paramount importance. With its widespread use in modern application architectures, understanding the ins and outs of Redis monitoring is essential for any tech professional. Redis, a powerful in-memory data store, is no exception.

Strategy

Strategy Monitoring Latency DevOps

Get seamless insights into Nutanix clusters with Dynatrace

Dynatrace

NOVEMBER 9, 2023

Easily monitor your Nutanix clusters with Dynatrace The Dynatrace Nutanix Cluster Extension offers straightforward yet powerful features to help you streamline your monitoring with an easy one-click activation via Dynatrace Hub. However, this approach doesn’t provide seamless monitoring coverage.

Virtualization

Virtualization Storage Metrics Monitoring

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

MAY 31, 2023

As a result, site reliability has emerged as a critical success metric for many organizations. The practice uses continuous monitoring and high levels of automation in close collaboration with agile development teams to ensure applications are highly available and perform without friction. Service-level objectives (SLOs). availability.

Best Practices

Best Practices DevOps Latency Metrics

Lessons learned from enterprise service-level objective management

Dynatrace

MAY 19, 2022

Organizations have multiple stakeholders and almost always have different teams that set up monitoring, operate systems, and develop new functionality. The monitoring team set up the dashboard, so who owns violations? In their new dashboard, they added dimensions for load, latency, and open problems for each component.

Automotive

Automotive Latency Architecture Azure

Improved Alerting with Atlas Streaming Eval

The Netflix TechBlog

APRIL 27, 2023

A few years ago, we were paged by our SRE team due to our Metrics Alerting System falling behind — critical application health alerts reached engineers 45 minutes late! Hence, we started down the path of alert evaluation via real-time streaming metrics. This has proven to be valuable towards reducing Mean Time to Recover (MTTR).

Storage

Storage Cache Metrics Database

What is full stack observability?

Dynatrace

APRIL 6, 2022

A full-stack observability solution uses telemetry data such as logs, metrics, and traces to give IT teams insight into application, infrastructure, and UX performance. Comprehensive observability is also essential for digital experience monitoring (DEM). Why full-stack observability matters. See observability in action!

DevOps

DevOps Innovation Infrastructure Cloud

Low Overhead Continuous Contextual Production Profiling

DZone

JUNE 15, 2023

In order to gain insight into these problems, we gather a range of metrics and logs to monitor the utilization of system resources such as CPU, memory, and application-specific latencies. It is worth noting that this data collection process does not impact the performance of the application.

Latency

Latency Storage Strategy Metrics

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

The Netflix TechBlog

JUNE 13, 2023

A small percentage of production traffic is redirected to the two new clusters, allowing us to monitor the new version’s performance and compare it against the current version. By tracking metrics only at the level of service being updated, we might miss capturing deviations in broader end-to-end system functionality.

Traffic

Traffic Metrics Systems Strategy

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

Next-level interaction and customization of data visualizations in Dynatrace Dashboards and Notebooks

Trending Sources

What is observability? Not just logs, metrics and traces

Best practices and key metrics for improving mobile app performance

Observability vs. monitoring: What’s the difference?

Cloud infrastructure monitoring in action: Dynatrace on Dynatrace

AI-driven analysis of Spring Micrometer metrics in context, with typology at scale

What is API monitoring?

What is real user monitoring (RUM)?

Real user monitoring vs. synthetic monitoring: Understanding best practices

How to Configure Istio, Prometheus and Grafana for Monitoring

RabbitMQ vs. Kafka: Key Differences

Title Launch Observability at Netflix Scale

How to Install Pixie for Kubernetes Monitoring: The Complete Guide

Dynatrace automatically monitors OpenAI ChatGPT for companies that deliver reliable, cost-effective services powered by generative AI

How digital experience monitoring helps deliver business observability

Best Practices for Scaling RabbitMQ

Noisy Neighbor Detection with eBPF

Crucial Redis Monitoring Metrics You Must Watch

Telltale: Netflix Application Monitoring Simplified

Implementing service-level objectives to improve software quality

Introducing Impressions at Netflix

Who will watch the watchers? Extended infrastructure observability for WSO2 API Manager

Optimize Citrix platform performance and user experience with Dynatrace (GA)

OpenTelemetry 101: A nontechnical guide for IT leaders and enthusiasts

Get up to 300 new metrics out of the box with AWS supporting services (GA)

Maximize user experience with out-of-the-box service-performance SLOs

Migrating Netflix to GraphQL Safely

Build systems more reliably with Dynatrace: Chaos Engineering

Get up to 300 new metrics out of the box with AWS supporting services (GA)

Dynatrace supports Azure Managed Instance for Apache Cassandra

Optimize your observability pipeline for AWS Lambda serverless functions

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

What are quality gates? How to use quality gates to deliver better software at speed and scale

SLOs done right: how DevOps teams can build better service-level objectives

Dynatrace supports SnapStart for Lambda as an AWS launch partner

Redis® Monitoring Strategies for 2025

Get seamless insights into Nutanix clusters with Dynatrace

Site reliability done right: 5 SRE best practices that deliver on business objectives

Lessons learned from enterprise service-level objective management

Improved Alerting with Atlas Streaming Eval

What is full stack observability?

Low Overhead Continuous Contextual Production Profiling

Migrating Critical Traffic At Scale with No Downtime?—?Part 2

Stay Connected