Development, Latency and Metrics - Technology Performance Pulse

Optimising for High Latency Environments

CSS Wizardry

SEPTEMBER 16, 2024

This gives fascinating insights into the network topography of our visitors, and how much we might be impacted by high latency regions. Round-trip-time (RTT) is basically a measure of latency—how long did it take to get from one endpoint to another and back again? RTT data should be seen as an insight and not a metric.

Latency

Latency Cache Transportation Mobile

What is observability? Not just logs, metrics and traces

Dynatrace

OCTOBER 1, 2021

In IT and cloud computing, observability is the ability to measure a system’s current state based on the data it generates, such as logs, metrics, and traces. The architects and developers who create the software must design it to be observed. Why is it important, and what can it actually help organizations achieve?

Metrics

Metrics Open Source Monitoring Cloud

Best practices and key metrics for improving mobile app performance

Dynatrace

DECEMBER 13, 2023

As a result, organizations need to monitor mobile app performance metrics that are meaningful and actionable by gaining adequate observability of mobile app performance. There are many common mobile app performance metrics that are used to measure key performance indicators (KPIs) related to user experience and satisfaction.

Best Practices

Best Practices Mobile Metrics Performance

Application observability meets developer observability: Unlock a 360º view of your environment

Dynatrace

NOVEMBER 6, 2023

Application observability helps IT teams gain visibility in their highly distributed systems, but what is developer observability and why is it important? In a recent webinar, Dynatrace DevOps activist Andi Grabner and senior software engineer Yarden Laifenfeld explored developer observability. Observability is about answering.

Development

Development DevOps Programming Cloud

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

Its partitioned log architecture supports both queuing and publish-subscribe models, allowing it to handle large-scale event processing with minimal latency. Apache Kafka uses a custom TCP/IP protocol for high throughput and low latency. Apache Kafka, designed for distributed event streaming, maintains low latency at scale.

Latency

Latency Analytics Architecture Storage

Mastering Latency With P90, P99, and Mean Response Times

DZone

FEBRUARY 5, 2024

In the fast-paced digital world, where every millisecond counts, understanding the nuances of network latency becomes paramount for developers and system architects. Latency, the delay before a transfer of data begins following an instruction for its transfer, can significantly impact user experience and system performance.

Latency

Latency Metrics Network Systems
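
As a rough illustration of the mean, P90, and P99 figures named in the entry above, the following sketch summarises a list of latency samples using only the Python standard library. The function name `latency_summary` and the sample values are assumptions made for illustration; they are not taken from the DZone article.

```python
import statistics


def latency_summary(samples_ms):
    """Return the mean, P90 and P99 of latency samples given in milliseconds."""
    if len(samples_ms) < 2:
        raise ValueError("need at least two samples")
    # quantiles(n=100) returns the 99 cut points P1..P99;
    # "inclusive" treats the samples as the whole population.
    cuts = statistics.quantiles(samples_ms, n=100, method="inclusive")
    return {
        "mean": statistics.fmean(samples_ms),
        "p90": cuts[89],  # 90th percentile
        "p99": cuts[98],  # 99th percentile
    }


# Hypothetical data: mostly fast responses plus two slow outliers.
samples = [12.0] * 98 + [250.0, 900.0]
summary = latency_summary(samples)
print(summary)
```

With data shaped like this, the mean sits well above the typical response while P90 still looks healthy, and only P99 exposes the slow tail. That is one reason percentile figures are usually reported alongside the mean.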

Migrating Netflix to GraphQL Safely

The Netflix TechBlog

JUNE 14, 2023

So, we relied on higher-level metrics-based testing: AB Testing and Sticky Canaries. To determine customer impact, we could compare various metrics such as error rates, latencies, and time to render. Wins High-Level Health Metrics: AB Testing provided the assurance we needed in our overall client-side GraphQL implementation.

Traffic

Traffic Latency Metrics Cache

Noisy Neighbor Detection with eBPF

The Netflix TechBlog

SEPTEMBER 10, 2024

Continuous Instrumentation of the Linux Scheduler To ensure the reliability of our workloads that depend on low latency responses, we instrumented the run queue latency for each container, which measures the time processes spend in the scheduling queue before being dispatched to the CPU.

Latency

Latency Metrics Programming Monitoring

Implementing service-level objectives to improve software quality

Dynatrace

DECEMBER 27, 2022

By implementing service-level objectives, teams can avoid collecting and checking a huge amount of metrics for each service. When organizations implement SLOs, they can improve software development processes and application performance. Develop error budgets to help teams measure success and make data-driven decisions.

Software

Software Software Benchmarking Latency
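
The error budgets mentioned in the entry above can be sketched as simple arithmetic: the budget is the share of a time window that an availability SLO allows to be lost. The function names, the 30-day window, and the 99.5% target below are assumptions chosen for illustration and do not describe how Dynatrace calculates budgets.

```python
def error_budget_minutes(slo_target, window_days=30):
    """Minutes of allowed unavailability for an availability SLO over a window."""
    if not 0 < slo_target < 1:
        raise ValueError("slo_target must be a fraction between 0 and 1")
    window_minutes = window_days * 24 * 60
    return (1 - slo_target) * window_minutes


def budget_remaining(slo_target, downtime_minutes, window_days=30):
    """Fraction of the error budget still unspent (negative if overspent)."""
    budget = error_budget_minutes(slo_target, window_days)
    return (budget - downtime_minutes) / budget


# Hypothetical example: a 99.5% availability target over 30 days.
budget = error_budget_minutes(0.995)
remaining = budget_remaining(0.995, downtime_minutes=54)
print(f"budget: {budget:.1f} min, remaining: {remaining:.0%}")
```

A team could compare the remaining fraction against a threshold of its own choosing when deciding whether to ship risky changes or prioritise reliability work.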

OpenTelemetry 101: A nontechnical guide for IT leaders and enthusiasts

Dynatrace

JULY 22, 2024

If you work in software development, SRE, or DevOps, you’ve likely heard the terms observability, telemetry, and tracing. These concepts are crucial for understanding how applications behave in production environments, and they’re an essential part of modern software development practices. What is OpenTelemetry?

Latency

Latency Best Practices Metrics Open Source

What are quality gates? How to use quality gates to deliver better software at speed and scale

Dynatrace

FEBRUARY 21, 2024

They help foster confidence and consistency throughout the entire software development lifecycle (SDLC). Automating quality gates is ideal, as it minimizes manually checking and validating key metrics throughout the SDLC. Continuous, informed improvement: Quality gates provide consistent feedback on key metrics.

Speed

Speed Software Software Latency

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

While clustering across wide-area networks (WANs) is discouraged due to latency issues, leased links can mitigate some connectivity challenges. Keeping queues short minimizes latency and enhances the overall efficiency of message delivery in RabbitMQ. Keeping queues short maintains a responsive and efficient RabbitMQ setup.

Best Practices

Best Practices Traffic Strategy Scalability

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

This approach enhances key DORA metrics and enables early detection of failures in the release process, allowing SREs more time for innovation. This blog post explores the Reliability metric, which measures modern operational practices. Why reliability?

Engineering

Engineering Systems Latency Metrics

Dynatrace supports SnapStart for Lambda as an AWS launch partner

Dynatrace

NOVEMBER 28, 2022

The new Amazon capability enables customers to improve the startup latency of their functions from several seconds to as low as sub-second (up to 10 times faster) at P99 (the 99th latency percentile). This can cause latency outliers and may lead to a poor end-user experience for latency-sensitive applications.

Lambda

Lambda AWS Serverless Latency

Escaping POC Purgatory: Evaluation-Driven Development for AI Systems

O'Reilly

MARCH 25, 2025

We've seen this across dozens of companies, and the teams that break out of this trap all adopt some version of Evaluation-Driven Development (EDD), where testing, monitoring, and evaluation drive every decision from the start. We're also betting that this will be a time of software development flourishing. The way out?

Systems

Systems Development Tuning Monitoring

Time to First Byte: What It Is and Why It Matters

CSS Wizardry

AUGUST 7, 2019

However, one metric I feel that front-end developers overlook all too quickly is Time to First Byte (TTFB). The first—and often most surprising for people to learn—thing that I want to draw your attention to is that TTFB counts one whole round trip of latency. can all provide valuable insights. But what else is TTFB?

Latency

Latency Ecommerce Servers Mobile

Dynatrace supports Azure Managed Instance for Apache Cassandra

Dynatrace

MAY 13, 2022

It also removes the need for developers and database administrators to manage infrastructure or update database versions. Once you deploy the Dynatrace extension, Dynatrace ingests your Cassandra metrics and analyzes them in context with the entire stack. Provide a foundation for calculating metrics in dashboard charts.

Azure

Azure Latency Metrics Infrastructure

SLOs done right: how DevOps teams can build better service-level objectives

Dynatrace

MARCH 16, 2023

So how do development and operations (DevOps) teams and site reliability engineers (SREs) distinguish among good, great, and suboptimal SLOs? Enterprises now have access to myriad metrics they can track and measure, but an abundance of choice doesn’t equal actionable insight. The result?

DevOps

DevOps Latency Metrics Traffic

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Stream processing systems, designed for continuous, low-latency processing, demand swift recovery mechanisms to tolerate and mitigate failures effectively. This significantly increases event latency. Spark Structured Streaming can also provide consistent fault recovery for applications where latency is not a critical requirement.

Engineering

Engineering Tuning Latency Open Source

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

MAY 31, 2023

As a result, site reliability has emerged as a critical success metric for many organizations. The practice uses continuous monitoring and high levels of automation in close collaboration with agile development teams to ensure applications are highly available and perform without friction. Service-level objectives (SLOs). availability.

Best Practices

Best Practices DevOps Latency Metrics

Lessons learned from enterprise service-level objective management

Dynatrace

MAY 19, 2022

Organizations have multiple stakeholders and almost always have different teams that set up monitoring, operate systems, and develop new functionality. In their new dashboard, they added dimensions for load, latency, and open problems for each component. The “Four Golden Signals” include the following: Latency.

Automotive

Automotive Latency Architecture Mobile

Automated Change Impact Analysis with Site Reliability Guardian

Dynatrace

FEBRUARY 15, 2023

Streamline development and delivery processes Nowadays, digital transformation strategies are executed by almost every organization across all industries. To achieve this, many organizations are adopting DevOps practices to provide developers with a delivery platform to release their applications and services autonomously and independently.

DevOps

DevOps Latency Traffic Best Practices

What is AWS Lambda?

Dynatrace

APRIL 5, 2021

These include website hosting, database management, backup and restore, IoT capabilities, e-commerce solutions, app development tools and more, with new services released regularly. Real-time stream processing to perform live activity tracking, data cleansing, metrics generation, and more.

Lambda

Lambda AWS Serverless Hardware

Extending Vector with eBPF to inspect host and container performance

The Netflix TechBlog

FEBRUARY 20, 2019

Today we are excited to announce latency heatmaps and improved container support for our on-host monitoring solution, Vector, to the broader community. Remotely view real-time process scheduler latency and TCP throughput with Vector and eBPF. What is Vector? Vector is open source and in use by multiple companies.

Performance

Performance Latency Open Source Metrics

What is real user monitoring (RUM)?

Dynatrace

JANUARY 13, 2022

Real user monitoring collects data on a variety of metrics. For example, data collected on load actions can include navigation start, request start, and speed index metrics. Real user monitoring works by injecting code into an application to capture metrics while the application is in use. How real user monitoring works.

Monitoring

Monitoring Mobile Latency Best Practices

What is full stack observability?

Dynatrace

APRIL 6, 2022

A full-stack observability solution uses telemetry data such as logs, metrics, and traces to give IT teams insight into application, infrastructure, and UX performance. Observability can identify the baseline user experience and allow teams to improve it by optimizing page load times or reducing latency. See observability in action!

DevOps

DevOps Innovation Infrastructure Cloud

Enhanced AI model observability with Dynatrace and Traceloop OpenLLMetry

Dynatrace

DECEMBER 4, 2023

OpenTelemetry has become a standard for collecting traces, metrics, and logs. Given the prevalence of Python in AI model development, OpenTelemetry serves as a robust standard for collecting observability data, including traces, metrics, and logs. Maintained under the Apache 2.0 However, Python models are trickier.

Open Source

Open Source Metrics Java Latency

Seamlessly Swapping the API backend of the Netflix Android app

The Netflix TechBlog

SEPTEMBER 8, 2020

How we migrated our Android endpoints out of a monolith into a new microservice, by Rohan Dhruva, Ed Ballot. As Android developers, we usually have the luxury of treating our backends as magic boxes running in the cloud, faithfully returning us JSON. We will talk more about how we used these metrics in the sections to follow.

Latency

Latency Cache Java Traffic

Implementing AWS well-architected pillars with automated workflows

Dynatrace

SEPTEMBER 13, 2023

In this blog post, we’ll demonstrate how Dynatrace automation and the Dynatrace Site Reliability Guardian can help you implement your applications according to all six AWS Well-Architected pillars by integrating them into your software development lifecycle (SDLC).

AWS

AWS Efficiency Azure Cloud

What is API monitoring?

Dynatrace

OCTOBER 4, 2021

API monitoring captures and analyzes metrics that describe the vital aspects of an application’s performance, which can help developers gain a deeper understanding of the health and efficiency of the APIs they’re utilizing. For example, some developers may be using an old version of an API that will soon be deprecated.

Monitoring

Monitoring Latency Metrics Availability

Improved Alerting with Atlas Streaming Eval

The Netflix TechBlog

APRIL 27, 2023

A few years ago, we were paged by our SRE team due to our Metrics Alerting System falling behind — critical application health alerts reached engineers 45 minutes late! Hence, we started down the path of alert evaluation via real-time streaming metrics. This has proven to be valuable towards reducing Mean Time to Recover (MTTR).

Storage

Storage Cache Metrics Database

Pushy to the Limit: Evolving Netflix’s WebSocket proxy for the future

The Netflix TechBlog

SEPTEMBER 10, 2024

History & motivation There were two main motivating use cases that drove Pushy’s initial development and usage. These pain points coincided with the introduction of KeyValue, which was a new offering from the CDE team that is roughly “HashMap as a service” for Netflix developers.

Latency

Latency Cache Tuning Efficiency

Dynatrace accelerates business transformation with new AI observability solution

Dynatrace

JANUARY 31, 2024

Bringing together metrics, logs, traces, problem analytics, and root-cause information in dashboards and notebooks, Dynatrace offers an end-to-end unified operational view of cloud applications. Development and demand for AI tools come with a growing concern about their environmental cost.

Cache

Cache Azure Infrastructure Monitoring

Observability platform vs. observability tools

Dynatrace

DECEMBER 22, 2021

Observability gives developers and system operators real-time awareness of a highly distributed system’s current state based on the data it generates. Observability is made up of three key pillars: metrics, logs, and traces. A microscopic view of systems is also particularly valuable to developers.

Artificial Intelligence

Artificial Intelligence Metrics Architecture DevOps

Unlock the power of contextual log analytics

Dynatrace

OCTOBER 2, 2024

Dynatrace enables various teams, such as developers, threat hunters, business analysts, and DevOps, to effortlessly consume advanced log insights within a single platform. Dynatrace Grail™ and Davis® AI act as the foundation, eliminating the need for manual log correlation or analysis while enabling you to take proactive action.

Analytics

Analytics AWS DevOps Cloud

What are SLOs? How service-level objectives work with SLIs to deliver on SLAs

Dynatrace

DECEMBER 2, 2021

These can include business metrics, such as conversion rates, uptime, and availability; service metrics, such as application performance; or technical metrics, such as dependencies to third-party services, underlying CPU, and the cost of running a service. For example, if your SLO guarantees 99.5% What are SLIs? Avoid downtime.

Metrics

Metrics Best Practices DevOps Infrastructure

How Dynatrace boosts production resilience with Site Reliability Guardian

Dynatrace

MAY 17, 2023

To ensure high standards, it’s essential that your organization establish automated validations in an early phase of the software development process—ideally when code is written. In this case, the four golden signals (latency, traffic, errors, and saturation) are derived from span attributes and DQL metric queries via Dynatrace Grail™.

DevOps

DevOps Traffic Latency Best Practices

What is serverless computing? Driving efficiency without sacrificing observability

Dynatrace

JANUARY 26, 2021

AWS Lambda functions are an example of how a serverless framework works: Developers write a function in a supported language or platform. The developer uploads the function and configuration for how to run the function to the cloud. When an application is triggered, it can cause latency as the application starts. Pay Per Use.

Serverless

Serverless Efficiency Lambda Azure

Performance Hero: Annie Sullivan

Speed Curve

JANUARY 19, 2025

Annie leads the Chrome Speed Metrics team at Google, which has arguably had the most significant impact on web performance of the past decade. It's really important to acknowledge that none of this would have been possible without the great work from Annie and her small-but-mighty Speed Metrics team at Google. Nice job, everyone!

Performance

Performance Google Speed Metrics

Business KPI tracking for mobile applications with Dynatrace: The value of an end-to-end platform for mobile app owners

Dynatrace

SEPTEMBER 16, 2022

To answer these questions for the business as well as work with your mobile developers to prioritize efforts and implement changes, it’s critical to have a single source of truth that provides the operational and business answers you need. When it comes to mobile app development, it’s vital that owners get the full picture.

Mobile

Mobile Metrics Monitoring Latency

Common SLO pitfalls and how to avoid them

Dynatrace

FEBRUARY 2, 2022

service availability with <50ms latency for an application with no revenue impact. SLOs created by upper management without buy-in from relevant development, operations, and SRE stakeholders can lead to finger-pointing, blaming, and chaotic war rooms when violations occur. Pitfall 2: SLOs with no ownership or accountability.

DevOps

DevOps Metrics Best Practices Latency

Build and operate multicloud FaaS with enhanced, intelligent end-to-end observability

Dynatrace

APRIL 25, 2023

It helps developers and operators identify and troubleshoot issues, optimize performance and improve user experience. Enable faster development and deployment cycles by abstracting away the infrastructure complexity. Higher latency and cold start issues due to the initialization time of the functions.

Serverless

Serverless Lambda Azure AWS

Edgar: Solving Mysteries Faster with Observability

The Netflix TechBlog

SEPTEMBER 2, 2020

Tracing as a foundation Logs, metrics, and traces are the three pillars of observability. Metrics communicate what’s happening on a macro scale, traces illustrate the ecosystem of an isolated request, and the logs provide a detail-rich snapshot into what happened within a service. Is this an anomaly or are we dealing with a pattern?

Latency

Latency Transportation Engineering Traffic

How digital experience monitoring helps deliver business observability

Dynatrace

APRIL 26, 2022

Fast, consistent application delivery creates a positive user experience that can ultimately drive customer loyalty and improve business metrics like conversion rate and user retention. DEM can give organizations business observability—insight into the effects of user experience on the bottom line. What is digital experience monitoring?

Monitoring

Monitoring Social Media IoT Metrics
