As modern multicloud environments become more distributed and complex, having real-time insights into applications and infrastructure while keeping data residency in local markets is crucial. As of October 2024, Dynatrace is available on Microsoft Azure Australia East region, enabling joint customers to maintain a local SaaS presence.
By: Rajiv Shringi, Oleksii Tkachuk, Kartik Sathyanarayanan. Introduction: In our previous blog post, we introduced Netflix's TimeSeries Abstraction, a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we're excited to present the Distributed Counter Abstraction.
Take your monitoring, data exploration, and storytelling to the next level with outstanding data visualization. All your applications and underlying infrastructure produce vast volumes of data that you need to monitor or analyze for insights. Infrastructure health: A honeycomb chart is often used to visualize infrastructure health.
A significant feature of Chronicle Queue Enterprise is support for TCP replication across multiple servers to ensure the high availability of application infrastructure. Little’s Law and Why Latency Matters. In many cases, the assumption is that as long as throughput is high enough, the latency won’t be a problem.
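To make the law concrete, here is a minimal sketch (not from the article) of Little's Law, L = λW: the average number of items in flight equals throughput multiplied by the average time each item spends in the system. High throughput paired with high latency still implies a large, possibly unsustainable, amount of in-flight work.

```python
# Little's Law: L = lambda * W
# L      = average number of items in the system (in-flight work)
# lambda = average throughput (items per second)
# W      = average time an item spends in the system (seconds)

def in_flight(throughput_per_sec: float, latency_sec: float) -> float:
    """In-flight items implied by a given throughput and latency."""
    return throughput_per_sec * latency_sec

# 10,000 messages/s at 50 ms average latency => 500 messages in flight.
print(in_flight(10_000, 0.050))  # 500.0
```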
Since we moved to AWS in May 2014 we have had an availability of 99.95%! Sydney, we have a disk write latency problem! It was on August 25th at 14:00 when Davis initially alerted on disk write latency issues with Elastic File System (EFS) on one of our EC2 instances in AWS's Sydney Data Center.
Dynatrace integrates application performance monitoring (APM), infrastructure monitoring, and real-user monitoring (RUM) into a single platform, with its Foundation & Discovery mode offering a cost-effective, unified view of the entire infrastructure, including non-critical applications previously monitored using legacy APM tools.
Its design prioritizes high availability and efficient data transfer with minimal overhead, making it a practical choice for handling real-time data pipelines and distributed event processing. It follows a push-based approach, ensuring messages are distributed to consumers as soon as they become available.
Dynatrace Managed is intrinsically highly available as it stores three copies of all events, user sessions, and metrics across its cluster nodes. The network latency between cluster nodes should be around 10 ms or less. Turnkey high availability across globally distributed data centers. Dynatrace news.
Yet, many are confined to a brief temporal window due to constraints in serving latency or training costs. In recommendation systems, context windows during inference are often limited to hundreds of events, not due to model capability but because these services typically require millisecond-level latency.
Having released this functionality in a Preview Release back in September 2019, we're now happy to announce the General Availability of our Citrix monitoring extension. Synthetic monitoring: Citrix login availability and performance. OneAgent: Citrix infrastructure performance. OneAgent: SAP infrastructure performance.
With the many observability options available from Dynatrace, you can seamlessly monitor hybrid Kubernetes environments in a unified platform, gaining end-to-end visibility across both operating systems and the underlying cluster. The containers list as individual PaaS hosts after successful deployment.
Currently, publicly available wifi hotspots are the preferred networks for video consumption, but poor network infrastructure also leads to unbearable video buffering and latency. However, OTT streaming delivery requires something faster than what the internet offers in terms of how chunks/fragments are supposed to flow.
Microsoft Azure is one of the most popular cloud providers in the world, and a natural fit for database hosting on applications leveraging Microsoft across their infrastructure. While Microsoft offers their own Azure Database product, there are other alternatives available that may be able to help you improve your MySQL performance.
SLOs can be a great way for DevOps and infrastructure teams to use data and performance expectations to make decisions, such as whether to release and where engineers should focus their time. Latency is the time that it takes a request to be served. Availability. SLOs aid decision making. SLOs promote automation. Reliability.
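One common way such SLO-driven decisions are operationalized is with an error budget. Here is a hypothetical sketch (all names and numbers invented) of a release gate based on how much budget remains:

```python
def error_budget_remaining(slo_target: float, failures: int, total: int) -> float:
    """Fraction of the error budget left (1.0 = untouched, negative = blown)."""
    allowed = (1 - slo_target) * total
    return 1 - failures / allowed

# Hypothetical month: 99.9% availability SLO, 10M requests, 8,000 failures.
remaining = error_budget_remaining(0.999, failures=8_000, total=10_000_000)
print(f"{remaining:.0%} of error budget left")  # 20% of error budget left

# A team might block risky releases whenever remaining <= 0.
```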
Implementing clustering and quorum queues in RabbitMQ significantly improves load distribution and data redundancy, ensuring high availability and fault tolerance for messaging services. Classic queues can be used in clusters, emphasizing their behavior during node failures, particularly regarding durability and availability.
The new Amazon capability enables customers to improve the startup latency of their functions from several seconds to as low as sub-second (up to 10 times faster) at P99 (the 99th latency percentile). This can cause latency outliers and may lead to a poor end-user experience for latency-sensitive applications.
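P99 here is the 99th latency percentile: 99% of invocations complete at least that fast. A quick way to compute it from a sample of measured cold-start times (the values below are invented):

```python
import statistics

# Invented cold-start latencies in milliseconds.
samples = [180, 220, 250, 3_100, 210, 240, 4_800, 230, 260, 200] * 100

p99 = statistics.quantiles(samples, n=100)[98]  # the 99th percentile
print(f"P99 cold start: {p99:.0f} ms")
```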
The first step is determining whether the problem originates from the application or the underlying infrastructure. Learn how Linux kernel instrumentation can improve your infrastructure observability with deeper insights and enhanced monitoring. We then calculate the run queue latency by simply subtracting the timestamps.
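The subtraction itself is simple. Sketched below in Python over invented trace records rather than real eBPF hooks, the idea is to timestamp when a task becomes runnable and subtract that from the timestamp when it actually gets on a CPU:

```python
# Sketch of the timestamp-subtraction idea behind run queue latency.
# In a real setup this logic runs in eBPF at sched_wakeup/sched_switch;
# here we replay invented (event, pid, timestamp_ns) records instead.
events = [
    ("wakeup", 42, 1_000_000),   # task 42 becomes runnable
    ("switch", 42, 1_350_000),   # task 42 gets on a CPU 350 us later
]

woke_at = {}
for kind, pid, ts in events:
    if kind == "wakeup":
        woke_at[pid] = ts
    elif kind == "switch" and pid in woke_at:
        runq_ns = ts - woke_at.pop(pid)
        print(f"pid {pid}: run queue latency {runq_ns / 1000:.0f} us")
```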
As an open source database, it’s a highly popular choice for enterprise applications looking to modernize their infrastructure and reduce their total cost of ownership, along with startup and developer applications looking for a powerful, flexible and cost-effective database to work with. Compare Latency. At a glance – TLDR.
Vidhya Arvind, Rajasekhar Ummadisetty, Joey Lynch, Vinay Chella. Introduction: At Netflix, our ability to deliver seamless, high-quality streaming experiences to millions of users hinges on robust, global backend infrastructure. It also serves as central configuration of access patterns such as consistency or latency targets.
What is the availability, configurability, and efficacy of each? Plotted on the same horizontal axis of 1.6s, the waterfalls speak for themselves: 201ms of cumulative latency; 109ms of cumulative download. 4,362ms of cumulative latency; 240ms of cumulative download. And do any of our previous decisions dictate our options?
Its ability to densely schedule containers into the underlying machines translates to low infrastructure costs. However, setting the right parameters for Kubernetes clusters to ensure application availability, performance, and resilience while avoiding overspending isn’t a walk in the park. below 500ms) and error rates (e.g.
Because of its scalability and distributed architecture, thousands of companies trust it to run their cloud and hybrid-based workloads at high availability without compromising performance. It also removes the need for developers and database administrators to manage infrastructure or update database versions.
These organizations rely heavily on performance, availability, and user satisfaction to drive sales and retain customers. Availability: an availability SLO quantifies the expected level of service availability over a specific time period. Availability is typically expressed in 9's, such as 99.9% or 99.99% of the time.
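For a sense of scale, each added nine cuts the permitted downtime by a factor of ten; a quick calculation over a 30-day month:

```python
MONTH_MINUTES = 30 * 24 * 60  # 43,200 minutes in a 30-day month

for target in (0.999, 0.9999):
    downtime = MONTH_MINUTES * (1 - target)
    print(f"{target:.2%} availability allows {downtime:.1f} min of downtime/month")
# 99.90% -> 43.2 min; 99.99% -> ~4.3 min
```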
One of the quickest wins—and one of the first things I recommend my clients do—to make websites faster can at first seem counter-intuitive: you should self-host all of your static assets, forgoing others' CDNs/infrastructure. On a slower, higher-latency connection, the story is much, much worse. You're going to suffer, too.
Benefits of Caching Improved performance: Caching eliminates the need to retrieve data from the original source every time, resulting in faster response times and reduced latency. Reduced server load: By serving cached content, the load on the server is reduced, allowing it to handle more requests and improving overall scalability.
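A minimal read-through cache sketch (in-memory only, names invented) shows where the saving comes from: repeated reads within the TTL never touch the slow source.

```python
import time

class ReadThroughCache:
    """Tiny in-memory read-through cache with per-entry TTL (illustrative only)."""

    def __init__(self, fetch, ttl_seconds=60):
        self.fetch = fetch   # slow source-of-truth loader, e.g. a database query
        self.ttl = ttl_seconds
        self.store = {}      # key -> (value, expires_at)

    def get(self, key):
        value, expires_at = self.store.get(key, (None, 0.0))
        if time.monotonic() < expires_at:
            return value     # cache hit: the slow source is never contacted
        value = self.fetch(key)  # cache miss: pay the full latency once
        self.store[key] = (value, time.monotonic() + self.ttl)
        return value

# cache = ReadThroughCache(fetch=load_from_database)  # hypothetical loader
```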
Keeping pace with modern digital transformation requires ensuring that applications are responsive, resilient, and always available amid increased complexity. There are now many more applications, tools, and infrastructure variables that impact an application’s performance and availability.
Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. ” According to Google, “SRE is what you get when you treat operations as a software problem.”
Failures can occur unpredictably across various levels, from physical infrastructure to software layers. Stream processing systems, designed for continuous, low-latency processing, demand swift recovery mechanisms to tolerate and mitigate failures effectively. This significantly increases event latency.
To determine customer impact, we could compare various metrics such as error rates, latencies, and time to render. The Replay Testing framework leverages the @override directive available in GraphQL Federation. The AB experiment results hinted that GraphQL’s correctness was not up to par with the legacy system. How does it work?
The first—and often most surprising for people to learn—thing that I want to draw your attention to is that TTFB counts one whole round trip of latency. The reason is because mobile networks are, as a rule, high latency connections. Last mile latency deals with the disproportionate complexity toward the terminus of a connection.
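A back-of-the-envelope tally (every figure below is illustrative, not from the article) shows how round trips dominate TTFB on a cold, high-latency mobile connection:

```python
# Rough TTFB estimate for a cold connection; all figures are invented.
rtt_ms = 150                 # a plausible high-latency mobile round trip
dns = 1 * rtt_ms             # DNS lookup
tcp = 1 * rtt_ms             # TCP handshake
tls = 2 * rtt_ms             # TLS 1.2 handshake (two round trips)
request = 1 * rtt_ms         # the request/response round trip TTFB always includes
server_ms = 100              # server think time

ttfb_ms = dns + tcp + tls + request + server_ms
print(f"estimated TTFB: {ttfb_ms} ms")  # 850 ms before the first body byte
```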
Rajiv Shringi, Vinay Chella, Kaidan Fullerton, Oleksii Tkachuk, Joey Lynch. Introduction: As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming, the ability to ingest and store vast amounts of temporal data — often reaching petabytes — with millisecond access latency has become increasingly vital.
Now, customers can use streamed responses to build more responsive applications by sending partial responses to clients as the response becomes available. Customers can use AWS Lambda Response Streaming to improve performance for latency-sensitive applications and return larger payload sizes. What is a Lambda serverless function?
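On the consuming side, a streamed response can be processed as chunks arrive instead of waiting for the complete payload. A hedged client-side sketch using the requests library (the URL is a placeholder, not a real endpoint):

```python
import requests

# Placeholder URL; a real Lambda function URL with response streaming
# enabled would go here.
url = "https://example.lambda-url.us-east-1.on.aws/"

with requests.get(url, stream=True, timeout=30) as resp:
    resp.raise_for_status()
    for chunk in resp.iter_content(chunk_size=8192):
        if chunk:
            # Partial output is usable as soon as it arrives, which is
            # the latency win streaming provides.
            print(f"received {len(chunk)} bytes")
```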
But your infrastructure teams don’t see any issue on their AWS or Azure monitoring tools, your platform team doesn’t see anything too concerning in Kubernetes logging, and your apps team says there are green lights across the board. This scenario has become all too common as digital infrastructure has grown increasingly complex.
Compare Latency. On average, ScaleGrid achieves almost 30% lower latency over DigitalOcean for the same deployment configurations. Now that we’ve compared throughput performance, let’s take a look at ScaleGrid vs. DigitalOcean latency for MySQL. Read-Intensive Latency Benchmark. Balanced Workload Latency Benchmark.
To remain competitive in today’s fast-paced market, organizations must not only ensure that their digital infrastructure is functioning optimally but also that software deployments and updates are delivered rapidly and consistently. In this example, unlike latency, the remaining three signals did not receive a “pass.”
One of the crucial success factors for delivering cost-efficient and high-quality AI-agent services, following the approach described above, is to closely observe their cost, latency, and reliability. With these latency, reliability, and cost measurements in place, your operations team can now define their own OpenAI dashboards and SLOs.
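One lightweight way to start capturing latency and reliability per call is a timing wrapper like the sketch below (the metric sink and all names are invented; cost would typically be derived from token usage in the provider's response):

```python
import functools
import time

metrics = []  # stand-in for a real metric sink

def observe(name):
    """Record latency and success/failure for each call to the wrapped function."""
    def wrap(fn):
        @functools.wraps(fn)
        def inner(*args, **kwargs):
            start = time.monotonic()
            try:
                result = fn(*args, **kwargs)
                metrics.append((name, "ok", time.monotonic() - start))
                return result
            except Exception:
                metrics.append((name, "error", time.monotonic() - start))
                raise
        return inner
    return wrap

@observe("agent.answer")
def answer(question: str) -> str:
    return "42"  # placeholder for a real model call

answer("What is the meaning of life?")
print(metrics)  # [('agent.answer', 'ok', ...)]
```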
Data dependencies and framework intricacies require observing the lifecycle of an AI-powered application end to end, from infrastructure and model performance to semantic caches and workflow orchestration. Estimates show that NVIDIA, a semiconductor manufacturer, could release 1.5 million AI server units annually by 2027, consuming 75.4+
As organizations continue to migrate to the cloud, it’s important to get in front of performance issues, such as high latency, low throughput, and replication lag with higher distances between your users and cloud infrastructure. ScaleGrid also maintains 53% lower latency on average throughout the entire MySQL AWS performance tests.
Full stack observability with the Dynatrace Hyper-V extension Use Dynatrace for deeper insights into the Microsoft ecosystem with the new Hyper-V extension; it provides crucial virtualization layer characteristics for Windows infrastructure observability. We’re always looking for feedback and opportunities to improve.
This is particularly important as we build out new functionality that relies on Pushy; a strong, stable infrastructure foundation allows our partners to continue to build on top of Pushy with confidence. In our case, we value low latency — the faster we can read from KeyValue, the faster these messages can get delivered.
And they can create relevant queries based on available data to answer questions and make business decisions. While measuring app response time under different circumstances provides a latency value, for example, it doesn’t tell you why the app is slow, fast, or somewhere in between. Predictive analysis.
These releases often assumed ideal conditions such as zero latency, infinite bandwidth, and no network loss, as highlighted in Peter Deutsch’s eight fallacies of distributed systems. Impact of fewer resources, for example, CPU and disk, available to different services and applications.
In these modern environments, every hardware, software, and cloud infrastructure component and every container, open-source tool, and microservice generates records of every activity. Metrics can originate from a variety of sources, including infrastructure, hosts, services, cloud platforms, and external sources.