Innovation and Latency - Technology Performance Pulse

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

Furthermore, it was difficult to transfer innovations from one model to another, given that most are independently trained despite using common data sources. Yet, many are confined to a brief temporal window due to constraints in serving latency or training costs.

Tuning

Tuning Efficiency Latency Strategy

Dynatrace supports SnapStart for Lambda as an AWS launch partner

Dynatrace

NOVEMBER 28, 2022

The new Amazon capability enables customers to improve the startup latency of their functions from several seconds to as low as sub-second (up to 10 times faster) at P99 (the 99th latency percentile). This can cause latency outliers and may lead to a poor end-user experience for latency-sensitive applications.

Lambda

Lambda AWS Serverless Latency

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

The Challenge of Title Launch Observability As engineers, were wired to track system metrics like error rates, latencies, and CPU utilizationbut what about metrics that matter to a titlessuccess? Stay tuned for a closer look at the innovation behind thescenes!

Traffic

Traffic Scalability Strategy Monitoring

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

These include challenges with tail latency and idempotency, managing “wide” partitions with many rows, handling single large “fat” columns, and slow response pagination. It also serves as central configuration of access patterns such as consistency or latency targets. Useful for keeping “n-newest” or prefix path deletion.

Latency

Latency Storage Cache Servers

What is full stack observability?

Dynatrace

APRIL 6, 2022

Full-stack observability is fast becoming a must-have capability for organizations under pressure to deliver innovation in increasingly cloud-native environments. Observability can identify the baseline user experience and allow teams to improve it by optimizing page load times or reducing latency. Dynatrace news.

DevOps

DevOps Innovation Infrastructure Cloud

Nine ways technology executives can get significant business value with the right observability platform

Dynatrace

MAY 21, 2024

I also have the privilege of being “customer zero” for our platform, which enables me to continually discover where Dynatrace can deliver on more use cases to drive my team’s productivity and innovation. That’s because it does not require any pre-prepared schemas, and access to cold/hot storage is fully automatic and with zero latency.

Technology

Technology Technology Analytics Storage

What are quality gates? How to use quality gates to deliver better software at speed and scale

Dynatrace

FEBRUARY 21, 2024

This approach supports innovation, ambitious SLOs, DevOps scalability, and competitiveness. These metrics are latency, traffic, errors, and saturation, all of which must be key considerations when curating user experience. In this example, unlike latency, the remaining three signals did not receive a “pass.”

Speed

Speed Software Software Latency

Dynatrace supports the newly released AWS Lambda Response Streaming

Dynatrace

APRIL 7, 2023

Customers can use AWS Lambda Response Streaming to improve performance for latency-sensitive applications and return larger payload sizes. Customers can use response streaming to achieve the following: Improve Time to First Byte (TTFB) performance for latency-sensitive applications. Return larger payload sizes.

Lambda

Lambda AWS Serverless Latency

Rebuilding Netflix Video Processing Pipeline with Microservices

The Netflix TechBlog

JANUARY 10, 2024

This architecture shift greatly reduced the processing latency and increased system resiliency. We rolled out encoding innovations such as per-title and per-shot optimizations, which provided significant quality-of-experience (QoE) improvement to Netflix members. This introductory blog focuses on an overview of our journey.

Processing

Processing Media Latency Innovation

Dynatrace automatically monitors OpenAI ChatGPT for companies that deliver reliable, cost-effective services powered by generative AI

Dynatrace

JUNE 7, 2023

Businesses in all sectors are introducing novel approaches to innovate with generative AI in their domains. One of the crucial success factors for delivering cost-efficient and high-quality AI-agent services, following the approach described above, is to closely observe their cost, latency, and reliability.

Monitoring

Monitoring Latency Metrics Azure

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

Dynatrace

OCTOBER 3, 2024

Teams need a better way to work together, eliminate silos and spend more time innovating. Without distributed tracing, pinpointing the cause of increased latency could take hours or even days. Interact with data intuitively and easily and benefit from immediate, AI-supported insights.

Performance

Performance Architecture Innovation Latency

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

This approach enhances key DORA metrics and enables early detection of failures in the release process, allowing SREs more time for innovation. These releases often assumed ideal conditions such as zero latency, infinite bandwidth, and no network loss, as highlighted in Peter Deutsch’s eight fallacies of distributed systems.

Engineering

Engineering Systems Latency Metrics

What is AWS Lambda?

Dynatrace

APRIL 5, 2021

You can eliminate the latency issues caused by cold starts — an increase in normal response time when a new instance receives its first request — by using edge-optimized functions that run code closer to users and other projects. AWS continues to improve how it handles latency issues. It helps SRE teams automate responses.

Lambda

Lambda AWS Serverless Hardware

For your eyes only: improving Netflix video quality with neural networks

The Netflix TechBlog

NOVEMBER 17, 2022

On a CPU, we leveraged oneDnn to further reduce latency. Integrating neural networks into our next-generation encoding platform The Encoding Technologies and Media Cloud Engineering teams at Netflix have jointly innovated to bring Cosmos , our next-generation encoding platform, to life. Our filter can run on both CPU and GPU.

Network

Network Media Innovation Efficiency

How Park ‘N Fly eliminated silos and improved customer experience with Dynatrace cloud monitoring

Dynatrace

APRIL 7, 2021

For example, if there is a latency on a particular service, Dynatrace will flag this and trace its source – even if the source is a third party. We can see latency trends so we can say, ‘hey, you guys at nine o’clock at night keep having horrible latency spikes, and you’re causing a problem for our customers.’”

Cloud

Cloud Monitoring Latency Games

Site reliability engineering: 5 things you need to know

Dynatrace

FEBRUARY 4, 2021

As a discipline, SRE focuses on improving software system reliability across key categories including availability, performance, latency, efficiency, capacity, and incident response. ” According to Google, “SRE is what you get when you treat operations as a software problem.” SRE requires a cultural change.

Engineering

Engineering DevOps Government Latency

Service level objectives: 5 SLOs to get started

Dynatrace

JUNE 1, 2023

But the pressure on CIOs to innovate faster comes at a cost. Note : you might hear the term latency used instead of response time. Both latency and response time are critical to ensure reliability. Latency typically refers to the time it takes for a single request to travel from its source to its destination.

Latency

Latency Website Traffic DevOps

What is observability? Not just logs, metrics and traces

Dynatrace

OCTOBER 1, 2021

An advanced observability solution can also be used to automate more processes, increasing efficiency and innovation among Ops and Apps teams. These organizational improvements open the door to further innovation and digital transformation. How do you make a system observable?

Metrics

Metrics Open Source Monitoring Cloud

Netflix Cloud Packaging in the Terabyte Era

The Netflix TechBlog

SEPTEMBER 24, 2021

By Xiaomei Liu , Rosanna Lee , Cyril Concolato Introduction Behind the scenes of the beloved Netflix streaming service and content, there are many technology innovations in media processing. Uploading and downloading data always come with a penalty, namely latency. Packaging has always been an important step in media processing.

Cloud

Cloud Media Storage Cache

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

Rajiv Shringi Vinay Chella Kaidan Fullerton Oleksii Tkachuk Joey Lynch Introduction As Netflix continues to expand and diversify into various sectors like Video on Demand and Gaming , the ability to ingest and store vast amounts of temporal data — often reaching petabytes — with millisecond access latency has become increasingly vital.

Latency

Latency Storage Traffic Tuning

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

MAY 31, 2023

By automating and accelerating the service-level objective (SLO) validation process and quickly reacting to regressions in service-level indicators (SLIs), SREs can speed up software delivery and innovation. At the lowest level, SLIs provide a view of service availability, latency, performance, and capacity across systems.

Best Practices

Best Practices DevOps Latency Metrics

DevOps observability: A guide for DevOps and DevSecOps teams

Dynatrace

JANUARY 18, 2023

As organizations accelerate innovation to keep pace with digital transformation, DevOps observability is becoming a critical key to success for DevOps and DevSecOps teams. This drive for speed has a cost: 22% of leaders admit they’re under so much pressure to innovate faster that they must sacrifice code quality.

DevOps

DevOps Best Practices Innovation Strategy

Site reliability engineering: 5 things to you need to know

Dynatrace

FEBRUARY 4, 2021

As a discipline, SRE focuses on improving software system reliability across key categories including availability, performance, latency, efficiency, capacity, and incident response. ” According to Google, “SRE is what you get when you treat operations as a software problem.” SRE requires a cultural change.

Engineering

Engineering DevOps Government Latency

Dynatrace accelerates business transformation with new AI observability solution

Dynatrace

JANUARY 31, 2024

For production models, this provides observability of service-level agreement (SLA) performance metrics, such as token consumption, latency, availability, response time, and error count. Enterprises that fail to adapt to these innovations face extinction.

Cache

Cache Azure Infrastructure Monitoring

SRE vs DevOps: What you need to know

Dynatrace

FEBRUARY 24, 2021

Reduced latency. At Dynatrace, we believe in advancing customers’ and the industry’s movement toward highly automated, AI-driven DevOps so teams can automate mundane tasks and deliver innovation faster and with less risk. Yes, it’s a broad philosophy whose tenets can be applied in other areas (e.g. Efficiency.

DevOps

DevOps Software Engineering Speed Google

Real-time business analytics with Dynatrace: Unleashing the treasure trove of insights from your observability data

Dynatrace

AUGUST 20, 2024

Common business analytics incur too much latency. In the coming weeks, I’ll dive deeper into each of the nine executive use case areas to drive innovation, mitigate risk, and optimize cost so you can unlock the potential of your business data using Dynatrace. Follow the new “Dynatrace for Executives” blog series.

Analytics

Analytics Latency Processing Systems

Rapid Event Notification System at Netflix

The Netflix TechBlog

FEBRUARY 18, 2022

Scaling Policies To address the thundering herd problem and to keep latencies under acceptable thresholds, the cluster scale-up policies are configured to be more aggressive than the scale-down policies. We were able to onboard additional product use cases at a fast pace thus unblocking a lot of innovation.

Systems

Systems Traffic Architecture Mobile

Taming DORA compliance with AI, observability, and security

Dynatrace

AUGUST 27, 2024

This can divert attention and resources from delivering better customer experience and innovation. It detects regressions and deviations from previously observed behavior, including latency, traffic, error rates, saturation, security coverage, vulnerability risk levels, and memory consumption. Integration with existing processes.

Best Practices

Best Practices Government DevOps Analytics

Allez, rendez-vous à Paris – An AWS Region is coming to France!

All Things Distributed

SEPTEMBER 29, 2016

Based in the Paris area, the region will provide even lower latency and will allow users who want to store their content in datacenters in France to easily do so. He has said, “By moving a large part of our IT system from our old IBM mainframe to AWS, we have adopted a cloud first strategy, boosting our power of innovation.

AWS

AWS IoT Internet Internet

Why growing AI adoption requires an AI observability strategy

Dynatrace

JANUARY 17, 2024

As organizations turn to artificial intelligence for operational efficiency and product innovation in multicloud environments, they have to balance the benefits with skyrocketing costs associated with AI. Growing AI adoption has ushered in a new reality. Use containerization.

Strategy

Strategy Artificial Intelligence Storage Cloud

These 7 Edge Data Challenges Will Test Companies the Most in 2025

VoltDB

DECEMBER 11, 2024

By bringing computation closer to the data source, edge-based deployments reduce latency, enhance real-time capabilities, and optimize network bandwidth. Increased latency during peak loads. By focusing on security, scalability, and compliance, they can turn these challenges into opportunities for growth and innovation.

IoT

IoT Energy Logistics Latency

Application observability meets developer observability: Unlock a 360º view of your environment

Dynatrace

NOVEMBER 6, 2023

Manually sifting through data to answer these questions is time-consuming and takes time away from innovation. Dynatrace enables teams to specify SLOs, such as latency, uptime, availability, and more. When an incident occurs, developers need to know what data to look at, where the incident occurred, and other relevant metrics.

Development

Development DevOps Programming Cloud

Interpreting A/B test results: false negatives and power

The Netflix TechBlog

OCTOBER 26, 2021

Martin Tingley with Wenjing Zheng , Simon Ejdemyr , Stephanie Lane , and Colin McFarland This is the fourth post in a multi-part series on how Netflix uses A/B tests to inform decisions and continuously innovate on our products. Need to catch up? Have a look at Part 1 (Decision Making at Netflix), Part 2 (What is an A/B Test?),

Testing

Testing Latency Metrics Design

Netflix Video Quality at Scale with Cosmos Microservices

The Netflix TechBlog

NOVEMBER 2, 2021

As VMAF evolves and is integrated with more encoding and streaming workflows within Netflix, we need scalable ways of fostering video quality innovations. This article explains how we designed microservices and workflows on top of the Cosmos platform to bolster such video quality innovations. via bug fixes). The workflow is initiated.

Media

Media Innovation Metrics Latency

Managing risk for financial services: The secret to visibility and control during times of volatility

Dynatrace

APRIL 8, 2024

Resilient, high-performing technology ecosystems that accelerate innovation through faster development cycles. Maximize performance for high-frequency and low-latency trading strategies. Automated delivery of actionable insights in real-time and in context. Automated risk management workflows, processes, and decisions.

Analytics

Analytics Infrastructure Efficiency Technology

The Anna Key-Value Store Now Has 355x the Performance of DynamoDB for the Dollar

High Scalability

SEPTEMBER 8, 2018

RISELabs , those wonderfully innovative folks over at Berkeley, have uplifted their Anna datatabase —a shared-nothing, thread-per-core architecture to achieve lightning-fast speeds by avoiding all coordination mechanisms—to become cloud-aware. New databases used to be announced seemingly every week.

Storage

Storage Performance AWS Cloud

What are SLOs? How service-level objectives work with SLIs to deliver on SLAs

Dynatrace

DECEMBER 2, 2021

Properly set and defined SLOs should have error budgets that give developers space to innovate without impacting operations. Achieving 100% reliability isn’t always realistic, so using SLOs can help you figure out the balance between innovating (which could result in downtime) and delivering (which ensures users are happy).

Metrics

Metrics DevOps Best Practices Infrastructure

Unlock the power of contextual log analytics

Dynatrace

OCTOBER 2, 2024

If your team deploys applications cloud-natively, we meet you there, too, as we recently covered in our blog post, Dynatrace log management innovations: Syslog, AWS Firehose. We covered in rich detail how Dynatrace supports log ingestion for cloud-native workloads and simplified log ingestion also for hybrid environments.

Analytics

Analytics AWS DevOps Cloud

Common SLO pitfalls and how to avoid them

Dynatrace

FEBRUARY 2, 2022

service availability with <50ms latency for an application with no revenue impact. However, another of the common SLO pitfalls is that many organizations assemble these metrics manually using disparate tools, which can take time from innovation. This can create an unnecessary distraction and steal time away from critical tasks.

DevOps

DevOps Metrics Best Practices Latency

Service level objective examples: 5 SLO examples for faster, more reliable apps

Dynatrace

JUNE 1, 2023

But the pressure on CIOs to innovate faster comes at a cost. Note : you might hear the term latency used instead of response time. Both latency and response time are critical to ensure reliability. Latency typically refers to the time it takes for a single request to travel from its source to its destination.

Traffic

Traffic Website Latency DevOps

Amazon EC2 Cluster GPU Instances - All Things Distributed

All Things Distributed

NOVEMBER 15, 2010

Today Amazon Web Services takes another step on the continuous innovation path by announcing a new Amazon EC2 instance type: The Cluster GPU Instance. We believe that making these GPU resources available for everyone to use at low cost will drive new innovation in the application of highly parallel programming models. Comments ().

AWS

AWS Programming Latency Architecture

Ciao Milano! – An AWS Region is coming to Italy!

All Things Distributed

NOVEMBER 13, 2018

Some of the largest enterprises and public sector organizations in Italy are using AWS to build innovations and power their businesses, drive cost savings, accelerate innovation, and speed time-to-market. AWS also has a vibrant partner ecosystem in Italy as part of the AWS Partner Network (APN).

AWS

AWS Energy Automotive Traffic

Expanding the Cloud – An AWS Region is coming to Hong Kong

All Things Distributed

JUNE 20, 2017

This enables customers to serve content to their end users with low latency, giving them the best application experience. In 2008, AWS opened a point of presence (PoP) in Hong Kong to enable customers to serve content to their end users with low latency. Since then, AWS has added two more PoPs in Hong Kong, the latest in 2016.

AWS

AWS Logistics Cloud Social Media

Expanding the Cloud – Introducing the AWS EU (Frankfurt) Region

All Things Distributed

OCTOBER 23, 2014

The new Frankfurt region provides low millisecond latencies to major cities in continental Europe and is also run with carbon neutral power. In addition to a broad base of customers, AWS has a vibrant partner ecosystem in Germany that has built innovative solutions and services on AWS.

AWS

AWS Cloud Media Latency

Foundation Model for Personalized Recommendation

Dynatrace supports SnapStart for Lambda as an AWS launch partner

Trending Sources

Title Launch Observability at Netflix Scale

Introducing Netflix’s Key-Value Data Abstraction Layer

What is full stack observability?

Nine ways technology executives can get significant business value with the right observability platform

What are quality gates? How to use quality gates to deliver better software at speed and scale

Dynatrace supports the newly released AWS Lambda Response Streaming

Rebuilding Netflix Video Processing Pipeline with Microservices

Dynatrace automatically monitors OpenAI ChatGPT for companies that deliver reliable, cost-effective services powered by generative AI

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

Build systems more reliably with Dynatrace: Chaos Engineering

What is AWS Lambda?

For your eyes only: improving Netflix video quality with neural networks

How Park ‘N Fly eliminated silos and improved customer experience with Dynatrace cloud monitoring

Site reliability engineering: 5 things you need to know

Service level objectives: 5 SLOs to get started

What is observability? Not just logs, metrics and traces

Netflix Cloud Packaging in the Terabyte Era

Introducing Netflix TimeSeries Data Abstraction Layer

Site reliability done right: 5 SRE best practices that deliver on business objectives

DevOps observability: A guide for DevOps and DevSecOps teams

Site reliability engineering: 5 things to you need to know

Dynatrace accelerates business transformation with new AI observability solution

SRE vs DevOps: What you need to know

Real-time business analytics with Dynatrace: Unleashing the treasure trove of insights from your observability data

Rapid Event Notification System at Netflix

Taming DORA compliance with AI, observability, and security

Allez, rendez-vous à Paris – An AWS Region is coming to France!

Why growing AI adoption requires an AI observability strategy

These 7 Edge Data Challenges Will Test Companies the Most in 2025

Application observability meets developer observability: Unlock a 360º view of your environment

Interpreting A/B test results: false negatives and power

Netflix Video Quality at Scale with Cosmos Microservices

Managing risk for financial services: The secret to visibility and control during times of volatility

The Anna Key-Value Store Now Has 355x the Performance of DynamoDB for the Dollar

What are SLOs? How service-level objectives work with SLIs to deliver on SLAs

Unlock the power of contextual log analytics

Common SLO pitfalls and how to avoid them

Service level objective examples: 5 SLO examples for faster, more reliable apps

Amazon EC2 Cluster GPU Instances - All Things Distributed

Ciao Milano! – An AWS Region is coming to Italy!

Expanding the Cloud – An AWS Region is coming to Hong Kong

Expanding the Cloud – Introducing the AWS EU (Frankfurt) Region

Stay Connected