Availability and Processing - Technology Performance Pulse

Dynatrace SaaS on Azure now Generally Available

Dynatrace

FEBRUARY 2, 2022

In September, we announced the availability of the Dynatrace Software Intelligence Platform on Microsoft Azure as a SaaS solution and natively in the Azure portal. Today, we are excited to provide an update that Dynatrace SaaS on Azure is now generally available (GA) to the public through Dynatrace sales channels. Dynatrace news.

Azure

Azure Availability Hardware Innovation

Dynatrace on Microsoft Azure in Australia enables regional customers to leverage AI-powered observability

Dynatrace

OCTOBER 28, 2024

Dynatrace on Microsoft Azure allows enterprises to streamline deployment, gain critical insights, and automate manual processes. As of October 2024, Dynatrace is available on Microsoft Azure Australia East region, enabling joint customers to maintain a local SaaS presence. The result?

Azure

Azure Latency Infrastructure Cloud

Managing PostgreSQL® High Availability – Part I: PostgreSQL Automatic Failover

Scalegrid

SEPTEMBER 5, 2024

Managing High Availability (HA) in your PostgreSQL hosting is very important to ensuring your database deployment clusters maintain exceptional uptime and strong operational performance so your data is always available to your application. Effective management of failover and switchover operations is crucial for high availability.

Availability

Availability Servers Database Open Source

Business Flow: Why IT operations teams should monitor business processes

Dynatrace

MARCH 12, 2024

The business process observability challenge Increasingly dynamic business conditions demand business agility; reacting to a supply chain disruption and optimizing order fulfillment are simple but illustrative examples. Most business processes are not monitored. First and foremost, it’s a data problem.

Processing

Processing Monitoring Analytics C++

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Dynatrace

JULY 15, 2024

As HTTP and browser monitors cover the application level of the ISO /OSI model , successful executions of synthetic tests indicate that availability and performance meet the expected thresholds of your entire technological stack. Our script, available on GitHub , provides details. Are the corresponding services running on those hosts?

Availability

Availability Network Monitoring Infrastructure

Flexible, scalable, self-service Kubernetes native observability now in General Availability

Dynatrace

MAY 17, 2022

The application consists of several microservices that are available as pod-backed services. From here we jump directly into Dynatrace Distributed traces view, shown below, to understand code-level contributions to total processing time. Information about each of these topics will be available in upcoming announcements.

Availability

Availability Scalability Cloud Metrics

OpenPipeline: Simplify access to critical business data

Dynatrace

NOVEMBER 4, 2024

Dynatrace OpenPipeline is a new stream processing technology that ingests and contextualizes data from any source. Business process monitoring and optimization. Most of the use cases in these two broad categories benefit from the flexibility that comes from multiple available sources of business data.

Analytics

Analytics Airlines Metrics Monitoring

Reliability indicators that matter to your business: SLOs for all data types

Dynatrace

OCTOBER 31, 2024

This lets you build your SLOs around the indicators that matter to you and your customers—critical metrics related to availability, failure rates, request response times, or select logs and business events. Hence, having a dedicated dashboard tile visualizing the key parameters of each SLO simplifies the process of evaluating them.

Metrics

Metrics Availability Monitoring Scalability

Don’t just react: How executives can predict and prevent outages to maximize availability

Dynatrace

OCTOBER 3, 2024

The end goal, of course, is to optimize the availability of organizations’ software. Dynatrace is widely recognized for its AI capabilities’ ability to predict and prevent issues, and automatically identify root causes, maximizing availability.

Availability

Availability DevOps Analytics Cloud

Dynatrace observability now available for Red Hat OpenShift on IBM Z and LinuxONE mainframes

Dynatrace

JULY 24, 2024

By leveraging Dynatrace observability on Red Hat OpenShift running on Linux, you can accelerate modernization to hybrid cloud and increase operational efficiencies with greater visibility across the full stack from hardware through application processes. Dynatrace observability is available for Red Hat OpenShift on IBM Power.

Availability

Availability Infrastructure Metrics Hardware

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

Both categories share common requirements, such as high throughput and high availability. Eventually Consistent Global Counter While some users may accept the limitations of a Best-Effort counter, others opt for precise counts, durability and global availability.

Latency

Latency Cache Infrastructure Strategy

Leverage logs for an end-to-end view of your business processes via Dynatrace OpenPipeline

Dynatrace

SEPTEMBER 27, 2024

Unrealized optimization potential of business processes due to monitoring gaps Imagine a retail company facing gaps in its business process monitoring due to disparate data sources. Due to separated systems that handle different parts of the process, the view of the process is fragmented.

Processing

Processing Retail Analytics Monitoring

Dare to debug production with Dynatrace Live Debugger

Dynatrace

FEBRUARY 4, 2025

A production bug is the worst; besides impacting customer experience, you need special access privileges, making the process far more time-consuming. It also makes the process risky as production servers might be more exposed, leading to the need for real-time production data. This cumbersome process should not be the norm.

Open Source

Open Source Code Engineering Best Practices

2. Diving Deeper into Psyberg: Stateless vs Stateful Data Processing

The Netflix TechBlog

NOVEMBER 14, 2023

By Abhinaya Shetty , Bharath Mummadisetty In the inaugural blog post of this series, we introduced you to the state of our pipelines before Psyberg and the challenges with incremental processing that led us to create the Psyberg framework within Netflix’s Membership and Finance data engineering team.

Processing

Processing Data Engineering Efficiency Analytics

New integrations announced at AWS re:Invent enhance cloud performance, security, and automation

Dynatrace

DECEMBER 3, 2024

Streamlining observability with Dynatrace OneAgent on AWS Image Builder In our ongoing collaboration with AWS, we’re excited to make the Dynatrace OneAgent available as a first-class integration on AWS Image Builder via the AWS Marketplace.

AWS

AWS Cloud Performance Innovation

New Distributed Tracing app provides effortless trace insights

Dynatrace

OCTOBER 23, 2024

Automatic data capture and display: More data, including span attributes, is available for out-of-the-box analysis, with no additional configuration necessary. As soon as the new Distributed Tracing Experience is available for your environment, you’ll see a teaser banner in your classic Distributed Traces app.

Tuning

Tuning Website Availability Performance

Simplify log onboarding: From zero to observability in minutes

Dynatrace

MARCH 5, 2025

The newly introduced step-by-step guidance streamlines the process, while quick data flow validation accelerates the onboarding experience even for power users. Step-by-step setup The log ingestion wizard guides you through the prerequisites and provides ready-to-use command examples to start the installation process.

Open Source

Open Source IoT Cloud Azure

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

Dynatrace

JANUARY 31, 2024

Organizations choose data-driven approaches to maximize the value of their data, achieve better business outcomes, and realize cost savings by improving their products, services, and processes. Data is then dynamically routed into pipelines for further processing. Commitment to privacy.

Analytics

Analytics Processing Transportation Storage

New continuous compliance requirements drive the need to converge observability and security

Dynatrace

DECEMBER 12, 2024

Boost your operational resilience: Combining availability and security is now essential. Carefully planning and integrating new processes and tools is critical to ensuring compliance without disrupting daily operations. Its time to adopt a unified observability and security approach.

Analytics

Analytics Government Efficiency Innovation

Create simple workflows to automate alerts during development

Dynatrace

JANUARY 22, 2025

Dynatrace Simple Workflows make this process automatic and frictionlessthere is no additional cost for workflows. Why manual alerting falls short As your product and deployments scale horizontally and vertically, the sheer volume of data makes it impossible for teams to catch every error quickly using manual processes.

Development

Development Processing Monitoring Code

Level up your strategic IT management with fully cost-transparent, fine-grained Dynatrace Cost Allocation

Dynatrace

NOVEMBER 27, 2024

Sometimes, introducing new IT solutions is delayed or canceled because a single business unit can’t manage the operating costs alone, and per-department cost insights that could facilitate cost sharing aren’t available. In scenarios like these, automated and precise cost allocation can make a huge difference.

Best Practices

Best Practices Strategy Cloud Efficiency

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Dynatrace

JANUARY 21, 2025

Ensuring smooth operations is no small feat, whether you’re in charge of application performance, IT infrastructure, or business processes. Activate Davis AI to analyze charts within seconds Davis AI can help you expand your dashboards and dive deeper into your available data to extract additional information.

Traffic

Traffic Metrics Analytics Monitoring

Globalizing Productions with Netflix’s Media Production Suite

The Netflix TechBlog

MARCH 31, 2025

A lack of automation and standardization often results in a labour-intensive process across post-production and VFX with a lot of dependencies that introduce potential human errors and security risks. Depending on the market, or production budget, cutting-edge technology might not be available or affordable.

Media

Media Logistics Innovation Cloud

SLOs for Kubernetes clusters: Optimize resource utilization of Kubernetes clusters with service-level objectives

Dynatrace

NOVEMBER 11, 2024

However, due to the fact that they boil down selected indicators to single values and track error budget levels, they also offer a suitable way to monitor optimization processes while aligning on single values to meet overall goals. By recognizing the insights provided, you can optimize processes and improve overall efficiency.

Efficiency

Efficiency Best Practices Monitoring Cloud

Dynatrace Observability for Developers saves time with real-time data

Dynatrace

FEBRUARY 4, 2025

As every developer knows, logs are crucial for uncovering insights and detecting fundamental flaws, such as process crashes or exceptions. Using Live Debugger, we immediately get insights into the running code, including variable values, process and thread information, and even a trace ID for the captured transaction.

Development

Development Analytics Code Architecture

OpenTelemetry histograms reveal patterns, outliers, and trends

Dynatrace

OCTOBER 10, 2024

This feature, available by default for OTel-instrumented services, allows users a standard way to measure and compare response times across different services consistently. But for now, percentile calculation and buckets are available only for explicit bucket histograms.

Metrics

Metrics Monitoring Efficiency Availability

Demo: Transform OpenTelemetry data into actionable insights with the Dynatrace Distributed Tracing app

Dynatrace

OCTOBER 29, 2024

In addition to service-level monitoring, certain services within the OpenTelemetry demo application expose process-level metrics, such as CPU and memory consumption, number of threads, or heap size for services written in different languages. For this purpose, we’ll use the in-built failure scenarios included in the OpenTelemetry demo.

Metrics

Metrics Tuning Monitoring Availability

Transform data into insights with Dynatrace Dashboards and Notebooks

Dynatrace

OCTOBER 16, 2024

Kickstart your creation journey using ready-made dashboards and notebooks Creating dashboards and notebooks from scratch can take time, particularly when figuring out available data and how to best use it. Kickstarting the dashboard creation process is, however, just one advantage of ready-made dashboards.

Social Media

Social Media Metrics Network Analytics

Build resilient IT systems and manage regulatory requirements with compliance and resilience capabilities from Dynatrace

Dynatrace

JANUARY 15, 2025

Smartscape topology visualizes the relationships between applications, services, processes, hosts, and data centers, highlighting problems and vulnerabilities. Site Reliability Guardian provides an automated change impact analysis to validate service availability, performance, and capacity objectives across various systems.

Systems

Systems DevOps Monitoring Analytics

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

It requires a state-of-the-art system that can track and process these impressions while maintaining a detailed history of each profiles exposure. In this multi-part blog series, we take you behind the scenes of our system that processes billions of impressions daily.

Tuning

Tuning Latency Efficiency Storage

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

Implementing clustering and quorum queues in RabbitMQ significantly improves load distribution and data redundancy, ensuring high availability and fault tolerance for messaging services. Classic queues can be used in clusters, emphasizing their behavior during node failures, particularly regarding durability and availability.

Best Practices

Best Practices Traffic Strategy Scalability

Tailored access management, Part 3: Simplified setup for enterprise-scale access management

Dynatrace

OCTOBER 14, 2024

Access policies for Dynatrace Grail™ data lakehouse are still available as service-related policies; they allow you to control access to the monitoring data on a per-data-source level, for example, logs and metrics. All other default policies on the service level, for example, “AutomationEngine – User” access, are now marked as Legacy.

Monitoring

Monitoring Metrics Systems Scalability

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

The impetus for constructing a foundational recommendation model is based on the paradigm shift in natural language processing (NLP) to large language models (LLMs). To harness this data effectively, we employ a process of interaction tokenization, ensuring meaningful events are identified and redundancies are minimized.

Tuning

Tuning Efficiency Latency Strategy

Microsoft Ignite 2024 guide: Cloud observability for AI transformation

Dynatrace

NOVEMBER 18, 2024

The Grail™ data lakehouse provides fast, auto-indexed, schema-on-read storage with massively parallel processing (MPP) to deliver immediate, contextualized answers from all data at scale. By prioritizing observability, organizations can ensure the availability, performance, and security of business-critical applications.

Cloud

Cloud Azure Artificial Intelligence Innovation

Debug complex performance issues in production with ease

Dynatrace

FEBRUARY 4, 2025

Using this data, developers can inspect local variables, server-process details, thread information, and trace data to identify the root cause of issues. In this case, the debugging process reveals there are background threads potentially consuming excessive CPU resources.

Performance

Performance Code Processing Availability

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. RabbitMQ follows a message broker model with advanced routing, while Kafkas event streaming architecture uses partitioned logs for distributed processing. What is Apache Kafka?

Latency

Latency Analytics Architecture Storage

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

Dynatrace

APRIL 10, 2025

IT teams must now ingest petabytes of data and then store, process, and query it cost-effectively and securely. That volume and flexibility eliminate the need for extra data ingest tools and ease data normalization, filtering, and pre-processing, which makes data more reliable.

Strategy

Strategy Storage Network Architecture

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

Dynatrace

MARCH 5, 2025

Integration with existing systems and processes : Integration with existing IT infrastructure, observability solutions, and workflows often requires significant investment and customization. The certification results are now publicly available.

Energy

Energy Analytics Traffic Cloud

Overview of Telemetry for Kubernetes Clusters: Enhancing Observability and Monitoring

DZone

APRIL 14, 2025

However, it is not an easy task to maintain transparency in and monitor availability and performance of Kubernetes clusters. Telemetry in Kubernetes involves collecting, processing, and visualization of cluster information for cluster health, fault diagnostics, and performance optimizations. That is where telemetry comes in.

Monitoring

Monitoring Best Practices Software Software

KubeCon EU 2025 retrospective: Reflections from my sixth KubeCon

Dynatrace

APRIL 15, 2025

OpenTelemetry provides us with a standard for generating, collecting, and emitting telemetry, and we have existing tooling that leverages OTel data to help us understand work processes and workflows. Fun fact: the OTel docs are now available in English, Spanish, French, Japanese, Portuguese, and Chinese!

Open Source

Open Source Energy Benchmarking Tuning

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

As a result, requests are uniformly handled, and responses are processed cohesively. This data is processed from a real-time impressions stream into a Kafka queue, which our title health system regularly polls. Many of the metadata and assets involved in title setup have specific timelines for when they become available to members.

Traffic

Traffic Strategy Entertainment Innovation

Grafana Loki Fundamentals and Architecture

DZone

FEBRUARY 28, 2025

Grafana Loki is a horizontally scalable, highly available log aggregation system. Logs can also be transformed appropriately for presentation, for example, or further pipeline processing. It is designed for simplicity and cost-efficiency. Loki can provide a comprehensive log journey.

Architecture

Architecture Scalability Efficiency Cloud

CrowdStrike incident takeaways: Revisiting vendor quality control and release standards to minimize outage exposure

Dynatrace

JULY 25, 2024

A key learning from the outage caused by the faulty CrowdStrike “Rapid Response” update is how critical it is to understand your vendors’ quality control and release processes. What is your testing process? A variety of events and circumstances can cause an outage. A variety of events and circumstances can cause an outage.

Strategy

Strategy Monitoring Open Source Testing

Dynatrace SaaS on Azure now Generally Available

Dynatrace on Microsoft Azure in Australia enables regional customers to leverage AI-powered observability

Trending Sources

Managing PostgreSQL® High Availability – Part I: PostgreSQL Automatic Failover

Business Flow: Why IT operations teams should monitor business processes

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Flexible, scalable, self-service Kubernetes native observability now in General Availability

OpenPipeline: Simplify access to critical business data

Reliability indicators that matter to your business: SLOs for all data types

Don’t just react: How executives can predict and prevent outages to maximize availability

Dynatrace observability now available for Red Hat OpenShift on IBM Z and LinuxONE mainframes

Netflix’s Distributed Counter Abstraction

Leverage logs for an end-to-end view of your business processes via Dynatrace OpenPipeline

Dare to debug production with Dynatrace Live Debugger

2. Diving Deeper into Psyberg: Stateless vs Stateful Data Processing

New integrations announced at AWS re:Invent enhance cloud performance, security, and automation

New Distributed Tracing app provides effortless trace insights

Simplify log onboarding: From zero to observability in minutes

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

New continuous compliance requirements drive the need to converge observability and security

Create simple workflows to automate alerts during development

Level up your strategic IT management with fully cost-transparent, fine-grained Dynatrace Cost Allocation

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Globalizing Productions with Netflix’s Media Production Suite

SLOs for Kubernetes clusters: Optimize resource utilization of Kubernetes clusters with service-level objectives

Dynatrace Observability for Developers saves time with real-time data

OpenTelemetry histograms reveal patterns, outliers, and trends

Demo: Transform OpenTelemetry data into actionable insights with the Dynatrace Distributed Tracing app

Transform data into insights with Dynatrace Dashboards and Notebooks

Build resilient IT systems and manage regulatory requirements with compliance and resilience capabilities from Dynatrace

Introducing Impressions at Netflix

Best Practices for Scaling RabbitMQ

Tailored access management, Part 3: Simplified setup for enterprise-scale access management

Foundation Model for Personalized Recommendation

Microsoft Ignite 2024 guide: Cloud observability for AI transformation

Debug complex performance issues in production with ease

RabbitMQ vs. Kafka: Key Differences

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

Overview of Telemetry for Kubernetes Clusters: Enhancing Observability and Monitoring

KubeCon EU 2025 retrospective: Reflections from my sixth KubeCon

Title Launch Observability at Netflix Scale

Top PostgreSQL 17 New Features

Grafana Loki Fundamentals and Architecture

CrowdStrike incident takeaways: Revisiting vendor quality control and release standards to minimize outage exposure

Stay Connected