Availability, Metrics and Systems - Technology Performance Pulse

OpenPipeline: Simplify access to critical business data

Dynatrace

NOVEMBER 4, 2024

There’s a goldmine of business data traversing your IT systems, yet most of it remains untapped. Other data sources, including APIs and log files — are used to expand access, often to external or proprietary systems. In fact, it’s likely that some of your critical business systems already write business data to log files.

Analytics

Analytics Airlines Metrics Monitoring

Five-nines availability: Always-on infrastructure delivers system availability during the holidays’ peak loads

Dynatrace

NOVEMBER 22, 2022

The nirvana state of system uptime at peak loads is known as “five-nines availability.” In its pursuit, IT teams hover over system performance dashboards hoping their preparations will deliver five nines—or even four nines—availability. But is five nines availability attainable? Downtime per year. 90% (one nine).

Infrastructure

Infrastructure Availability Systems Retail

Reliability indicators that matter to your business: SLOs for all data types

Dynatrace

OCTOBER 31, 2024

This lets you build your SLOs around the indicators that matter to you and your customers—critical metrics related to availability, failure rates, request response times, or select logs and business events. While the SLO management web UI and API are already available, the dashboard tile will be released within the next weeks.

Metrics

Metrics Availability Monitoring Scalability

Rapid Event Notification System at Netflix

The Netflix TechBlog

FEBRUARY 18, 2022

To this end, we developed a Rapid Event Notification System (RENO) to support use cases that require server initiated communication with devices in a scalable and extensible manner. In this blog post, we will give an overview of the Rapid Event Notification System at Netflix and share some of the learnings we gained along the way.

Systems

Systems Traffic Architecture Mobile

HTTP monitors on the latest Dynatrace platform extend insights into the health of your API endpoints and simplify test management

Dynatrace

DECEMBER 18, 2024

But nowadays, with complex and dynamically changing modern IT systems, the last result details might not be enough in some cases. Thanks to the power of Grail, those details are available for all executions stored for the entire retention period during which synthetic results are kept.

Monitoring

Monitoring Testing Metrics Analytics

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Dynatrace

FEBRUARY 6, 2025

Whether you’re a seasoned IT expert or a marketing professional looking to improve business performance, understanding the data available to you is essential. Even if infrastructure metrics aren’t your thing, you’re welcome to join us on this creative journey simply swap out the suggested metrics for ones that interest you.

Metrics

Metrics Infrastructure Monitoring Best Practices

Tailored access management, Part 3: Simplified setup for enterprise-scale access management

Dynatrace

OCTOBER 14, 2024

Manage the complexity of authorization systems Most modern authorization systems provide access management using Attribute-Based Access Control (ABAC). The system demands significant effort to design, manage, and maintain, especially as an organization’s needs evolve.

Monitoring

Monitoring Metrics Systems Scalability

Elevating System Management: The Role of Monitoring and Observability in DevOps

DZone

JUNE 21, 2023

In the ever-evolving world of DevOps , the ability to gain deep insights into system behavior, diagnose issues, and improve overall performance is one of the top priorities. Monitoring and observability are two key concepts that facilitate this process, offering valuable visibility into the health and performance of systems.

DevOps

DevOps Systems Monitoring Metrics

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Dynatrace

JULY 15, 2024

As HTTP and browser monitors cover the application level of the ISO /OSI model , successful executions of synthetic tests indicate that availability and performance meet the expected thresholds of your entire technological stack. Combined with Dynatrace OneAgent ® , you gain a precise view of the status of your systems at a glance.

Availability

Availability Network Monitoring Infrastructure

What is observability? Not just logs, metrics and traces

Dynatrace

OCTOBER 1, 2021

As dynamic systems architectures increase in complexity and scale, IT teams face mounting pressure to track and respond to conditions and issues across their multi-cloud environments. How do you make a system observable? Dynatrace news. Why is it important, and what can it actually help organizations achieve? What is observability?

Metrics

Metrics Open Source Monitoring Cloud

Flexible, scalable, self-service Kubernetes native observability now in General Availability

Dynatrace

MAY 17, 2022

For years, enterprises managed observability data on a team-by-team basis , using a combination of ticketing systems and configuration management tools. From a cost perspective, internal customers waste valuable time sending tickets to operations teams asking for metrics, logs, and traces to be enabled.

Availability

Availability Scalability Cloud Metrics

Demo: Transform OpenTelemetry data into actionable insights with the Dynatrace Distributed Tracing app

Dynatrace

OCTOBER 29, 2024

A Dynatrace API token with the following permissions: Ingest OpenTelemetry traces ( openTelemetryTrace.ingest ) Ingest metrics ( metrics.ingest ) Ingest logs ( logs.ingest ) To set up the token, see Dynatrace API – Tokens and authentication in Dynatrace documentation. If you don’t have one, you can use a trial account.

Metrics

Metrics Tuning Monitoring Availability

9 key DevOps metrics for success

Dynatrace

SEPTEMBER 28, 2021

As we look at today’s applications, microservices, and DevOps teams, we see leaders are tasked with supporting complex distributed applications using new technologies spread across systems in multiple locations. The emerging concepts of working with DevOps metrics and DevOps KPIs have really come a long way. Deployment frequency.

DevOps

DevOps Metrics Traffic Efficiency

Dynatrace joins the Microsoft Intelligent Security Association

Dynatrace

NOVEMBER 20, 2024

This rising risk amplifies the need for reliable security solutions that integrate with existing systems. Dynatrace, available as an Azure-native service , has a longstanding partnership with Microsoft, deeply rooted in a strong “build with” approach to deliver seamless user experience.

Best Practices

Best Practices Innovation Azure Cloud

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

This approach enhances key DORA metrics and enables early detection of failures in the release process, allowing SREs more time for innovation. This blog post explores the Reliability metric , which measures modern operational practices. Why reliability? More critical services that are likely to take other services down.

Engineering

Engineering Systems Latency Metrics

New continuous compliance requirements drive the need to converge observability and security

Dynatrace

DECEMBER 12, 2024

I realized that our platforms unique ability to contextualize security events, metrics, logs, traces, and user behavior could revolutionize the security domain by converging observability and security. Boost your operational resilience: Combining availability and security is now essential.

Analytics

Analytics Government Efficiency Innovation

Dynatrace observability now available for Red Hat OpenShift on IBM Z and LinuxONE mainframes

Dynatrace

JULY 24, 2024

IBM Z and LinuxONE mainframes running the Linux operating system enable you to respond faster to business demands, protect data from core to cloud, and streamline insights and automation. Telemetry data, such as traces and metrics, allow you to analyze the end-to-end performance of your deployed applications.

Availability

Availability Infrastructure Metrics Hardware

Simplify observability for all your custom metrics (Part 4: Prometheus)

Dynatrace

JANUARY 12, 2021

In Part 1 we explored how you can use the Davis AI to analyze your StatsD metrics. In Part 2 we showed how you can run multidimensional analysis for external metrics that are ingested via the OneAgent Metric API. In Part 3 we discussed how the Davis AI can analyze your metrics from scripting languages like Bash or PowerShell.

Metrics

Metrics Open Source Monitoring Analytics

New analytics capabilities for messaging system-related anomalies

Dynatrace

JANUARY 12, 2022

Messaging systems can significantly improve the reliability, performance, and scalability of the communication processes between applications and services. In serverless and microservices architectures, messaging systems are often used to build asynchronous service-to-service communication. Dynatrace news. This is great!

Analytics

Analytics Systems DevOps Healthcare

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Dynatrace

JANUARY 21, 2025

Chances are, youre a seasoned expert who visualizes meticulously identified key metrics across several sophisticated charts. Activate Davis AI to analyze charts within seconds Davis AI can help you expand your dashboards and dive deeper into your available data to extract additional information.

Traffic

Traffic Metrics Analytics Monitoring

Next-level interaction and customization of data visualizations in Dynatrace Dashboards and Notebooks

Dynatrace

OCTOBER 10, 2024

New: identify hotspots with the honeycomb visualization Honeycombs are great for visualizing health in complex and distributed systems, enabling you to visualize countless entities effectively and at scale. To achieve the best visual outcome, we recommend experimenting with the available customization options.

Latency

Latency Infrastructure Monitoring Metrics

Dynatrace SaaS on Google now generally available

Dynatrace

APRIL 20, 2022

In October 2021, Dynatrace announced the availability of the Dynatrace Software Intelligence Platform on Google Cloud as a software as a service (SaaS) solution. Today, we are excited to announce this SaaS delivery model is now generally available (GA) to the public through Dynatrace sales channels. Dynatrace news.

Google

Google Availability Cloud Infrastructure

Don’t just react: How executives can predict and prevent outages to maximize availability

Dynatrace

OCTOBER 3, 2024

The end goal, of course, is to optimize the availability of organizations’ software. Dynatrace is widely recognized for its AI capabilities’ ability to predict and prevent issues, and automatically identify root causes, maximizing availability. That’s where observability from Dynatrace goes far beyond “observing systems.”

Availability

Availability DevOps Analytics Cloud

Part 1: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

DECEMBER 17, 2024

Analytics Engineers deliver these insights by establishing deep business and product partnerships; translating business challenges into solutions that unblock critical decisions; and designing, building, and maintaining end-to-end analytical systems. DJ acts as a central store where metric definitions can live and evolve.

Analytics

Analytics Engineering Entertainment Metrics

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

Dynatrace

MARCH 5, 2025

Integration with existing systems and processes : Integration with existing IT infrastructure, observability solutions, and workflows often requires significant investment and customization. The certification results are now publicly available. We implemented a wasted energy metric in the app to enhance practitioner actionability.

Energy

Energy Analytics Traffic Cloud

SLOs for Kubernetes clusters: Optimize resource utilization of Kubernetes clusters with service-level objectives

Dynatrace

NOVEMBER 11, 2024

Kubernetes is a widely used open source system for container orchestration. A Kubernetes SLO that continuously evaluates CPU, memory usage, and capacity and compares these available resources to the requested and utilized memory of Kubernetes workloads makes potential resource waste visible, revealing opportunities for countermeasures.

Efficiency

Efficiency Best Practices Monitoring Cloud

Transform log data into actionable metrics and have Davis AI do the work for you

Dynatrace

MARCH 16, 2022

Now, Dynatrace has the ability to turn numerical values from logs into metrics, which unlocks AI-powered answers, context, and automation for your apps and infrastructure, at scale. Key information about your system and applications comes from logs. Manual tracking of metrics from logs is too complex at scale.

Metrics

Metrics Lambda Infrastructure Monitoring

AI-driven analysis of Spring Micrometer metrics in context, with typology at scale

Dynatrace

APRIL 7, 2022

Micrometer is used for instrumenting both out-of-the-box and custom metrics from Spring Boot applications. Davis topology-aware anomaly detection and alerting for your Micrometer metrics. Topology-related custom metrics for seamless reports and alerts. Micrometer uses a registry to export metrics to monitoring systems.

Metrics

Metrics Java Latency Cache

Dynatrace innovates again with the release of topology-driven auto-adaptive metric baselines

Dynatrace

JULY 23, 2020

With the advent and ingestion of thousands of custom metrics into Dynatrace, we’ve once again pushed the boundaries of automatic, AI-based root cause analysis with the introduction of auto-adaptive baselines as a foundational concept for Dynatrace topology-driven timeseries measurements. In many cases, metric behavior changes over time.

Metrics

Metrics Innovation Strategy Monitoring

What is log management? How to tame distributed cloud system complexities

Dynatrace

SEPTEMBER 8, 2022

Log management is an organization’s rules and policies for managing and enabling the creation, transmission, analysis, storage, and other tasks related to IT systems’ and applications’ log data. Metrics, logs , and traces make up three vital prongs of modern observability. How log management systems optimize performance and security.

Cloud

Cloud Systems Analytics DevOps

Microsoft Ignite 2024 guide: Cloud observability for AI transformation

Dynatrace

NOVEMBER 18, 2024

The power of cloud observability Modernizing legacy systems can be challenging, and it’s important to do so with purpose—not just to modernize for its own sake. By prioritizing observability, organizations can ensure the availability, performance, and security of business-critical applications.

Cloud

Cloud Azure Artificial Intelligence Innovation

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

For busy site reliability engineers, ensuring system reliability, scalability, and overall health is an imperative that’s getting harder to achieve in ever-expanding, cloud-native, container-based environments. To get a more granular look into telemetry data, many analysts rely on custom metrics using Prometheus.

Metrics

Metrics Engineering Energy Tuning

The road to observability with OpenTelemetry demo part 1: Identifying metrics and traces

Dynatrace

MAY 17, 2023

Anyone who’s concerned with developing, delivering, and operating software knows the importance of making software and the systems it runs on observable. That is, relying on metrics, logs, and traces to understand what software is doing and where it’s running into snags. OpenTelemetry is a free and open source take on observability.

Metrics

Metrics Open Source Traffic Cache

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

Scaling RabbitMQ ensures your system can handle growing traffic and maintain high performance. Key Takeaways RabbitMQ improves scalability and fault tolerance in distributed systems by decoupling applications, enabling reliable message exchanges.

Best Practices

Best Practices Traffic Strategy Efficiency

Create simple workflows to automate alerts during development

Dynatrace

JANUARY 22, 2025

Your teams want to iterate rapidly but face multiple hurdles: Increased complexity: Microservices and container-based apps generate massive logs and metrics. You can select any trigger thats available for standard workflows, including schedules, problem triggers, customer event triggers, or on-demand triggers.

Development

Development Processing Monitoring Code

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

It requires a state-of-the-art system that can track and process these impressions while maintaining a detailed history of each profiles exposure. In this multi-part blog series, we take you behind the scenes of our system that processes billions of impressions daily.

Tuning

Tuning Latency Efficiency Storage

Supporting Diverse ML Systems at Netflix

The Netflix TechBlog

MARCH 7, 2024

The Machine Learning Platform (MLP) team at Netflix provides an entire ecosystem of tools around Metaflow , an open source machine learning infrastructure framework we started, to empower data scientists and machine learning practitioners to build and manage a variety of ML systems. ETL workflows), as well as downstream (e.g.

Systems

Systems Media Cache Open Source

Digital first, and always: Five critical metrics for measuring customer experience at federal agencies

Dynatrace

NOVEMBER 3, 2023

The five key metrics to improve customer satisfaction To help turn this around, Dynatrace makes available its unified observability platform, which captures all CX interactions and transactions in an automated, intelligent manner – including user session replays. When combined, key metrics will generate an accurate CX index score.

Metrics

Metrics Government Website Monitoring

Connect Fluentd logs with Dynatrace traces, metrics, and topology data to enhance Kubernetes observability

Dynatrace

APRIL 8, 2022

While Fluentd solves the challenges of collecting and normalizing Kubernetes events and logs, Kubernetes performance and availability problems can rarely be solved by investigating logs in isolation. But given the hundreds of pods and processes in modern systems, these errors might not be logically connected.

Metrics

Metrics Analytics Software Architecture Open Source

Introducing Configurable Metaflow

The Netflix TechBlog

DECEMBER 19, 2024

Many of these projects are under constant development by dedicated teams with their own business goals and development best practices, such as the system that supports our content decision makers , or the system that ranks which language subtitles are most valuable for a specific piece ofcontent.

Best Practices

Best Practices Cache Metrics Code

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

The Netflix TechBlog

MAY 4, 2023

Behind the scenes, a myriad of systems and services are involved in orchestrating the product experience. These backend systems are consistently being evolved and optimized to meet and exceed customer and product expectations. It provides a good read on the availability and latency ranges under different production conditions.

Traffic

Traffic Latency Tuning Systems

Distributed tracing with Dynatrace just got even better

Dynatrace

MARCH 11, 2025

The Dynatrace platform now enables comprehensive data exploration and interactive analytics across data sets (trace, logs, events, and metrics)empowering you to solve complex use cases, handle any observability scenario, and gain unprecedented visibility into your systems.

Analytics

Analytics Games Innovation Metrics

Getting Started With Prometheus Workshop: Relabeling Metrics in Prometheus

DZone

JUNE 16, 2023

This workshop is for you, designed to expand your knowledge and understanding of open-source observability tooling that is available to you today. Prometheus is an open-source systems monitoring and alerting tool kit that enables you to hit the ground running with discovering, collecting, and querying your observability today.

Metrics

Metrics Open Source Monitoring Design

How to observe logs with Journald and Dynatrace

Dynatrace

APRIL 4, 2025

Journald provides unified structured logging for systems, services, and applications, eliminating the need for custom parsing for severity or details. It provides unified observability by automatically correlating logs and placing them in the context of traces and metrics.

Analytics

Analytics Operating System Scalability Infrastructure

OpenPipeline: Simplify access to critical business data

Five-nines availability: Always-on infrastructure delivers system availability during the holidays’ peak loads

Trending Sources

Reliability indicators that matter to your business: SLOs for all data types

Rapid Event Notification System at Netflix

HTTP monitors on the latest Dynatrace platform extend insights into the health of your API endpoints and simplify test management

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Tailored access management, Part 3: Simplified setup for enterprise-scale access management

Elevating System Management: The Role of Monitoring and Observability in DevOps

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

What is observability? Not just logs, metrics and traces

Flexible, scalable, self-service Kubernetes native observability now in General Availability

Demo: Transform OpenTelemetry data into actionable insights with the Dynatrace Distributed Tracing app

9 key DevOps metrics for success

Dynatrace joins the Microsoft Intelligent Security Association

Build systems more reliably with Dynatrace: Chaos Engineering

New continuous compliance requirements drive the need to converge observability and security

Dynatrace observability now available for Red Hat OpenShift on IBM Z and LinuxONE mainframes

Simplify observability for all your custom metrics (Part 4: Prometheus)

New analytics capabilities for messaging system-related anomalies

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Next-level interaction and customization of data visualizations in Dynatrace Dashboards and Notebooks

Dynatrace SaaS on Google now generally available

Don’t just react: How executives can predict and prevent outages to maximize availability

Part 1: A Survey of Analytics Engineering Work at Netflix

Dynatrace Cost & Carbon Optimization certified for accuracy and transparency

SLOs for Kubernetes clusters: Optimize resource utilization of Kubernetes clusters with service-level objectives

Transform log data into actionable metrics and have Davis AI do the work for you

AI-driven analysis of Spring Micrometer metrics in context, with typology at scale

Dynatrace innovates again with the release of topology-driven auto-adaptive metric baselines

What is log management? How to tame distributed cloud system complexities

Microsoft Ignite 2024 guide: Cloud observability for AI transformation

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

The road to observability with OpenTelemetry demo part 1: Identifying metrics and traces

Best Practices for Scaling RabbitMQ

Create simple workflows to automate alerts during development

Introducing Impressions at Netflix

Supporting Diverse ML Systems at Netflix

Digital first, and always: Five critical metrics for measuring customer experience at federal agencies

Connect Fluentd logs with Dynatrace traces, metrics, and topology data to enhance Kubernetes observability

Introducing Configurable Metaflow

Migrating Critical Traffic At Scale with No Downtime?—?Part 1

Distributed tracing with Dynatrace just got even better

Getting Started With Prometheus Workshop: Relabeling Metrics in Prometheus

How to observe logs with Journald and Dynatrace

Stay Connected