Business events: Delivering the best data. It’s been two years since we introduced business events, a special class of events designed to support even the most demanding business use cases. Business event ingestion and analysis with log files. OpenPipeline: Simplify access and unify business events from anywhere.
By: Rajiv Shringi, Oleksii Tkachuk, Kartik Sathyanarayanan. Introduction: In our previous blog post, we introduced Netflix’s TimeSeries Abstraction, a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.
This lets you build your SLOs around the indicators that matter to you and your customers—critical metrics related to availability, failure rates, request response times, or select logs and business events. Are you experiencing an increase or degradation in certain events that indicate a rising problem?
To this end, we developed a Rapid Event Notification System (RENO) to support use cases that require server-initiated communication with devices in a scalable and extensible manner. In this blog post, we will give an overview of the Rapid Event Notification System at Netflix and share some of the learnings we gained along the way.
Streamlining observability with Dynatrace OneAgent on AWS Image Builder In our ongoing collaboration with AWS, we’re excited to make the Dynatrace OneAgent available as a first-class integration on AWS Image Builder via the AWS Marketplace.
Dynatrace Simple Workflows make this process automatic and frictionless; there is no additional cost for workflows. Why manual alerting falls short: as your product and deployments scale horizontally and vertically, the sheer volume of data makes it impossible for teams to catch every error quickly using manual processes.
You now want to detect such events automatically by creating a custom Dynatrace security event. Ingest query results as security events: the simplest way to do this is to use Dynatrace OpenPipeline. Set up a custom pipeline: the best way to set up security event ingestion to Dynatrace is via Dynatrace OpenPipeline.
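As a rough illustration of what such ingestion can look like, here is a minimal sketch that posts a custom security event to a Dynatrace environment over HTTP. The environment URL, token, event type, and payload fields are placeholder assumptions, not the documented OpenPipeline configuration; consult the Dynatrace docs for the real setup.

```python
# Hypothetical sketch: pushing a custom security event to Dynatrace.
# Endpoint path, token, and payload fields are assumptions.
import requests

DT_ENV = "https://abc12345.live.dynatrace.com"  # placeholder environment URL
API_TOKEN = "dt0c01.example-token"              # placeholder API token

event = {
    "eventType": "CUSTOM_ALERT",                # assumed event type
    "title": "Suspicious login pattern detected",
    "properties": {"source": "custom-query", "severity": "HIGH"},
}

resp = requests.post(
    f"{DT_ENV}/api/v2/events/ingest",           # generic events ingest API
    headers={"Authorization": f"Api-Token {API_TOKEN}"},
    json=event,
    timeout=10,
)
resp.raise_for_status()
```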
The business process observability challenge Increasingly dynamic business conditions demand business agility; reacting to a supply chain disruption and optimizing order fulfillment are simple but illustrative examples. Most business processes are not monitored. First and foremost, it’s a data problem.
A Data Movement and Processing Platform @ Netflix. By Bo Lei, Guilherme Pires, James Shao, Kasturi Chatterjee, Sujay Jain, Vlad Sydorenko. Background: Real-time processing technologies (a.k.a. stream processing) are among the key factors that enable Netflix to maintain its leading position in entertaining our users.
Unrealized optimization potential of business processes due to monitoring gaps. Imagine a retail company facing gaps in its business process monitoring due to disparate data sources. Because separate systems handle different parts of the process, the view of the process is fragmented.
Business events, powered by our new Grail™ data lakehouse and by other Dynatrace platform technologies, ensure the real-time precision that business and IT teams need to make data-driven decisions and improve business outcomes. Business events deliver the industry’s broadest, deepest, and easiest access to your critical business data.
Managing high availability (HA) in your PostgreSQL hosting is critical to ensuring that your database clusters maintain exceptional uptime and strong operational performance, so your data is always available to your application. Effective management of failover and switchover operations is crucial for high availability.
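To make the failover step concrete, below is a minimal sketch of manually promoting a PostgreSQL standby using the built-in pg_promote() function (available since PostgreSQL 12). The host and credentials are placeholders, and a production HA setup would normally delegate this to dedicated tooling such as Patroni or repmgr rather than a bare script.

```python
# Minimal sketch: promoting a PostgreSQL standby during failover.
# Assumes PostgreSQL 12+ (pg_promote) and the psycopg2 driver.
import psycopg2

conn = psycopg2.connect(
    host="standby.example.com",  # placeholder standby host
    dbname="postgres",
    user="postgres",
)
conn.autocommit = True
with conn.cursor() as cur:
    # pg_promote(true) only works on a standby; it blocks until
    # promotion completes and returns a boolean success flag.
    cur.execute("SELECT pg_promote(true);")
    promoted = cur.fetchone()[0]
    print("Standby promoted:", promoted)
conn.close()
```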
There are three high-level steps to set up the database business-event stream. Step-by-step: Set up a custom MySQL database extension. Now we’ll show you, step by step, how to create a custom MySQL database extension for querying and pushing business data to the Dynatrace business events endpoint. Don’t rename the file.
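The core loop such an extension performs can be sketched as follows: query recent rows from MySQL, then push each one to the Dynatrace business events ingest endpoint. The table, columns, host names, token, and event attribute names below are illustrative assumptions, not the extension framework’s actual code.

```python
# Sketch of the extension's core loop: query MySQL, push rows as
# business events. Table, columns, and token are placeholders.
import mysql.connector
import requests

DT_ENV = "https://abc12345.live.dynatrace.com"
API_TOKEN = "dt0c01.example-token"

db = mysql.connector.connect(
    host="db.example.com", user="reader", password="secret", database="shop"
)
cur = db.cursor(dictionary=True)
cur.execute(
    "SELECT order_id, status FROM orders "
    "WHERE updated_at > NOW() - INTERVAL 5 MINUTE"
)

for row in cur:
    event = {
        "event.provider": "mysql-extension",      # assumed attribute names
        "event.type": "com.example.order-update",
        **row,
    }
    requests.post(
        f"{DT_ENV}/api/v2/bizevents/ingest",
        headers={
            "Authorization": f"Api-Token {API_TOKEN}",
            "Content-Type": "application/json",
        },
        json=event,
        timeout=10,
    ).raise_for_status()

cur.close()
db.close()
```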
It requires a state-of-the-art system that can track and process these impressions while maintaining a detailed history of each profile’s exposure. In this multi-part blog series, we take you behind the scenes of our system that processes billions of impressions daily.
Business events are a special class of events, new to Business Analytics; together with Grail, our data lakehouse, they provide the precision and advanced analytics capabilities required by your most important business use cases. What are business events? This diagram shows a few examples of business events.
Kickstart your creation journey using ready-made dashboards and notebooks Creating dashboards and notebooks from scratch can take time, particularly when figuring out available data and how to best use it. Kickstarting the dashboard creation process is, however, just one advantage of ready-made dashboards.
Even worse, if your service logs record critical events such as errors in a non-standard way, those errors might go unnoticed by your observability team. Whether a web server, mobile app, backend service, or other custom application, log data can provide you with deep insights into your software’s operations and events.
The application consists of several microservices that are available as pod-backed services. In addition to logs and events, Dynatrace surfaces logs streamed from Fluentd so that you can analyze those logs in context with traces and services. Information about each of these topics will be available in upcoming announcements.
By Abhinaya Shetty , Bharath Mummadisetty In the inaugural blog post of this series, we introduced you to the state of our pipelines before Psyberg and the challenges with incremental processing that led us to create the Psyberg framework within Netflix’s Membership and Finance data engineering team.
RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. Kafka is optimized for high-throughput event streaming, excelling in real-time analytics and large-scale data ingestion. What is Apache Kafka?
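A minimal sketch of Kafka’s high-throughput publishing model, using the kafka-python client; the broker address and topic name are placeholders. The linger and acks settings illustrate the usual throughput-versus-durability trade-off.

```python
# Minimal sketch of event publishing with kafka-python.
# Broker address and topic are placeholders.
import json
from kafka import KafkaProducer

producer = KafkaProducer(
    bootstrap_servers="broker.example.com:9092",
    value_serializer=lambda v: json.dumps(v).encode("utf-8"),
    linger_ms=20,   # batch messages briefly to boost throughput
    acks="all",     # wait for full replication for durability
)

for i in range(1000):
    producer.send("clickstream", {"user": i, "action": "page_view"})

producer.flush()  # block until all buffered records are delivered
producer.close()
```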
The nirvana state of system uptime at peak loads is known as “five-nines availability.” In its pursuit, IT teams hover over system performance dashboards hoping their preparations will deliver five nines, or even four nines, of availability. But is five nines availability attainable? Consider the downtime budget each level implies: 90% availability (one nine) allows more than 36 days of downtime per year.
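The “nines” translate directly into a yearly downtime budget; the short calculation below makes the scale of the difference explicit.

```python
# Downtime budget per year implied by each availability level.
MINUTES_PER_YEAR = 365.25 * 24 * 60

for nines in range(1, 6):
    availability = 1 - 10 ** -nines      # 0.9, 0.99, ..., 0.99999
    downtime_min = (1 - availability) * MINUTES_PER_YEAR
    print(f"{availability:.5%} ({nines} nine(s)): "
          f"{downtime_min:,.1f} minutes/year")
# Five nines works out to roughly 5.3 minutes of downtime per year.
```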
Sometimes, introducing new IT solutions is delayed or canceled because a single business unit can’t manage the operating costs alone, and per-department cost insights that could facilitate cost sharing aren’t available. Figure 4: Set up an anomaly detector for peak cost events.
Ensuring smooth operations is no small feat, whether you’re in charge of application performance, IT infrastructure, or business processes. Activate Davis AI to analyze charts within seconds Davis AI can help you expand your dashboards and dive deeper into your available data to extract additional information.
The volume of data and events grows in tandem with the rising complexity of IT infrastructure. SNMP provides access to availability and performance indicators. While SNMP allows you to query monitored devices for performance information, SNMP traps are used to proactively report certain types of events.
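For the polling side of SNMP, here is a minimal GET sketch using pysnmp’s high-level API (pysnmp 4.x assumed); the device host and community string are placeholders. Traps work the other way around and would instead require a listener that receives notifications pushed by the device.

```python
# Minimal SNMP GET sketch with pysnmp's high-level API (4.x assumed).
# Host and community string are placeholders.
from pysnmp.hlapi import (
    getCmd, SnmpEngine, CommunityData, UdpTransportTarget,
    ContextData, ObjectType, ObjectIdentity,
)

error_indication, error_status, error_index, var_binds = next(getCmd(
    SnmpEngine(),
    CommunityData("public", mpModel=1),              # SNMP v2c
    UdpTransportTarget(("device.example.com", 161)),
    ContextData(),
    ObjectType(ObjectIdentity("SNMPv2-MIB", "sysUpTime", 0)),
))

if error_indication:
    print(error_indication)
else:
    for name, value in var_binds:
        print(f"{name} = {value}")
```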
Organizations choose data-driven approaches to maximize the value of their data, achieve better business outcomes, and realize cost savings by improving their products, services, and processes. Data is then dynamically routed into pipelines for further processing. Understanding the context. Addressing security requirements.
By leveraging Dynatrace observability on Red Hat OpenShift running on Linux, you can accelerate modernization to hybrid cloud and increase operational efficiencies with greater visibility across the full stack from hardware through application processes. Dynatrace observability is available for Red Hat OpenShift on IBM Power.
Smartscape topology visualizes the relationships between applications, services, processes, hosts, and data centers, highlighting problems and vulnerabilities. Site Reliability Guardian provides an automated change impact analysis to validate service availability, performance, and capacity objectives across various systems.
The end goal, of course, is to optimize the availability of organizations’ software. Dynatrace is widely recognized for AI capabilities that predict and prevent issues and automatically identify root causes, maximizing availability.
Given our relatively frequent releases, this means that you can benefit from 11 to 12 OneAgent updates a year that are deployed as soon as they are available for your environment. Each maintenance window can be defined either as a one-off event or a recurring event. Just select Update ActiveGate.
Among the spectrum of methodologies available for this task, batch processing is often considered an old guard, especially with the advent of real-time and event-based processing technologies. However, it would be a mistake to dismiss batch processing as an antiquated approach.
We’re happy to announce the General Availability of cross-environment dashboarding capabilities (having released this functionality in an Early Adopter release with Dynatrace version 1.172 back in June 2019). Keep the token secret available for the second and final configuration step.
Dynatrace Managed is intrinsically highly available, as it stores three copies of all events, user sessions, and metrics across its cluster nodes. Our Premium High Availability comes with the following features: active-active deployment model for optimum hardware utilization.
The impetus for constructing a foundational recommendation model comes from the paradigm shift in natural language processing (NLP) toward large language models (LLMs). To harness this data effectively, we employ a process of interaction tokenization, ensuring meaningful events are identified and redundancies are minimized.
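To give a feel for the idea, here is a hypothetical sketch of interaction tokenization: consecutive redundant events are collapsed, and the remainder is mapped to vocabulary ids. This is purely illustrative and not Netflix’s actual implementation; the event names and vocabulary are invented.

```python
# Hypothetical sketch of interaction tokenization: collapse consecutive
# duplicate events, then map the remainder to token ids.
from itertools import groupby

VOCAB = {"play": 1, "pause": 2, "browse": 3, "rate": 4}  # toy vocabulary

def tokenize(events):
    """Drop consecutive duplicates, then map events to token ids."""
    deduped = [key for key, _ in groupby(events)]
    return [VOCAB[e] for e in deduped if e in VOCAB]

history = ["browse", "browse", "browse", "play", "pause", "pause", "rate"]
print(tokenize(history))  # [3, 1, 2, 4] -- 7 raw events become 4 tokens
```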
Upon detecting a high CPU load, Davis AI generates a problem event and populates it with a direct link to Live Debugger. Using this data, developers can inspect local variables, server-process details, thread information, and trace data to identify the root cause of issues.
This powerful tool can be leveraged across various environments, including production, to enhance development processes and ensure robust application performance. Many developers attempt to mitigate this challenge with logs, but that’s a tedious and error-prone process. Distributed services involve multiple processes and runtimes.
Implementing clustering and quorum queues in RabbitMQ significantly improves load distribution and data redundancy, ensuring high availability and fault tolerance for messaging services. Classic queues can also be used in clusters, though their behavior during node failures, particularly regarding durability and availability, differs.
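As a small sketch of the mechanics, declaring a quorum queue from a client is a one-argument change: the "x-queue-type" argument selects RabbitMQ’s replicated quorum queue type. The host and queue name below are placeholders; the example uses the pika client.

```python
# Sketch: declaring a replicated quorum queue with pika.
# Host and queue name are placeholders.
import pika

connection = pika.BlockingConnection(
    pika.ConnectionParameters(host="rabbit.example.com")
)
channel = connection.channel()

channel.queue_declare(
    queue="orders",
    durable=True,                          # quorum queues must be durable
    arguments={"x-queue-type": "quorum"},  # replicated across cluster nodes
)

channel.basic_publish(
    exchange="",
    routing_key="orders",
    body=b'{"order_id": 42}',
    properties=pika.BasicProperties(delivery_mode=2),  # persistent message
)
connection.close()
```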
To allow for this mimicking, many systems implement event handling: they convert our request into a call to the real service, with properties enabled to log when titles are filtered out of their response and why. As a result, requests are uniformly handled, and responses are processed cohesively.
A tight integration between Red Hat Ansible Automation Platform, Dynatrace Davis® AI, and the Dynatrace observability and security platform enables closed-loop remediation that automates the process, from detecting a problem, to managing incidents in corresponding tools, to identifying the root cause and proper countermeasures.
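A hypothetical sketch of that wrapper pattern: forward the request to the real service with filter logging enabled, then record which titles were dropped and why. Every name here (the log_filtering flag, the filtered_out field) is invented for illustration, not the actual service interface.

```python
# Hypothetical sketch of the wrapper pattern described above.
# The log_filtering flag and filtered_out field are invented names.
def call_with_filter_logging(service, request):
    response = service.handle(request, log_filtering=True)  # assumed flag
    for title, reason in response.filtered_out:             # assumed field
        print(f"title {title} filtered out: {reason}")
    return response
```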
Define custom events that can either trigger deeper analysis or contribute additional contextual information to Davis. The improved configuration workflow for custom event alerting offers a lot of power in terms of defining additional metric-based events for your Dynatrace environment. We opened up the Davis 2.0
Today, development teams suffer from a lack of automation for time-consuming tasks, the absence of standardization due to an overabundance of tool options, and insufficiently mature DevSecOps processes. This process begins when the developer merges a code change and ends when it is running in a production environment.
Controller Manager: Runs controllers such as the node controller, which is responsible for handling node availability. Keeping an eye on your control plane is critical to ensuring the high availability and health of your self-managed OpenShift Container Platform.
One issue that often complicates this process is the “noisy neighbor” problem. The sched_wakeup and sched_wakeup_new hooks are invoked when a process changes state from ‘sleeping’ to ‘runnable.’ They let us identify when a process is ready to run and is waiting for CPU time.
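A minimal sketch of observing those wakeups with the bcc Python bindings, counting sched_wakeup events per woken pid; this assumes root privileges and the bcc toolkit installed, and is a simplified stand-in for the eBPF instrumentation the post describes.

```python
# Sketch: counting sched_wakeup events per pid with the bcc bindings.
# Requires root and the bcc toolkit; a simplified illustration.
import time
from bcc import BPF

prog = r"""
BPF_HASH(counts, u32, u64);

TRACEPOINT_PROBE(sched, sched_wakeup) {
    u32 pid = args->pid;      // pid of the task being woken up
    counts.increment(pid);
    return 0;
}
"""

b = BPF(text=prog)
time.sleep(5)  # sample wakeups for five seconds
for pid, count in sorted(b["counts"].items(), key=lambda kv: -kv[1].value):
    print(pid.value, count.value)
```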
A key learning from the outage caused by the faulty CrowdStrike “Rapid Response” update is how critical it is to understand your vendors’ quality control and release processes. A variety of events and circumstances can cause an outage. Questions to ask a vendor: How frequently do you release? What is your testing process?
Greenplum Database is a massively parallel processing (MPP) SQL database built on PostgreSQL. When handling large amounts of complex data, or big data, chances are your main machine might start getting crushed by all of the data it has to process to produce your analytics results. Query Optimization.
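The MPP spreading of work is driven by the table’s distribution key. Here is a small sketch of creating a hash-distributed Greenplum table; DISTRIBUTED BY is standard Greenplum DDL, while the host, database, table, and columns are placeholders. Since Greenplum speaks the PostgreSQL wire protocol, psycopg2 works as the client.

```python
# Sketch: creating a Greenplum table with an explicit distribution key
# so rows (and query work) spread evenly across segment hosts.
import psycopg2

conn = psycopg2.connect(host="gp-master.example.com",
                        dbname="analytics", user="gpadmin")
conn.autocommit = True
with conn.cursor() as cur:
    cur.execute("""
        CREATE TABLE page_views (
            view_id   bigint,
            user_id   bigint,
            viewed_at timestamp
        )
        DISTRIBUTED BY (user_id)  -- hash-distribute rows across segments
    """)
conn.close()
```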