Engineering and Monitoring - Technology Performance Pulse

HTTP monitors on the latest Dynatrace platform extend insights into the health of your API endpoints and simplify test management

Dynatrace

DECEMBER 18, 2024

Traditional insight into HTTP monitor execution details For nearly two thousand Dynatrace customers, Dynatrace Synthetic HTTP monitors provide insights into the health of monitored endpoints worldwide and around the clock. It now fully supports not only Network Availability Monitors but also HTTP synthetic monitors.

Monitoring

Monitoring Testing Metrics Analytics

Part 2: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

JANUARY 2, 2025

This article is the second in a multi-part series sharing a breadth of Analytics Engineering work at Netflix, recently presented as part of our annual internal Analytics Engineering conference. Need to catch up? Check out Part 1.

Analytics

Analytics Engineering Games Entertainment

Cost-Aware Resilience: Implementing Chaos Engineering Without Breaking the Budget

DZone

APRIL 1, 2025

Chaos engineering is a useful way to test and improve system resilience by intentionally creating controlled failures. However, it can be costly due to resource usage, monitoring needs, and testing in production-like environments. However, their complexity can lead to unexpected failures.

Engineering

Engineering Virtualization Scalability Architecture

O11y Guide: Finding Observability and DevEx Tranquility With Platform Engineering

DZone

JANUARY 7, 2025

Monitoring system behavior is essential for ensuring long-term effectiveness. By integrating observability as a first-class citizen within your platform engineering practices, you can simplify this challenge and stay on track in the ever-evolving cloud-native landscape.

Engineering

Engineering Cloud Monitoring Systems

Catching up with OpenTelemetry in 2025

Dynatrace

FEBRUARY 27, 2025

To get a better idea of OpenTelemetry trends in 2025 and how to get the most out of it in your observability strategy, some of our Dynatrace open-source engineers and advocates picked out the innovations they find most interesting. Because its constantly evolving, staying up to date with the latest in OpenTelemetry is no small feat.

Tuning

Tuning Open Source Innovation Monitoring

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

To enhance reliability, testing the software under these conditions is crucial to prepare for potential issues by leveraging chaos engineering or similar tools. Chaos engineering is a practice that extends beyond traditional failure testing by identifying unpredictable issues. It forms the cornerstone of chaos engineering experiments.

Engineering

Engineering Systems Latency Metrics

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dynatrace

NOVEMBER 7, 2023

Platform engineering is on the rise. According to leading analyst firm Gartner, “80% of software engineering organizations will establish platform teams as internal providers of reusable services, components, and tools for application delivery…” by 2026. All important health signals are highlighted.

Engineering

Engineering DevOps Best Practices Infrastructure

Automate digital excellence with Dynatrace Synthetic Monitoring and Workflows

Dynatrace

JULY 18, 2024

To keep up with current demands, DevOps and platform engineering teams need a solution that can fully embrace and understand complexity, delivering precise answers that enable the creation of trustworthy automation. Automation + Synthetic = Perfect match This is why we integrated Synthetic monitoring in Workflows.

Monitoring

Monitoring DevOps Infrastructure Games

A Kubernetes platform engineering strategy tames Kubernetes complexity

Dynatrace

JULY 25, 2024

I spoke with Martin Spier, PicPay’s VP of Engineering, about the challenges PicPay experienced and the Kubernetes platform engineering strategy his team adopted in response. Taking a strategic Kubernetes platform engineering approach Spier noted that keeping Kubernetes simple requires a strategic approach.

Strategy

Strategy Engineering Open Source Java

What Is a Performance Engineer and How to Become One: Part 1

DZone

OCTOBER 8, 2024

A performance engineer is actually a professional performance testing and engineering expert with in-depth knowledge of many load-testing tools like LoadRunner, JMeter, Neoload, Gatling, K6, etc., and must have extensive experience in specialized skills.

Engineering

Engineering Blockchain Healthcare IoT

How platform engineering and IDP observability can accelerate developer velocity

Dynatrace

MARCH 6, 2024

As organizations look to expand DevOps maturity, improve operational efficiency, and increase developer velocity, they are embracing platform engineering as a key driver. Platform engineering: Build for self-service Self-service deployment is a key attribute of platform engineering. “It makes them more productive.

Engineering

Engineering Development DevOps Infrastructure

Embracing Resilience: The Power of Chaos Engineering

DZone

OCTOBER 20, 2023

But chaos engineering stands out for its exceptional capacity to identify weaknesses and proactively fortify systems. The rise of a new discipline known as chaos engineering is a result of the increased complexity combined with the constant demand for reliability and resilience.

Engineering

Engineering Strategy Network Technology

Platform engineering: Empowering key Kubernetes use cases with Dynatrace

Dynatrace

OCTOBER 30, 2023

Today, speed and DevOps automation are critical to innovating faster, and platform engineering has emerged as an answer to some of the most significant challenges DevOps teams are facing. It needs to be engineered properly as a product or service, and it needs automation, observability, and security in itself.”

Engineering

Engineering DevOps Innovation Storage

The keys to selecting a platform for end-to-end observability

Dynatrace

DECEMBER 2, 2024

On average, organizations use 10 different tools to monitor applications, infrastructure, and user experiences across these environments. Such fragmented approaches fall short of giving teams the insights they need to run IT and site reliability engineering operations effectively.

Artificial Intelligence

Artificial Intelligence DevOps Architecture Cloud

SLOs for Kubernetes clusters: Optimize resource utilization of Kubernetes clusters with service-level objectives

Dynatrace

NOVEMBER 11, 2024

Service-level objectives are typically used to monitor business-critical services and applications. However, due to the fact that they boil down selected indicators to single values and track error budget levels, they also offer a suitable way to monitor optimization processes while aligning on single values to meet overall goals.

Efficiency

Efficiency Best Practices Monitoring Cloud

Dynatrace KSPM: Transforming Kubernetes security and compliance

Dynatrace

DECEMBER 9, 2024

Manual approaches lack continuous monitoring, making them ill-equipped to prevent issues before they arise. Processes are time-intensive. Custom scripts and manual workflows demand substantial time and effort, creating inefficiencies. Reactivity. The skills gap creates inefficiencies.

Best Practices

Best Practices Benchmarking DevOps Efficiency

How observability, application security, and AI enhance DevOps and platform engineering maturity

Dynatrace

APRIL 18, 2024

DevOps and platform engineering are essential disciplines that provide immense value in the realm of cloud-native technology and software delivery. Observability of applications and infrastructure serves as a critical foundation for DevOps and platform engineering, offering a comprehensive view into system performance and behavior.

DevOps

DevOps Engineering Artificial Intelligence Infrastructure

Dynatrace joins the Microsoft Intelligent Security Association

Dynatrace

NOVEMBER 20, 2024

Combined with Microsoft Sentinel, Dynatrace automation and AI capabilities provide SecOps teams with deeper intelligence to detect attacks, vulnerabilities, audit logs, and problem events based on metrics, logs, and traces it collects from monitored environments. Runtime application protection.

Best Practices

Best Practices Innovation Azure Cloud

DevOps engineer tools: Deploy, test, evaluate, repeat

Dynatrace

DECEMBER 8, 2022

As cloud-native, distributed architectures proliferate, the need for DevOps technologies and DevOps platform engineers has increased as well. DevOps engineer tools can help ease the pressure as environment complexity grows. ” What does a DevOps platform engineer do? .” What are DevOps engineer tools and platforms.

DevOps

DevOps Engineering Testing Open Source

Mastering Prometheus: Unlocking Actionable Insights and Enhanced Monitoring in Kubernetes Environments

DZone

FEBRUARY 15, 2024

In the dynamic world of cloud-native technologies, monitoring and observability have become indispensable. However, managing its health and performance efficiently necessitates a robust monitoring solution. Kubernetes, the de-facto orchestration platform, offers scalability and agility.

Monitoring

Monitoring Open Source Metrics Scalability

Transform data into insights with Dynatrace Dashboards and Notebooks

Dynatrace

OCTOBER 16, 2024

In this blog post, we look at these enhancements, exploring methods for monitoring your Kubernetes environment and showcasing how modern dashboards can transform your data. These ready-made dashboards offer your platform engineers, who oversee Kubernetes environments, immediate and comprehensive data visibility.

Social Media

Social Media Metrics Network Analytics

Automating Success: Building a better developer experience with platform engineering

Dynatrace

FEBRUARY 12, 2024

When it comes to platform engineering, not only does observability play a vital role in the success of organizations’ transformation journeys—it’s key to successful platform engineering initiatives. The various presenters in this session aligned platform engineering use cases with the software development lifecycle.

Engineering

Engineering Development Infrastructure Cloud

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Dynatrace

JULY 15, 2024

Current synthetic capabilities Dynatrace Synthetic Monitoring is a powerful tool that provides insight into the health of your applications around the clock and as they’re perceived by your end users worldwide. Compared to other solutions I have tested, Dynatrace NAM monitors are the most configurable which is to my liking.

Availability

Availability Network Monitoring Infrastructure

Dynatrace + Metis: Helping developers & SREs solve Database issues with AI

Dynatrace

MARCH 5, 2025

Site Reliability Engineers (SREs) also face significant challenges in maintaining database reliability, ensuring performance, and preventing disruptions in highly dynamic and distributed environments. For SREs, this means better proactive monitoring, fewer database-related incidents, and greater stability in production environments.

Database

Database Development Tuning DevOps

Demo: Monitoring the OpenTelemetry demo app Astronomy Shop with Dynatrace Dashboards

Dynatrace

MARCH 17, 2025

The post Demo: Monitoring the OpenTelemetry demo app Astronomy Shop with Dynatrace Dashboards appeared first on Dynatrace news. If youre new to Dynatrace and want to try out the new experience of Distributed Tracing app, check out our free trial. If youre not yet a DPS customer, you can use the Dynatrace playground instead.

Monitoring

Monitoring Analytics Metrics Strategy

How automated workflows and multicloud automation can reduce engineering toil

Dynatrace

JUNE 5, 2023

But without automated workflows, IT professionals are finding it difficult to monitor, manage, secure, and troubleshoot applications at scale. In the era of in-house, internal-facing databases, manual monitoring and oversight were possible with minimal errors. Modern multicloud environments are powerful and agile, yet highly complex.

Engineering

Engineering Speed Monitoring Efficiency

Enhancing Kubernetes cluster management key to platform engineering success

Dynatrace

MARCH 29, 2024

Five of the most common include cluster instability, resource and cost management, security, observability, and stress on engineering teams. Engineering teams are overwhelmed with stuff to do.” “And as the cost is going down, we’re also monitoring to see what’s happening to application performance.”

Engineering

Engineering DevOps Operating System Cloud

Observability is expanding: Transforming complexity into business opportunity

Dynatrace

MARCH 5, 2025

Observability is no longer just for IT Ops Observability is no longer just about monitoring IT systems. Its not just for IT Ops but a critical capability for platform engineering, SREs, developers, as well as business and IT executives. Its aboutunderstandingand automating the entire digital ecosystem.

Innovation

Innovation Speed Efficiency Engineering

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

This standardization enhances adoption within the personalization stack, simplifies the system, and improves understanding and debuggability for engineers. They must also provide enough information for partner engineers to identify the problem with the underlying service in cases of system-level issues. there is a dedicated collector.

Traffic

Traffic Strategy Entertainment Innovation

How to Configure Istio, Prometheus and Grafana for Monitoring

DZone

AUGUST 29, 2023

One of the primary responsibilities of Site reliability engineers (SREs) in large organizations is to monitor the golden metrics of their applications, such as CPU utilization, memory utilization, latency, and throughput.

Monitoring

Monitoring Latency Network Infrastructure

Build resilient IT systems and manage regulatory requirements with compliance and resilience capabilities from Dynatrace

Dynatrace

JANUARY 15, 2025

It gives you visibility into which components are monitored and which are not and helps automate time-consuming compliance configuration checks. Discovery & Coverage helps prevent unexpected outages by detecting and remediating monitoring coverage gaps across your entire enterprise.

Systems

Systems DevOps Analytics Monitoring

Site Reliability Engineering

DZone

JANUARY 19, 2024

In the dynamic world of online services, the concept of site reliability engineering (SRE) has risen as a pivotal discipline, ensuring that large-scale systems maintain their performance and reliability.

Engineering

Engineering Tuning Software Engineering Internet

DevOps monitoring tools: How to drive DevOps efficiency

Dynatrace

MAY 8, 2023

With the world’s increased reliance on digital services and the organizational pressure on IT teams to innovate faster, the need for DevOps monitoring tools has grown exponentially. But when and how does DevOps monitoring fit into the process? And how do DevOps monitoring tools help teams achieve DevOps efficiency?

DevOps

DevOps Efficiency Monitoring Infrastructure

The state of site reliability engineering: SRE challenges and best practices in 2023

Dynatrace

NOVEMBER 14, 2023

Site reliability engineering (SRE) has become increasingly important to organizations looking to keep up with the rapid pace of digital transformation. Effective site reliability engineering requires enterprise-wide transformation Without a unified understanding of SRE practices, organizational silos can quickly form between departments.

Best Practices

Best Practices Engineering DevOps Software Engineering

How Netflix Content Engineering makes a federated graph searchable

The Netflix TechBlog

APRIL 12, 2022

By Alex Hutter , Falguni Jhaveri and Senthil Sayeebaba Over the past few years Content Engineering at Netflix has been transitioning many of its services to use a federated GraphQL platform. By transacting with a database which is monitored by a CDC connector that creates events, or b.

Engineering

Engineering Architecture Java Infrastructure

Reliability indicators that matter to your business: SLOs for all data types

Dynatrace

OCTOBER 31, 2024

In the coming weeks and months, we will add to the current collection of templates for synthetic monitoring, digital experience management measures, Kubernetes resource optimization, and infrastructure monitoring. However, all of these can be created today using DQL queries.

Metrics

Metrics Availability Monitoring Scalability

Network performance monitoring top of mind for CloudOps teams

Dynatrace

MAY 19, 2023

For cloud operations teams, network performance monitoring is central in ensuring application and infrastructure performance. Network performance monitoring core to observability For these reasons, network activity becomes a key data source in IT observability. But this approach merely perpetuates data silos and cloud complexity.

Network

Network Monitoring Performance Traffic

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Dynatrace

JANUARY 21, 2025

For example, if you’re monitoring network traffic and the average over the past 7 days is 500 Mbps, the threshold will adapt to this baseline. Using a seasonal baseline, you can monitor sales performance based on the past fourteen days. For instance, in a web shop, sales might vary by day of the week.

Traffic

Traffic Metrics Analytics Monitoring

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

For busy site reliability engineers, ensuring system reliability, scalability, and overall health is an imperative that’s getting harder to achieve in ever-expanding, cloud-native, container-based environments. Because of its adaptability, Prometheus has become an essential tool for observability engineering. Jolly good!

Metrics

Metrics Engineering Energy Tuning

Next-level batch job monitoring and alerting: Elevate performance and reliability

Dynatrace

SEPTEMBER 27, 2024

The urgency of monitoring these batch jobs can’t be overstated. Monitor batch jobs Monitoring is critical for batch jobs because it ensures that essential tasks, such as data processing and system maintenance, are completed on time and without errors. This blog post offers further details about DPL architect.

Monitoring

Monitoring Performance IoT Analytics

SRE Best Practices for Java Applications

DZone

MARCH 12, 2025

Site reliability engineering (SRE) plays a vital role in ensuring Java applications' high availability, performance, and scalability. This discipline merges software engineering and operations, aiming to create a robust infrastructure that supports seamless user experiences.

Best Practices

Best Practices Java Software Engineering Scalability

New continuous compliance requirements drive the need to converge observability and security

Dynatrace

DECEMBER 12, 2024

For executives, these directives present several challenges, including compliance complexity, resource allocation for continuous monitoring, and incident reporting. For example, for companies with over 1,000 DevOps engineers, the potential savings are between $3.4

Analytics

Analytics Government Efficiency Innovation

Title Launch Observability at Netflix Scale

The Netflix TechBlog

JANUARY 6, 2025

Challenge: Dont understand the cascading effects of their setup on these perceived black box personalization systems - Personalization System Engineers Role: Develop and operate the personalization systems. Defining Title Health provided a framework to monitor and optimize each titles lifecycle.

Scalability

Scalability Cache Engineering Systems

Enhance efficiency and compliance with automated AWS tag change triggers: A step-by-step guide

Dynatrace

APRIL 2, 2025

Proactive site reliability: Automated guardians can monitor the four golden signals , enabling proactive reliability measures. Step 6: Validate and monitor the setup Perform end-to-end validation by changing an EC2 tag again. With automation, SRG helps engineering teams achieve efficiency, improved compliance, and cost optimization.

AWS

AWS Efficiency Architecture Best Practices

HTTP monitors on the latest Dynatrace platform extend insights into the health of your API endpoints and simplify test management

Part 2: A Survey of Analytics Engineering Work at Netflix

Trending Sources

Cost-Aware Resilience: Implementing Chaos Engineering Without Breaking the Budget

O11y Guide: Finding Observability and DevEx Tranquility With Platform Engineering

Catching up with OpenTelemetry in 2025

Build systems more reliably with Dynatrace: Chaos Engineering

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Automate digital excellence with Dynatrace Synthetic Monitoring and Workflows

A Kubernetes platform engineering strategy tames Kubernetes complexity

What Is a Performance Engineer and How to Become One: Part 1

How platform engineering and IDP observability can accelerate developer velocity

Embracing Resilience: The Power of Chaos Engineering

Platform engineering: Empowering key Kubernetes use cases with Dynatrace

The keys to selecting a platform for end-to-end observability

SLOs for Kubernetes clusters: Optimize resource utilization of Kubernetes clusters with service-level objectives

Dynatrace KSPM: Transforming Kubernetes security and compliance

How observability, application security, and AI enhance DevOps and platform engineering maturity

Dynatrace joins the Microsoft Intelligent Security Association

DevOps engineer tools: Deploy, test, evaluate, repeat

Mastering Prometheus: Unlocking Actionable Insights and Enhanced Monitoring in Kubernetes Environments

Transform data into insights with Dynatrace Dashboards and Notebooks

Automating Success: Building a better developer experience with platform engineering

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Dynatrace + Metis: Helping developers & SREs solve Database issues with AI

Demo: Monitoring the OpenTelemetry demo app Astronomy Shop with Dynatrace Dashboards

How automated workflows and multicloud automation can reduce engineering toil

Enhancing Kubernetes cluster management key to platform engineering success

Observability is expanding: Transforming complexity into business opportunity

Title Launch Observability at Netflix Scale

How to Configure Istio, Prometheus and Grafana for Monitoring

Build resilient IT systems and manage regulatory requirements with compliance and resilience capabilities from Dynatrace

Site Reliability Engineering

DevOps monitoring tools: How to drive DevOps efficiency

The state of site reliability engineering: SRE challenges and best practices in 2023

How Netflix Content Engineering makes a federated graph searchable

Reliability indicators that matter to your business: SLOs for all data types

Network performance monitoring top of mind for CloudOps teams

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Next-level batch job monitoring and alerting: Elevate performance and reliability

SRE Best Practices for Java Applications

New continuous compliance requirements drive the need to converge observability and security

Title Launch Observability at Netflix Scale

Enhance efficiency and compliance with automated AWS tag change triggers: A step-by-step guide

Stay Connected