Monitoring and Software Engineering - Technology Performance Pulse

Error Monitoring vs Defect Monitoring: Key Differences

DZone

AUGUST 30, 2021

Identifying defects and troubleshooting for their root cause is one of the important but painful tasks in software engineering and essential to maintaining good quality software. To help them in the quest for improving MTTR, software developers use application monitoring tools.

Monitoring

Monitoring Software Engineering Engineering Software

SRE Best Practices for Java Applications

DZone

MARCH 12, 2025

Site reliability engineering (SRE) plays a vital role in ensuring Java applications' high availability, performance, and scalability. This discipline merges software engineering and operations, aiming to create a robust infrastructure that supports seamless user experiences.

Best Practices

Best Practices Java Software Engineering Scalability

AWS observability: AWS monitoring best practices for resiliency

Dynatrace

NOVEMBER 22, 2021

These resources generate vast amounts of data in various locations, including containers, which can be virtual and ephemeral, thus more difficult to monitor. These challenges make AWS observability a key practice for building and monitoring cloud-native applications. AWS monitoring best practices. Automate monitoring tasks.

Best Practices

Best Practices AWS Monitoring Serverless

Kubernetes Observability: Lessons Learned From Running Kubernetes in Production

DZone

OCTOBER 1, 2024

In recent years, observability has re-emerged as a critical aspect of DevOps and software engineering in general, driven by the growing complexity and scale of modern, cloud-native applications.

Software Engineering

Software Engineering DevOps Cloud Architecture

Open-Sourcing a Monitoring GUI for Metaflow

The Netflix TechBlog

OCTOBER 27, 2021

Open-Sourcing a Monitoring GUI for Metaflow, Netflix’s ML Platform tl;dr Today, we are open-sourcing a long-awaited GUI for Metaflow. The Metaflow GUI allows data scientists to monitor their workflows in real-time, track experiments, and see detailed logs and results for every executed task.

Open Source

Open Source Monitoring Scalability Code

5 powerful use cases beyond debugging for Dynatrace Live Debugger

Dynatrace

MARCH 25, 2025

Performance benchmarking Performance benchmarking is one of the unresolved mysteries of software engineering. Maybe you want to monitor performance under different system loads. Live snapshot includes variables, process, stack trace, and tracing information. In many ways, it’s more of an art than a science.

Benchmarking

Benchmarking Code Open Source Engineering

SRE Opportunities Grow as Businesses Reopen and Reacclimate

DZone

AUGUST 4, 2021

Take one look at LinkedIn right now, and you’ll notice some of the most in-demand jobs include application developers and software engineers. After a deeper dive, you’ll find many companies across multiple industries are looking for site reliability engineers or SREs.

Software Engineering

Software Engineering Speed Engineering Monitoring

A New Era Has Come, and So Must Your Database Observability

DZone

SEPTEMBER 28, 2023

Software engineers didn’t need to understand the database, and even if they owned it, it was just a single component of the system. Guaranteeing software quality was much easier because the deployment happened rarely, and things could be captured on time via automated tests. Reasoning about applications is now much harder.

Database

Database Software Engineering Software Software

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

Implement proactive monitoring for each of these endpoints. Key Features Proactive monitoring through scheduled collectors jobs Our Title Health microservice runs a scheduled collector job every 30 minutes for most of our personalization stack. Track real-time title impressions from the NetflixUI. there is a dedicated collector.

Traffic

Traffic Strategy Entertainment Innovation

Software engineering for machine learning: a case study

The Morning Paper

JULY 7, 2019

Software engineering for machine learning: a case study Amershi et al., More specifically, we’ll be looking at the results of an internal study with over 500 participants designed to figure out how product development and software engineering is changing at Microsoft with the rise of AI and ML. ICSE’19.

Software Engineering

Software Engineering Engineering Software Software

How to leverage mobile analytics to ensure crash-free, five-star mobile applications

Dynatrace

JUNE 28, 2022

After investigating, the software engineering team discovered that it wasn’t leveraging application performance monitoring (APM) tooling data to its full potential. The team constructed dashboards to monitor their progress toward achieving those key performance indicators (KPIs) over time.

Mobile

Mobile Analytics Best Practices Software Engineering

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dynatrace

NOVEMBER 7, 2023

Platform engineering is on the rise. According to leading analyst firm Gartner, “80% of software engineering organizations will establish platform teams as internal providers of reusable services, components, and tools for application delivery…” by 2026. Monitoring-as-code can also be configured in GitOps fashion.

Engineering

Engineering DevOps Best Practices Infrastructure

Up your quality and agility factor – using automation to build “performance-as-a-self-service”

Dynatrace

MARCH 3, 2020

For software engineering teams, this demand means not only delivering new features faster but ensuring quality, performance, and scalability too. One way to apply improvements is transforming the way application performance engineering and testing is done. Industry apps explosion.

Performance

Performance Education Innovation Software Architecture

What is DevOps orchestration? And why invest in orchestration tools?

Dynatrace

DECEMBER 5, 2022

Cloud providers enable faster delivery of new services but require new practices, including a need for closely monitoring costs. One key advantage of this integration is a single point of access to monitoring, logging, and other information needed to keep software development operations running efficiently.

DevOps

DevOps Virtualization Best Practices Innovation

How Red Hat and Dynatrace intelligently automate your production environment

Dynatrace

MAY 6, 2024

Problem remediation is too time-consuming According to the DevOps Automation Pulse Survey 2023 , on average, a software engineer takes nine hours to remediate a problem within a production application. With that, Software engineers, SREs, and DevOps can define a broad automation and remediation mapping.

DevOps

DevOps Software Engineering Games Java

Connect your software with the right people: Ownership drives effective collaboration

Dynatrace

MARCH 28, 2023

To address this need, Dynatrace now provides automation for DevSecOps collaboration that associates ownership information with monitored services to further minimize mean-time-to-restore (MTTR). Associating ownership-team details with monitored services is flexible. team structure, or links to external resources such as a wiki.

Software

Software Software Monitoring Software Engineering

How platform engineering and IDP observability can accelerate developer velocity

Dynatrace

MARCH 6, 2024

During a breakout session at the Dynatrace Perform 2024 conference, Dynatrace DevSecOps activist Andreas Grabner and staff engineer Adam Gardner demonstrated how to use observability to monitor an IDP for key performance indicators (KPIs). Intelligent monitoring is also crucial. “It makes them more productive.

Engineering

Engineering Development DevOps Infrastructure

Protect your organization against zero-day vulnerabilities

Dynatrace

AUGUST 3, 2022

Techniques such as statistics-based monitoring and behavior-based monitoring are also possible. Statistics-based monitoring is when organizations take statistics from exploits that vendors have detected and feed them into a system to learn and identify these attacks. Application logs are a good data source for this method.

Java

Java Traffic Benchmarking Strategy

Revolutionizing Observability: How AI-Driven Observability Unlocks a New Era of Efficiency

DZone

FEBRUARY 12, 2024

It is a crucial aspect of distributed systems, as it allows stakeholders such as Software Engineers, Site Reliability Engineers , and Product Managers to troubleshoot issues with their service, monitor performance, and gain insights into the software system's behavior.

Efficiency

Efficiency Software Engineering Monitoring Metrics

Site Reliability Engineering

DZone

JANUARY 19, 2024

Originating from the complex operational challenges faced by large internet companies, SRE incorporates aspects of software engineering and applies them to infrastructure and operations problems.

Engineering

Engineering Tuning Software Engineering Internet

Bringing AV1 Streaming to Netflix Members’ TVs

The Netflix TechBlog

NOVEMBER 9, 2021

To maximize the impact of AV1 encoding while minimizing associated costs, the Data Science and Engineering team devised a catalog rollout strategy for AV1 that took into consideration title popularity and a number of other factors. Challenge 4: How do we continuously monitor AV1 streaming?

Media

Media Open Source Software Engineering Efficiency

Dynatrace extends distributed tracing for serverless on AWS Lambda (GA)?

Dynatrace

JANUARY 19, 2021

Other tools in the market for monitoring AWS Lambda traces can’t deliver real end-to-end visibility from the end-user perspective across all?moving – Robert Trueman, Head of Software Engineering at CDL. extension provides insights into traces and metrics from each monitored Lambda function. Real User Monitoring.

Lambda

Lambda Serverless AWS Mobile

Architected for resiliency: How Dynatrace withstands data center outages

Dynatrace

JUNE 15, 2021

The email walked through how our Dynatrace self-monitoring notified users of the outage but automatically remediated the problem thanks to our platform’s architecture. There are several ways Dynatrace monitors and alerts on the impact of service disruption. Ready to learn more? Then read on! Fact #1: AWS EC2 outage properly documented.

AWS

AWS Traffic Architecture Azure

OpenTelemetry observability and Dynatrace deliver actionable answers at scale

Dynatrace

AUGUST 21, 2023

If a microservice falls in the forest and all your monitoring solutions report it differently, can operators accurately trace what happened and automate a response? Different monitoring point solutions, such as Jaeger, Zipkin, Logstash, Fluentd, and StatsD, each have their own way of observing and recording such an event.

Open Source

Open Source Analytics Lambda Metrics

Extend the AI and automation core of Dynatrace with host extensions to resolve infrastructure problems

Dynatrace

MAY 13, 2020

A single instance of OneAgent can handle the monitoring of many types of entities , including servers, applications, services, databases, and more. But what if a particular metric is crucial for your monitoring needs and it isn’t there? Dynatrace news. GPU-based machine learning system crashes, and you don’t know why?

Infrastructure

Infrastructure Metrics Monitoring Software Engineering

Auto-adaptive thresholds for AI-driven quality gating

Dynatrace

JUNE 4, 2024

Build an umbrella for Development and Operations In modern software engineering, the discipline of platform engineering delivers DevSecOps practices to developers to bridge the gaps between development, security, and operations and enhance the developer experience. However, other data formats, like logs, can also be employed.

Metrics

Metrics Engineering Code Tuning

Site reliability engineering: 5 things you need to know

Dynatrace

FEBRUARY 4, 2021

What is site reliability engineering? Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. Dynatrace news.

Engineering

Engineering DevOps Government Latency

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

MAY 31, 2023

How site reliability engineering affects organizations’ bottom line SRE applies the disciplines of software engineering to infrastructure management, both on-premises and in the cloud. But the transition to SRE maturity is not always easy.

Best Practices

Best Practices DevOps Latency Metrics

Conducting log analysis with an observability platform and full data context

Dynatrace

APRIL 20, 2023

With more automated approaches to log monitoring and log analysis, however, organizations can gain visibility into their applications and infrastructure efficiently and with greater precision—even as cloud environments grow.

Analytics

Analytics Infrastructure Storage Architecture

Automating Success: Building a better developer experience with platform engineering

Dynatrace

FEBRUARY 12, 2024

Check out the following use cases to learn how to drive innovation from development to production efficiently and securely with platform engineering observability. Using Dynatrace, teams can directly access their synthetic monitors and drill down into locations where, for example, execution failed because of local or global outages.

Engineering

Engineering Development Infrastructure Cloud

The state of site reliability engineering: SRE challenges and best practices in 2023

Dynatrace

NOVEMBER 14, 2023

Customer empathy is key to a fully optimized site reliability engineering practice Software engineering can often be an impersonal discipline. A key component of a proactive SRE model involves the implementation of end-to-end monitoring, including on systems that are not directly owned by the SRE team’s organization.

Best Practices

Best Practices Engineering DevOps Software Engineering

Watchman: monitoring dependency conflicts for Python library ecosystem

The Morning Paper

SEPTEMBER 20, 2020

Watchman: monitoring dependency conflicts for Python library ecosystem Wang et al., Watchman was used for online monitoring of PyPI from 11th July 2019, detecting and predicting 189 further dependency conflict issues in the period to the 16th August. There are more than 1.4M Python libraries in the PyPI repository.

Monitoring

Monitoring Java C++ Strategy

Best PostgreSQL GUI [2024]

Scalegrid

OCTOBER 18, 2024

A dashboard for monitoring activities such as database locks, connected sessions, and prepared transactions for multiple servers. They come with features such as query analysis, performance monitoring, and advanced SQL refactoring capabilities.

Open Source

Open Source Database Cloud Operating System

Automated observability, security, and reliability at scale

Dynatrace

JULY 18, 2023

To handle this challenge, enterprises need to automate and streamline the onboarding and lifecycle of tool configurations in the software development processes, including aspects of observability, security, alerting, and remediation. Development teams must set up tailored configurations for each tool and component they’re responsible for.

Best Practices

Best Practices Code Infrastructure Latency

Site reliability engineering: 5 things to you need to know

Dynatrace

FEBRUARY 4, 2021

Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. This can be anything from adjusting monitoring and alerting to making code changes in production.

Engineering

Engineering DevOps Government Latency

The 737Max and Why Software Engineers Might Want to Pay Attention

J. Paul Reed

MARCH 14, 2019

The 737Max and Why Software Engineers Might Want to Pay Attention As someone with a bit of a reputation for talking about aviation and software development and operations , I’ve been asked about the 737Max repeatedly over the past week. To cope, they added additional monitoring and control systems.

Software Engineering

Software Engineering Engineering Software Software

Scale DevOps and SRE with open source Keptn

Dynatrace

APRIL 18, 2022

Software engineer Taras Tsugrii of Meta (formerly Facebook) paid Keptn a high compliment, saying it feels like a reference implementation of Google’s SRE principles , which are the search giant’s techniques for ensuring the integrity of its sites and services. Dynatrace developed and released Keptn to open source in 2020.

Open Source

Open Source DevOps Cloud Metrics

Key Application Performance Metrics From the Viewpoint of a Statistician-Turned-Developer

DZone

MAY 15, 2020

Now that you’ve deployed your code, it’s time to monitor it, collect data, and analyze your metrics. Without application performance monitoring in place, you can’t accurately determine how well things are going. The first step to gather this type of data is application monitoring. Your job is done, right? Is the app performant?

Metrics

Metrics Performance Development Monitoring

What is application security? And why it needs a new approach

Dynatrace

MARCH 17, 2021

Application security is a software engineering term that refers to several different types of security practices designed to ensure applications do not contain vulnerabilities that could allow illicit access to sensitive data, unauthorized code modification, or resource hijacking. Dynatrace news. So, why is all this important?

Open Source

Open Source Cloud Games Java

Application observability meets developer observability: Unlock a 360º view of your environment

Dynatrace

NOVEMBER 6, 2023

In a recent webinar , Dynatrace DevOps activist Andi Grabner and senior software engineer Yarden Laifenfeld explored developer observability. With traditional monitoring tools, the granular data that developers require typically involves manual preparation. But developers need code-level visibility and code-level data.”

Development

Development DevOps Programming Cloud

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

4:45pm-5:45pm NFX 209 File system as a service at Netflix Kishore Kasi , Senior Software Engineer Abstract : As Netflix grows in original content creation, its need for storage is also increasing at a rapid pace. By watching applications for anomalous actions, security and operations teams can monitor unusual and erroneous behavior.

AWS

AWS Entertainment Open Source Benchmarking

Escaping POC Purgatory: Evaluation-Driven Development for AI Systems

O'Reilly

MARCH 25, 2025

Weve seen this across dozens of companies, and the teams that break out of this trap all adopt some version of Evaluation-Driven Development (EDD), where testing, monitoring, and evaluation drive every decision from the start. Were also betting that this will be a time of software development flourishing.

Systems

Systems Development Tuning Monitoring

Dynatrace Perform 2024 Guide: Deriving business value from AI data analysis

Dynatrace

JANUARY 23, 2024

The discipline shows promise: According to Gartner, 80% of software engineering organizations “will establish platform teams as internal providers of reusable services, components, and tools for application delivery” by 2026.

Performance

Performance DevOps Innovation Energy

Autonomous Cloud Enablement aka Scaling NoOps via Self-Service

Dynatrace

FEBRUARY 6, 2020

A key to success from the start was that not only we did build Dynatrace, but Anita’s team was also always “Customer 0” of Dynatrace because clearly we were in need of a world class monitoring platform that gave us visibility into our deployments in dev, staging and production. Wave two: NoOps to ensure stability!

Cloud

Cloud DevOps Engineering Speed

Error Monitoring vs Defect Monitoring: Key Differences

SRE Best Practices for Java Applications

Trending Sources

AWS observability: AWS monitoring best practices for resiliency

Kubernetes Observability: Lessons Learned From Running Kubernetes in Production

Open-Sourcing a Monitoring GUI for Metaflow

5 powerful use cases beyond debugging for Dynatrace Live Debugger

SRE Opportunities Grow as Businesses Reopen and Reacclimate

A New Era Has Come, and So Must Your Database Observability

Title Launch Observability at Netflix Scale

Software engineering for machine learning: a case study

How to leverage mobile analytics to ensure crash-free, five-star mobile applications

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Up your quality and agility factor – using automation to build “performance-as-a-self-service”

What is DevOps orchestration? And why invest in orchestration tools?

How Red Hat and Dynatrace intelligently automate your production environment

Connect your software with the right people: Ownership drives effective collaboration

How platform engineering and IDP observability can accelerate developer velocity

Protect your organization against zero-day vulnerabilities

Revolutionizing Observability: How AI-Driven Observability Unlocks a New Era of Efficiency

Site Reliability Engineering

Bringing AV1 Streaming to Netflix Members’ TVs

Dynatrace extends distributed tracing for serverless on AWS Lambda (GA)?

Architected for resiliency: How Dynatrace withstands data center outages

OpenTelemetry observability and Dynatrace deliver actionable answers at scale

Extend the AI and automation core of Dynatrace with host extensions to resolve infrastructure problems

Auto-adaptive thresholds for AI-driven quality gating

Site reliability engineering: 5 things you need to know

Site reliability done right: 5 SRE best practices that deliver on business objectives

Conducting log analysis with an observability platform and full data context

Automating Success: Building a better developer experience with platform engineering

The state of site reliability engineering: SRE challenges and best practices in 2023

Watchman: monitoring dependency conflicts for Python library ecosystem

Best PostgreSQL GUI [2024]

Automated observability, security, and reliability at scale

Site reliability engineering: 5 things to you need to know

The 737Max and Why Software Engineers Might Want to Pay Attention

Scale DevOps and SRE with open source Keptn

Key Application Performance Metrics From the Viewpoint of a Statistician-Turned-Developer

What is application security? And why it needs a new approach

Application observability meets developer observability: Unlock a 360º view of your environment

Netflix at AWS re:Invent 2019

Escaping POC Purgatory: Evaluation-Driven Development for AI Systems

Dynatrace Perform 2024 Guide: Deriving business value from AI data analysis

Autonomous Cloud Enablement aka Scaling NoOps via Self-Service

Stay Connected