Engineering and Infrastructure - Technology Performance Pulse

Stress Testing for Resilience in Modern Infrastructure

DZone

DECEMBER 24, 2024

Indeed, chaos engineering is an innovation concerning testing infrastructure resilience these days. Therefore, no one can underestimate the role of stress testing in ensuring that the systems are resilient against unfortunate events and failures.

Infrastructure

Infrastructure Testing Innovation Engineering

Sustainability: Thoughts from a software engineer

Dynatrace

MARCH 17, 2025

How to achieve sustainable IT practices Use observability tools The first step in driving improvements is to obtain a comprehensive view of your IT infrastructure’s climate impact. Platform engineers can set defaults for development teams, such as the number of replicas a service should have or whether it scales automatically.

Software Engineering

Software Engineering Engineering Software Software

What Is Platform Engineering?

DZone

FEBRUARY 6, 2024

Platform engineering is the creation and management of foundational infrastructure and automated processes, incorporating principles like abstraction, automation, and self-service, to empower development teams, optimize resource utilization, ensure security, and foster collaboration for efficient and scalable software development.

Engineering

Engineering Scalability Infrastructure Efficiency

What is platform engineering?

Dynatrace

NOVEMBER 3, 2023

In response to this shift, platform engineering is growing in popularity. The practice of platform engineering has evolved alongside the increasing complexity of cloud environments. A platform encompasses a set of tools, services, and infrastructure that enables developers to build, test, and deploy software applications.

Engineering

Engineering DevOps Software Engineering Scalability

Next generation Dynatrace Davis AI becomes the default causation engine

Dynatrace

NOVEMBER 26, 2019

Back during Perform 2019, we introduced the next generation of the Dynatrace AI causation engine , also known as Davis. becomes the default causation engine and will replace the previous version as the default for all new environments. as the default AI engine. AI causation engine. All existing Davis 1.0

Engineering

Engineering Serverless Metrics Code

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dynatrace

NOVEMBER 7, 2023

Platform engineering is on the rise. According to leading analyst firm Gartner, “80% of software engineering organizations will establish platform teams as internal providers of reusable services, components, and tools for application delivery…” by 2026. Automation, automation, automation.

Engineering

Engineering DevOps Best Practices Infrastructure

A Kubernetes platform engineering strategy tames Kubernetes complexity

Dynatrace

JULY 25, 2024

In fact, 76% of technology leaders say the dynamic nature of Kubernetes makes it more difficult to maintain visibility of their infrastructure compared with traditional technology stacks. Taking a strategic Kubernetes platform engineering approach Spier noted that keeping Kubernetes simple requires a strategic approach. billion. “We

Strategy

Strategy Engineering Open Source Java

Bringing Software Engineering Rigor to Data

DZone

FEBRUARY 20, 2023

In software engineering, we've learned that building robust and stable applications has a direct correlation with overall organization performance. The data community is striving to incorporate the core concepts of engineering rigor found in software communities but still has further to go.

Software Engineering

Software Engineering Engineering Software Software

How platform engineering and IDP observability can accelerate developer velocity

Dynatrace

MARCH 6, 2024

As organizations look to expand DevOps maturity, improve operational efficiency, and increase developer velocity, they are embracing platform engineering as a key driver. The goal is to abstract away the underlying infrastructure’s complexities while providing a streamlined and standardized environment for development teams.

Engineering

Engineering Development DevOps Infrastructure

How observability, application security, and AI enhance DevOps and platform engineering maturity

Dynatrace

APRIL 18, 2024

DevOps and platform engineering are essential disciplines that provide immense value in the realm of cloud-native technology and software delivery. Observability of applications and infrastructure serves as a critical foundation for DevOps and platform engineering, offering a comprehensive view into system performance and behavior.

DevOps

DevOps Engineering Artificial Intelligence Infrastructure

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

To enhance reliability, testing the software under these conditions is crucial to prepare for potential issues by leveraging chaos engineering or similar tools. Chaos engineering is a practice that extends beyond traditional failure testing by identifying unpredictable issues. It forms the cornerstone of chaos engineering experiments.

Engineering

Engineering Systems Latency Metrics

Hybrid cloud infrastructure explained: Weighing the pros, cons, and complexities

Dynatrace

JUNE 29, 2022

More than 90% of enterprises now rely on a hybrid cloud infrastructure to deliver innovative digital services and capture new markets. That’s because cloud platforms offer flexibility and extensibility for an organization’s existing infrastructure. Dynatrace news. With public clouds, multiple organizations share resources.

Infrastructure

Infrastructure Cloud Azure AWS

Platform engineering: Empowering key Kubernetes use cases with Dynatrace

Dynatrace

OCTOBER 30, 2023

Today, speed and DevOps automation are critical to innovating faster, and platform engineering has emerged as an answer to some of the most significant challenges DevOps teams are facing. It needs to be engineered properly as a product or service, and it needs automation, observability, and security in itself.”

Engineering

Engineering DevOps Innovation Storage

The platform engineer role: A game-changer or just hype?

Dynatrace

SEPTEMBER 21, 2023

Site reliability engineering first emerged to address cloud computing’s new performance needs. Today, the platform engineer role is gaining speed as the newest byproduct of scaling DevOps in the emerging but complex cloud-native world. Understanding the platform engineer role DevOps is a constantly evolving discipline.

Games

Games Engineering DevOps Education

DevOps engineer tools: Deploy, test, evaluate, repeat

Dynatrace

DECEMBER 8, 2022

As cloud-native, distributed architectures proliferate, the need for DevOps technologies and DevOps platform engineers has increased as well. DevOps engineer tools can help ease the pressure as environment complexity grows. ” What does a DevOps platform engineer do? .” What are DevOps engineer tools and platforms.

DevOps

DevOps Engineering Testing Open Source

Key Elements of Site Reliability Engineering (SRE)

DZone

MARCH 14, 2023

Site Reliability Engineering (SRE) is a systematic and data-driven approach to improving the reliability, scalability, and efficiency of systems. It combines principles of software engineering, operations, and quality assurance to ensure that systems meet performance goals and business objectives.

Engineering

Engineering Software Engineering Scalability Efficiency

Automating Success: Building a better developer experience with platform engineering

Dynatrace

FEBRUARY 12, 2024

When it comes to platform engineering, not only does observability play a vital role in the success of organizations’ transformation journeys—it’s key to successful platform engineering initiatives. The various presenters in this session aligned platform engineering use cases with the software development lifecycle.

Engineering

Engineering Development Infrastructure Cloud

Five-nines availability: Always-on infrastructure delivers system availability during the holidays’ peak loads

Dynatrace

NOVEMBER 22, 2022

Five-nines availability has long been the goal of site reliability engineers (SREs) to provide system availability that is “always on.” Site reliability engineering teams often measure system availability in percentages in the pursuit of 100% uptime. What is always-on infrastructure?

Infrastructure

Infrastructure Availability Systems Retail

Building Resilience With Chaos Engineering and Litmus

DZone

JUNE 15, 2023

These incidents underscore the wide-ranging causes of outages in microservices architectures, encompassing configuration errors, database issues, infrastructure scaling failures, and code problems.

Engineering

Engineering Architecture Scalability Google

Enhancing Kubernetes cluster management key to platform engineering success

Dynatrace

MARCH 29, 2024

Five of the most common include cluster instability, resource and cost management, security, observability, and stress on engineering teams. Engineering teams are overwhelmed with stuff to do.” The post Enhancing Kubernetes cluster management key to platform engineering success appeared first on Dynatrace news.

Engineering

Engineering DevOps Operating System Cloud

Path to NoOps part 2: How infrastructure as code makes cloud automation attainable—and repeatable—at scale

Dynatrace

NOVEMBER 29, 2022

Infrastructure as code is a way to automate infrastructure provisioning and management. In this blog, I explore how Dynatrace has made cloud automation attainable—and repeatable—at scale by embracing the principles of infrastructure as code. Infrastructure-as-code. But how does it work in practice?

Infrastructure

Infrastructure Code Cloud DevOps

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Stream processing enables software engineers to model their applications’ business logic as high-level representations in a directed acyclic graph without explicitly defining a physical execution plan. Failures can occur unpredictably across various levels, from physical infrastructure to software layers.

Engineering

Engineering Tuning Latency Open Source

The keys to selecting a platform for end-to-end observability

Dynatrace

DECEMBER 2, 2024

On average, organizations use 10 different tools to monitor applications, infrastructure, and user experiences across these environments. Such fragmented approaches fall short of giving teams the insights they need to run IT and site reliability engineering operations effectively.

Artificial Intelligence

Artificial Intelligence DevOps Architecture Cloud

Dynatrace joins the Microsoft Intelligent Security Association

Dynatrace

NOVEMBER 20, 2024

This latest integration with Microsoft Sentinel expands our partnership, providing joint customers with a holistic view of their entire cloud environment; from application to infrastructure, data, and security. “As The Davis AI engine automatically and continuously delivers actionable insights based on an environment’s current state.

Best Practices

Best Practices Innovation Azure Cloud

What Is Platform Engineering? How To Get Started

DZone

APRIL 21, 2023

Platform engineering is the discipline of building and maintaining a self-service platform for developers. The goal of platform engineering is to improve developer experience (DX) by standardizing and automating most of the tasks in the software delivery lifecycle (SDLC).

Engineering

Engineering Infrastructure Efficiency Cloud

How Netflix Content Engineering makes a federated graph searchable

The Netflix TechBlog

APRIL 12, 2022

By Alex Hutter , Falguni Jhaveri and Senthil Sayeebaba Over the past few years Content Engineering at Netflix has been transitioning many of its services to use a federated GraphQL platform. it began to power a significant portion of the user experience for many applications within Content Engineering.

Engineering

Engineering Architecture Java Infrastructure

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Dynatrace

JULY 15, 2024

Whether necessary as part of deep root-cause analyses of issues faced by your users that impact your business or if you’re an engineer responsible for the infrastructure hosting your applications and network paths. You want to be able to answer questions like these: What is responsible for application slowdown?

Availability

Availability Network Monitoring Infrastructure

Site Reliability Engineering

DZone

JANUARY 19, 2024

In the dynamic world of online services, the concept of site reliability engineering (SRE) has risen as a pivotal discipline, ensuring that large-scale systems maintain their performance and reliability.

Engineering

Engineering Tuning Software Engineering Internet

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Dynatrace

FEBRUARY 6, 2025

With Dashboards , you can monitor business performance, user interactions, security vulnerabilities, IT infrastructure health, and so much more, all in real time. Even if infrastructure metrics aren’t your thing, you’re welcome to join us on this creative journey simply swap out the suggested metrics for ones that interest you.

Metrics

Metrics Infrastructure Monitoring Best Practices

SRE Best Practices for Java Applications

DZone

MARCH 12, 2025

Site reliability engineering (SRE) plays a vital role in ensuring Java applications' high availability, performance, and scalability. This discipline merges software engineering and operations, aiming to create a robust infrastructure that supports seamless user experiences.

Best Practices

Best Practices Java Software Engineering Scalability

Ingest and enrich AWS Security Hub findings with Dynatrace

Dynatrace

FEBRUARY 20, 2025

AWS Security Hub findings AWS Security Hub provides a great way of aggregating security findings, especially those related to cloud infrastructure. It can also be challenging to construct a full view of one’s security exposures when analyzing security findings across various environments and cloud infrastructures.

AWS

AWS Efficiency Infrastructure Cloud

Viking Enterprise Solutions: Empowering Modern Data Infrastructure

DZone

JULY 10, 2024

In today's rapidly evolving technological landscape, developers, engineers, and architects face unprecedented challenges in managing, processing, and deriving value from vast amounts of data.

Infrastructure

Infrastructure Hardware Innovation Efficiency

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

For busy site reliability engineers, ensuring system reliability, scalability, and overall health is an imperative that’s getting harder to achieve in ever-expanding, cloud-native, container-based environments. Because of its adaptability, Prometheus has become an essential tool for observability engineering. Jolly good!

Metrics

Metrics Engineering Energy Tuning

Reliability indicators that matter to your business: SLOs for all data types

Dynatrace

OCTOBER 31, 2024

In the coming weeks and months, we will add to the current collection of templates for synthetic monitoring, digital experience management measures, Kubernetes resource optimization, and infrastructure monitoring. However, all of these can be created today using DQL queries.

Metrics

Metrics Availability Monitoring Scalability

Dynatrace supports newly launched GKE Arm clusters

Dynatrace

JULY 13, 2022

Today, Google announced virtual machines (VMs) based on the Arm architecture on Compute Engine called Tau T2A , which are optimized for cost-effective performance for scale-out workloads, as well as GKE Arm. Dynatrace’s AI engine, Davis® , uses this map to automatically identify and prioritize anomalies, and enable automatic remediation.

Google

Google Infrastructure Architecture Engineering

Dynatrace observability now available for Red Hat OpenShift on IBM Z and LinuxONE mainframes

Dynatrace

JULY 24, 2024

Dynatrace full stack observability for Red Hat OpenShift Dynatrace enhances software quality and operational efficiency, which drives innovation by unifying application, operation, and platform engineering teams on a single platform. Learn more about the new Kubernetes Experience for Platform Engineering.

Availability

Availability Infrastructure Metrics Monitoring

Introducing Configurable Metaflow

The Netflix TechBlog

DECEMBER 19, 2024

Subsequent versions of the model will result from experimenting with hyper parameters, tweaking feature engineering, or conducting feature diets. training Below is a simple Metaflow pipeline that fetches data, executes feature engineering, and trains a LinearRegression model. cluster=sandbox, workflow.id=demo.branch_demox.EXP_01.training

Best Practices

Best Practices Cache Metrics Code

Automate digital excellence with Dynatrace Synthetic Monitoring and Workflows

Dynatrace

JULY 18, 2024

Navigate digital infrastructure complexity In today’s rapidly evolving digital environment, organizations face increasing pressure from customers and competitors to deliver faster, more secure innovations. Use case: Digital infrastructure change The problem is not always in the application.

Monitoring

Monitoring DevOps Infrastructure Games

Overcoming Challenges and Best Practices for Data Migration From On-Premise to Cloud

DZone

MARCH 29, 2023

With the rapid adoption of cloud computing , businesses are moving their IT infrastructure to the cloud. The article will also explore the role of data engineering in ensuring successful data transfer and integration and different approaches to data migration.

Best Practices

Best Practices Cloud Data Engineering Storage

Achieving High Availability in CI/CD With Observability

DZone

MARCH 5, 2024

Forbes estimates that cloud budgets will break all previous records as businesses will spend over $1 trillion on cloud computing infrastructure in 2024. Complementing these practices is site reliability engineering (SRE), a discipline ensuring system reliability, performance, and scalability.

Availability

Availability DevOps Infrastructure Scalability

API Gateway vs. Istio Service Mesh

DZone

JUNE 22, 2023

Architects, DevOps, and cloud engineers are gradually trying to understand which is better to continue the journey with: the API gateway, or adopt an entirely new service mesh technology?

DevOps

DevOps Architecture Innovation Infrastructure

Dynatrace expands root cause analysis to Kubernetes with Davis AI

Dynatrace

SEPTEMBER 26, 2022

With the release of Dynatrace version 1.249, the Davis® AI Causation Engine provides broader support to subsequent Kubernetes issues and their impact on business continuity like: Automated Kubernetes root cause analysis. Such context is easy to understand using the Dynatrace Davis AI engine. Incidents are harder to solve.

Storage

Storage Engineering Traffic Infrastructure

How a data lakehouse brings data insights to life

Dynatrace

OCTOBER 4, 2022

For IT infrastructure managers and site reliability engineers, or SREs , logs provide a treasure trove of data. These traditional approaches to log monitoring and log analytics thwart IT teams’ goal to address infrastructure performance problems, security threats, and user experience issues.

Analytics

Analytics Storage Infrastructure Metrics

How Netflix uses eBPF flow logs at scale for network insight

The Netflix TechBlog

JUNE 7, 2021

Challenges The cloud network infrastructure that Netflix utilizes today consists of AWS services such as VPC, DirectConnect, VPC Peering, Transit Gateways, NAT Gateways, etc and Netflix owned devices. These metrics are visualized using Lumen , a self-service dashboarding infrastructure. What is BPF?

Network

Network Transportation AWS Cloud

Stress Testing for Resilience in Modern Infrastructure

Sustainability: Thoughts from a software engineer

Trending Sources

What Is Platform Engineering?

What is platform engineering?

Next generation Dynatrace Davis AI becomes the default causation engine

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

A Kubernetes platform engineering strategy tames Kubernetes complexity

Bringing Software Engineering Rigor to Data

How platform engineering and IDP observability can accelerate developer velocity

How observability, application security, and AI enhance DevOps and platform engineering maturity

Build systems more reliably with Dynatrace: Chaos Engineering

Hybrid cloud infrastructure explained: Weighing the pros, cons, and complexities

Platform engineering: Empowering key Kubernetes use cases with Dynatrace

The platform engineer role: A game-changer or just hype?

DevOps engineer tools: Deploy, test, evaluate, repeat

Key Elements of Site Reliability Engineering (SRE)

Automating Success: Building a better developer experience with platform engineering

Five-nines availability: Always-on infrastructure delivers system availability during the holidays’ peak loads

Building Resilience With Chaos Engineering and Litmus

Enhancing Kubernetes cluster management key to platform engineering success

Path to NoOps part 2: How infrastructure as code makes cloud automation attainable—and repeatable—at scale

Why applying chaos engineering to data-intensive applications matters

The keys to selecting a platform for end-to-end observability

Dynatrace joins the Microsoft Intelligent Security Association

What Is Platform Engineering? How To Get Started

How Netflix Content Engineering makes a federated graph searchable

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Site Reliability Engineering

Power Dashboarding, Part I: Start your exploration journey with Dashboards

SRE Best Practices for Java Applications

Ingest and enrich AWS Security Hub findings with Dynatrace

Viking Enterprise Solutions: Empowering Modern Data Infrastructure

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Reliability indicators that matter to your business: SLOs for all data types

Dynatrace supports newly launched GKE Arm clusters

Dynatrace observability now available for Red Hat OpenShift on IBM Z and LinuxONE mainframes

Introducing Configurable Metaflow

Automate digital excellence with Dynatrace Synthetic Monitoring and Workflows

Overcoming Challenges and Best Practices for Data Migration From On-Premise to Cloud

Achieving High Availability in CI/CD With Observability

API Gateway vs. Istio Service Mesh

Dynatrace expands root cause analysis to Kubernetes with Davis AI

How a data lakehouse brings data insights to life

How Netflix uses eBPF flow logs at scale for network insight

Stay Connected