Architecture, Engineering and Software Engineering

Low-Maintenance Backend Architectures for Scalable Applications

DZone

JANUARY 10, 2025

After years of working in the intricate world of software engineering, I learned that the most beautiful solutions are often those unseen: backends that hum along, scaling with grace and requiring very little attention. Developers could understand and manage the entire systems intricacies.

Architecture

Architecture Scalability Software Engineering Cloud

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Stream processing One approach to such a challenging scenario is stream processing, a computing paradigm and software architectural style for data-intensive software systems that emerged to cope with requirements for near real-time processing of massive amounts of data. Recovery time of the latency p90.

Engineering

Engineering Tuning Latency Open Source

Architecture Patterns: The Circuit-Breaker

DZone

NOVEMBER 3, 2023

Much like how an electrical circuit breaker prevents an overload by stopping the flow of electricity when excessive current is detected, the Circuit Breaker pattern in software engineering stops the flow of requests to a service when the number of failures exceeds a predefined threshold.

Architecture

Architecture Software Engineering Traffic Engineering

Site reliability engineering: 5 things you need to know

Dynatrace

FEBRUARY 4, 2021

What is site reliability engineering? Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. Dynatrace news. SRE focuses on automation.

Engineering

Engineering DevOps Government Latency

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

Part 3: System Strategies and Architecture By: VarunKhaitan With special thanks to my stunning colleagues: Mallika Rao , Esmir Mesic , HugoMarques This blog post is a continuation of Part 2 , where we cleared the ambiguity around title launch observability at Netflix. The request schema for the observability endpoint.

Traffic

Traffic Strategy Entertainment Innovation

How platform engineering and IDP observability can accelerate developer velocity

Dynatrace

MARCH 6, 2024

As organizations look to expand DevOps maturity, improve operational efficiency, and increase developer velocity, they are embracing platform engineering as a key driver. Platform engineering: Build for self-service Self-service deployment is a key attribute of platform engineering. “It makes them more productive.

Engineering

Engineering Development DevOps Infrastructure

Site reliability engineering: 5 things to you need to know

Dynatrace

FEBRUARY 4, 2021

Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. ” According to Google, “SRE is what you get when you treat operations as a software problem.”

Engineering

Engineering DevOps Government Latency

A Recap of the Data Engineering Open Forum at Netflix

The Netflix TechBlog

JUNE 20, 2024

A summary of sessions at the first Data Engineering Open Forum at Netflix on April 18th, 2024 The Data Engineering Open Forum at Netflix on April 18th, 2024. At Netflix, we aspire to entertain the world, and our data engineering teams play a crucial role in this mission by enabling data-driven decision-making at scale.

Data Engineering

Data Engineering Engineering Entertainment Software Engineering

Kubernetes Observability: Lessons Learned From Running Kubernetes in Production

DZone

OCTOBER 1, 2024

In recent years, observability has re-emerged as a critical aspect of DevOps and software engineering in general, driven by the growing complexity and scale of modern, cloud-native applications.

Software Engineering

Software Engineering DevOps Cloud Architecture

5 powerful use cases beyond debugging for Dynatrace Live Debugger

Dynatrace

MARCH 25, 2025

Following are some of the coolest things weve seen engineers do with Live Debugger. Performance benchmarking Performance benchmarking is one of the unresolved mysteries of software engineering. White box testing The nicest thing about deploying UI changes to production is that you can immediately see the changes in action.

Benchmarking

Benchmarking Code Open Source Engineering

The state of site reliability engineering: SRE challenges and best practices in 2023

Dynatrace

NOVEMBER 14, 2023

Site reliability engineering (SRE) has become increasingly important to organizations looking to keep up with the rapid pace of digital transformation. Effective site reliability engineering requires enterprise-wide transformation Without a unified understanding of SRE practices, organizational silos can quickly form between departments.

Best Practices

Best Practices Engineering DevOps Software Engineering

Automating Success: Building a better developer experience with platform engineering

Dynatrace

FEBRUARY 12, 2024

When it comes to platform engineering, not only does observability play a vital role in the success of organizations’ transformation journeys—it’s key to successful platform engineering initiatives. The various presenters in this session aligned platform engineering use cases with the software development lifecycle.

Engineering

Engineering Development Infrastructure Cloud

Up your quality and agility factor – using automation to build “performance-as-a-self-service”

Dynatrace

MARCH 3, 2020

For software engineering teams, this demand means not only delivering new features faster but ensuring quality, performance, and scalability too. One way to apply improvements is transforming the way application performance engineering and testing is done. Here is the definition of this model: ?. Try it today using Keptn .

Performance

Performance Education Innovation Software Architecture

Nurturing Design in Your Software Engineering Culture

Strategic Tech

MARCH 16, 2021

There are a few qualities that differentiate average from high performing software engineering organisations. I believe that attitude towards the design of code and architecture is one of them. In Accelerate , Nicole Forsgren shows a link between well-designed, loosely-coupled architecture and more frequent software delivery.

Software Engineering

Software Engineering Design Engineering Software

Architected for resiliency: How Dynatrace withstands data center outages

Dynatrace

JUNE 15, 2021

The fact is, Reliability and Resiliency must be rooted in the architecture of a distributed system. The email walked through how our Dynatrace self-monitoring notified users of the outage but automatically remediated the problem thanks to our platform’s architecture. Let me start with the end-user impact.

AWS

AWS Traffic Architecture Azure

Growth Engineering at Netflix?—?Automated Imagery Generation

The Netflix TechBlog

FEBRUARY 9, 2021

Growth Engineering at Netflix?—?Automated In the Growth Engineering team, we refer to this as the top of the signup funnel. For more background on the signup funnel and Growth Engineering’s role in the signup funnel, please read our initial post on the topic: Growth Engineering at Netflix? Growth Engineering at Netflix?—?Automated

Engineering

Engineering Storage Latency Entertainment

Site reliability engineering: Six SRE trends to unleash DevOps innovation

Dynatrace

JUNE 2, 2022

Site reliability engineering (SRE) continues to gain popularity as organizations embrace hybrid cloud strategies and IT automation at scale. By applying software engineering principles to operations and infrastructure practices, SRE enables organizations to streamline and automate IT processes. Dynatrace news.

DevOps

DevOps Innovation Engineering Benchmarking

Growth Engineering at Netflix- Creating a Scalable Offers Platform

The Netflix TechBlog

FEBRUARY 9, 2021

The Growth Engineering team is responsible for executing growth initiatives that help us anticipate and adapt to this change. For more background on Growth Engineering and the signup funnel, please have a look at our previous blog post that covers the basics. We need to be constantly adapting and innovating as a result of this change.

Engineering

Engineering Scalability Architecture Innovation

Conducting log analysis with an observability platform and full data context

Dynatrace

APRIL 20, 2023

Causal AI—which brings AI-enabled actionable insights to IT operations—and a data lakehouse, such as Dynatrace Grail , can help break down silos among ITOps, DevSecOps, site reliability engineering, and business analytics teams. Logs are automatically produced and time-stamped documentation of events relevant to cloud architectures.

Analytics

Analytics Infrastructure Storage Architecture

What is DevOps orchestration? And why invest in orchestration tools?

Dynatrace

DECEMBER 5, 2022

Monitoring and logging tools that once worked well with earlier IT architectures no longer provide sufficient context and integration to understand the state of complex systems or diagnose and correct security issues. Manually managing and securing multi-cloud environments is no longer practical. Automation versus orchestration.

DevOps

DevOps Virtualization Best Practices Innovation

Data Engineers of Netflix?—?Interview with Samuel Setegne

The Netflix TechBlog

JUNE 1, 2021

Data Engineers of Netflix?—?Interview Interview with Samuel Setegne Samuel Setegne This post is part of our “Data Engineers of Netflix” interview series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. What drew you to Netflix?

Data Engineering

Data Engineering Engineering Big Data Healthcare

Designing Instagram

High Scalability

JANUARY 11, 2022

Machine Learning Engineer at Amazon and has led several machine-learning initiatives across the Amazon ecosystem. Architecture. FUN FACT : In this talk , Rodrigo Schmidt, director of engineering at Instagram talks about the different challenges they have faced in scaling the data infrastructure at Instagram. High Level Design.

Design

Design Media Storage Logistics

How and Why the Developer-First Approach Is Changing the Observability Landscape

DZone

DECEMBER 11, 2024

In our quest for greater scalability, resilience, and flexibility within the digital infrastructure of our organization, there has been a strategic pivot away from traditional monolithic application architectures towards embracing modern software engineering practices such as microservices architecture coupled with cloud-native applications.

Development

Development Software Engineering Architecture Scalability

Using a GPU Boosts TiDB Analytics Performance from 10 to 150

DZone

JULY 14, 2021

Fei Xu (Software Engineer at PingCAP). Authors: Ruoxi Sun (Tech Lead of Analytical Computing Team at PingCAP). TiDB is a Hybrid Transaction/Analytical Processing (HTAP) database that can efficiently process analytical queries.

Analytics

Analytics Software Engineering Performance Database

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

The Netflix TechBlog

MARCH 25, 2019

Now, imagine yourself in the role of a software engineer responsible for a micro-service which publishes data consumed by few critical customer facing services (e.g. You are about to make structural changes to the data and want to know who and what downstream to your service will be impacted.

Infrastructure

Infrastructure Big Data Transportation Architecture

The Show Must Go On: Securing Netflix Studios At Scale

The Netflix TechBlog

SEPTEMBER 13, 2021

Supporting developers through those checklists for edge cases, and then validating that each team’s choices resulted in an architecture with all the desired security properties, was similarly not scalable for our security engineers. Netflix engineers talk a lot about the concept of a “ Paved Road ”.

Internet

Internet Internet Cloud Traffic

SKP's Java/Java EE Gotchas: Clash of the Titans, C++ vs. Java!

DZone

FEBRUARY 27, 2021

As a Software Engineer, the mind is trained to seek optimizations in every aspect of development and ooze out every bit of available CPU Resource to deliver a performing application. This begins not only in designing the algorithm or coming out with efficient and robust architecture but right onto the choice of programming language.

Java

Java C++ Benchmarking Programming

Connect your software with the right people: Ownership drives effective collaboration

Dynatrace

MARCH 28, 2023

Incident management with clearly defined responsibilities Site Reliability Engineers (SRE) are challenged not only to detect problems and identify the root cause quickly but also to remediate problems immediately. Any software engineer can search for monitored entities that relate to specific deployments and their respective teams.

Software

Software Software Monitoring Software Engineering

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

MAY 31, 2023

Site reliability engineering (SRE) has recently become a critical discipline in recent years as the world has shifted in favor of web-based interactions. This shift is leading more organizations to hire site reliability engineers to guarantee the reliability and resiliency of their services. Mobile retail e-commerce spending in the U.

Best Practices

Best Practices DevOps Latency Metrics

What is application security? And why it needs a new approach

Dynatrace

MARCH 17, 2021

Application security is a software engineering term that refers to several different types of security practices designed to ensure applications do not contain vulnerabilities that could allow illicit access to sensitive data, unauthorized code modification, or resource hijacking. Dynatrace news.

Open Source

Open Source Cloud Games Java

Scale DevOps and SRE with open source Keptn

Dynatrace

APRIL 18, 2022

When it comes to site reliability engineering (SRE) initiatives adopting DevOps practices, developers and operations teams frequently find themselves at odds with one another. Dynatrace news. Developers want to write high-quality code and deploy it quickly. Operations teams want to make sure the system doesn’t break.

Open Source

Open Source DevOps Cloud Metrics

Dynatrace extends distributed tracing for serverless on AWS Lambda (GA)?

Dynatrace

JANUARY 19, 2021

Serverless architectures help developers innovate more efficiently and effectively by removing the burden of managing underlying infrastructure. Dynatrace is happy to announce its enhanced AWS Lambda extension, expanding its support for Amazon Web Services (AWS) Lambda and serverless architectures. The Dynatrace AI engine, Davis,?automatically

Lambda

Lambda Serverless AWS Mobile

Automated observability, security, and reliability at scale

Dynatrace

JULY 18, 2023

As software development grows more complex, managing components using an automated onboarding process becomes increasingly important. This is especially crucial in microservice architectures, where the number of components can be overwhelming.

Best Practices

Best Practices Code Infrastructure Latency

Re-Architecting Cash and Digital Wallet Payments for India with Uber Engineering

Uber Engineering

JUNE 19, 2017

In this article, San Francisco-based software engineer Yijun Liu reflects on his experiences working with … The post Re-Architecting Cash and Digital Wallet Payments for India with Uber Engineering appeared first on Uber Engineering Blog.

Engineering

Engineering Software Engineering Software Software

Engineering well-rounded technology leaders

O'Reilly Software

JANUARY 12, 2018

2018 marks the fourth year of O’Reilly’s Software Architecture Conference , a software engineering event focused on providing hands-on training experiences for technologists at all levels of an organization—from experienced developers up through CTOs. Building evolutionary software architecture.

Technology

Technology Technology Engineering Software Architecture

Dynatrace Perform 2024 Guide: Deriving business value from AI data analysis

Dynatrace

JANUARY 23, 2024

Composite’ AI, platform engineering, AI data analysis through custom apps This focus on data reliability and data quality also highlights the need for organizations to bring a “ composite AI ” approach to IT operations, security, and DevOps. To learn more about platform engineering, explore the following resources.

Performance

Performance DevOps Innovation Energy

Orchestrating Data/ML Workflows at Scale With Netflix Maestro

The Netflix TechBlog

OCTOBER 18, 2022

Meson was based on a single leader architecture with high availability. Data scientists, engineers, non-engineers, and even content producers all run their data pipelines to get the necessary insights. Figure 1 shows the high-level architecture. With the high growth of workflows in the past few years?

Java

Java Scalability Traffic Architecture

Edge Authentication and Token-Agnostic Identity Propagation

The Netflix TechBlog

FEBRUARY 9, 2021

Plus, the architecture of the Edge tier was evolving to a PaaS (platform as a service) model, and we had some tough decisions to make about how, and where, to handle identity token handling. The system architecture now takes the form of: Notice that tokens never traverse past the Edge gateway / EAS boundary. We are serving over 2.5

Architecture

Architecture Latency Servers Website

MLOps and DevOps: Why Data Makes It Different

O'Reilly

OCTOBER 19, 2021

This is both frustrating for companies that would prefer making ML an ordinary, fuss-free value-generating function like software engineering, as well as exciting for vendors who see the opportunity to create buzz around a new category of enterprise software. Software Architecture. This approach is not novel.

DevOps

DevOps Software Engineering Infrastructure Open Source

Forming an Architecture Modernization Enabling Team (AMET)

Strategic Tech

JANUARY 22, 2024

Architecture modernization initiatives are strategic efforts involving many teams, usually for many months or years. An AMET is an architecture Enabling Team that helps to coordinate and upskill all teams and stakeholders involved in a modernization initiative. They need a more loosely coupled architecture and empowered teams.

Architecture

Architecture Metrics Logistics Strategy

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

DECEMBER 3, 2019

mainly because of mundane reasons related to software engineering. They know that feature engineering is critical for many models, so they want to stay in control of model inputs and feature engineering logic. Instead, we heard stories about projects where getting the first version to production took surprisingly long?—?mainly

Open Source

Open Source AWS Infrastructure Energy

Making our Android Studio Apps Reactive with UI Components & Redux

The Netflix TechBlog

MAY 30, 2019

Our very first mobile app is called Prodicle and was built for Android & iOS using the same reactive architecture in both platforms, which allowed us to build 2 apps from scratch in 3 months with 4 software engineers. Composable UIs contribute to fast engineering velocity and produce less side effect bugs.

Architecture

Architecture Mobile Testing Code

Re-Architecting the Video Gatekeeper

The Netflix TechBlog

JULY 12, 2019

By Drew Koszewnik This is the story about how the Content Setup Engineering team used Hollow, a Netflix OSS technology, to re-architect and simplify an essential component in our content pipeline?—?delivering A reduction in the time the Content Setup Engineering team spends on performance-related issues.

Cache

Cache Architecture Latency Engineering

Handling Flaky Unit Tests in Java

Uber Engineering

JUNE 15, 2021

It warns software engineers of bugs in newly-implemented code and regressions in existing code, before it is merged. This ensures increased software reliability. It also … The post Handling Flaky Unit Tests in Java appeared first on Uber Engineering Blog.

Java

Java Testing Software Engineering Engineering

Low-Maintenance Backend Architectures for Scalable Applications

Why applying chaos engineering to data-intensive applications matters

Trending Sources

Architecture Patterns: The Circuit-Breaker

Site reliability engineering: 5 things you need to know

Title Launch Observability at Netflix Scale

How platform engineering and IDP observability can accelerate developer velocity

Site reliability engineering: 5 things to you need to know

A Recap of the Data Engineering Open Forum at Netflix

Kubernetes Observability: Lessons Learned From Running Kubernetes in Production

5 powerful use cases beyond debugging for Dynatrace Live Debugger

The state of site reliability engineering: SRE challenges and best practices in 2023

Automating Success: Building a better developer experience with platform engineering

Up your quality and agility factor – using automation to build “performance-as-a-self-service”

Nurturing Design in Your Software Engineering Culture

Architected for resiliency: How Dynatrace withstands data center outages

Growth Engineering at Netflix?—?Automated Imagery Generation

Site reliability engineering: Six SRE trends to unleash DevOps innovation

Growth Engineering at Netflix- Creating a Scalable Offers Platform

Conducting log analysis with an observability platform and full data context

What is DevOps orchestration? And why invest in orchestration tools?

Data Engineers of Netflix?—?Interview with Samuel Setegne

Designing Instagram

How and Why the Developer-First Approach Is Changing the Observability Landscape

Using a GPU Boosts TiDB Analytics Performance from 10 to 150

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

The Show Must Go On: Securing Netflix Studios At Scale

SKP's Java/Java EE Gotchas: Clash of the Titans, C++ vs. Java!

Connect your software with the right people: Ownership drives effective collaboration

Site reliability done right: 5 SRE best practices that deliver on business objectives

What is application security? And why it needs a new approach

Scale DevOps and SRE with open source Keptn

Dynatrace extends distributed tracing for serverless on AWS Lambda (GA)?

Automated observability, security, and reliability at scale

Re-Architecting Cash and Digital Wallet Payments for India with Uber Engineering

Engineering well-rounded technology leaders

Dynatrace Perform 2024 Guide: Deriving business value from AI data analysis

Orchestrating Data/ML Workflows at Scale With Netflix Maestro

Edge Authentication and Token-Agnostic Identity Propagation

MLOps and DevOps: Why Data Makes It Different

Forming an Architecture Modernization Enabling Team (AMET)

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

Making our Android Studio Apps Reactive with UI Components & Redux

Re-Architecting the Video Gatekeeper

Handling Flaky Unit Tests in Java

Stay Connected