Design, Engineering and Infrastructure - Technology Performance Pulse

Sustainability: Thoughts from a software engineer

Dynatrace

MARCH 17, 2025

How to achieve sustainable IT practices Use observability tools The first step in driving improvements is to obtain a comprehensive view of your IT infrastructure’s climate impact. Platform engineers can set defaults for development teams, such as the number of replicas a service should have or whether it scales automatically.

Software Engineering

Software Engineering Engineering Software Software

Designing Instagram

High Scalability

JANUARY 11, 2022

Machine Learning Engineer at Amazon and has led several machine-learning initiatives across the Amazon ecosystem. Design a photo-sharing platform similar to Instagram where users can upload their photos and share it with their followers. High Level Design. Component Design. API Design. Problem Statement.

Design

Design Media Storage Logistics

What is platform engineering?

Dynatrace

NOVEMBER 3, 2023

In response to this shift, platform engineering is growing in popularity. The practice of platform engineering has evolved alongside the increasing complexity of cloud environments. A platform encompasses a set of tools, services, and infrastructure that enables developers to build, test, and deploy software applications.

Engineering

Engineering DevOps Software Engineering Scalability

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dynatrace

NOVEMBER 7, 2023

Platform engineering is on the rise. According to leading analyst firm Gartner, “80% of software engineering organizations will establish platform teams as internal providers of reusable services, components, and tools for application delivery…” by 2026. Automation, automation, automation.

Engineering

Engineering DevOps Best Practices Infrastructure

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

a Netflix member via Twitter This is an example of a question our on-call engineers need to answer to help resolve a member issue?—?which Now let’s look at how we designed the tracing infrastructure that powers Edgar. Now let’s look at how we designed the tracing infrastructure that powers Edgar.

Infrastructure

Infrastructure Transportation Storage Open Source

Dynatrace joins the Microsoft Intelligent Security Association

Dynatrace

NOVEMBER 20, 2024

This latest integration with Microsoft Sentinel expands our partnership, providing joint customers with a holistic view of their entire cloud environment; from application to infrastructure, data, and security. “As The Davis AI engine automatically and continuously delivers actionable insights based on an environment’s current state.

Best Practices

Best Practices Innovation Azure Cloud

DevOps engineer tools: Deploy, test, evaluate, repeat

Dynatrace

DECEMBER 8, 2022

As cloud-native, distributed architectures proliferate, the need for DevOps technologies and DevOps platform engineers has increased as well. DevOps engineer tools can help ease the pressure as environment complexity grows. ” What does a DevOps platform engineer do? .” What are DevOps engineer tools and platforms.

DevOps

DevOps Engineering Testing Open Source

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

The Netflix TechBlog

MARCH 25, 2019

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and Efficiency By: Di Lin , Girish Lingappa , Jitender Aswani Imagine yourself in the role of a data-inspired decision maker staring at a metric on a dashboard about to make a critical business decision but pausing to ask a question?—?“Can

Infrastructure

Infrastructure Big Data Transportation Architecture

Dynatrace achieves Amazon RDS Service Ready designation

Dynatrace

MAY 5, 2020

We’re therefore excited to announce that Dynatrace has received the Amazon RDS Service Ready designation. Achieving this designation differentiates Dynatrace as an AWS Advanced Technology Partner with a product that is integrated with Amazon RDS and is generally available and fully supported.

Design

Design AWS Education Innovation

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Stream processing enables software engineers to model their applications’ business logic as high-level representations in a directed acyclic graph without explicitly defining a physical execution plan. Failures can occur unpredictably across various levels, from physical infrastructure to software layers.

Engineering

Engineering Tuning Latency Open Source

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Dynatrace

FEBRUARY 6, 2025

With Dashboards , you can monitor business performance, user interactions, security vulnerabilities, IT infrastructure health, and so much more, all in real time. Even if infrastructure metrics aren’t your thing, you’re welcome to join us on this creative journey simply swap out the suggested metrics for ones that interest you.

Metrics

Metrics Infrastructure Monitoring Best Practices

Demystifying Interviewing for Backend Engineers @ Netflix

The Netflix TechBlog

FEBRUARY 1, 2022

By Karen Casella, Director of Engineering, Access & Identity Management Have you ever experienced one of the following scenarios while looking for your next role? Most backend engineering teams follow a process very similar to what is shown below. If so, we invite you to begin the interview process.

Engineering

Engineering Games Entertainment Innovation

How Netflix Content Engineering makes a federated graph searchable

The Netflix TechBlog

APRIL 12, 2022

By Alex Hutter , Falguni Jhaveri and Senthil Sayeebaba Over the past few years Content Engineering at Netflix has been transitioning many of its services to use a federated GraphQL platform. The Studio Search platform was designed to take a portion of the federated graph, a subgraph rooted at an entity of interest, and make it searchable.

Engineering

Engineering Architecture Java Infrastructure

Data Engineers of Netflix?—?Interview with Pallavi Phadnis

The Netflix TechBlog

OCTOBER 28, 2021

Data Engineers of Netflix?—?Interview Interview with Pallavi Phadnis This post is part of our “ Data Engineers of Netflix ” series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. Pallavi Phadnis is a Senior Software Engineer at Netflix.

Data Engineering

Data Engineering Engineering Big Data Software Engineering

Introducing Configurable Metaflow

The Netflix TechBlog

DECEMBER 19, 2024

This has been a guiding design principle with Metaflow since its inception. Subsequent versions of the model will result from experimenting with hyper parameters, tweaking feature engineering, or conducting feature diets. demo.branch_demox.demo_features_f workflows/demo.main.sch.yaml (binding=default): cluster=sandbox, workflow.id=demo.branch_demox.main

Best Practices

Best Practices Cache Metrics Code

How to Prepare for Your DevOps Interview

DZone

SEPTEMBER 5, 2019

Over the past decade, DevOps has emerged as a new tech culture and career that marries the rapid iteration desired by software development with the rock-solid stability of the infrastructure operations team. As of August 2019, there are currently over 50,000 LinkedIn DevOps job listings in the United States alone.

DevOps

DevOps Software Engineering Infrastructure Engineering

OpenPipeline: Simplify access to critical business data

Dynatrace

NOVEMBER 4, 2024

Business events: Delivering the best data It’s been two years since we introduced business events , a special class of events designed to support even the most demanding business use cases. Our Business Analytics solution is a prominent beneficiary of this commitment. Business process monitoring and optimization.

Analytics

Analytics Airlines Metrics Monitoring

Dynatrace observability now available for Red Hat OpenShift on IBM Z and LinuxONE mainframes

Dynatrace

JULY 24, 2024

Dynatrace full stack observability for Red Hat OpenShift Dynatrace enhances software quality and operational efficiency, which drives innovation by unifying application, operation, and platform engineering teams on a single platform. Dynatrace is designed to scale easily across the entire Kubernetes stack.

Availability

Availability Infrastructure Metrics Monitoring

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

The Challenge of Title Launch Observability As engineers, were wired to track system metrics like error rates, latencies, and CPU utilizationbut what about metrics that matter to a titlessuccess? How can we design systems that recognize these nuances and empower every title to shine and bring joy to ourmembers?

Traffic

Traffic Scalability Strategy Monitoring

Site reliability engineering: Six SRE trends to unleash DevOps innovation

Dynatrace

JUNE 2, 2022

Site reliability engineering (SRE) continues to gain popularity as organizations embrace hybrid cloud strategies and IT automation at scale. By applying software engineering principles to operations and infrastructure practices, SRE enables organizations to streamline and automate IT processes. Dynatrace news.

DevOps

DevOps Innovation Engineering Benchmarking

Engineering dependability and fault tolerance in a distributed system

High Scalability

FEBRUARY 19, 2021

In this article, we discuss the concepts of dependability and fault tolerance in detail and explain how the Ably platform is designed with fault tolerant approaches to uphold its dependability guarantees. Fault tolerant design approaches address these shortfalls to provide continuity both to business and to the user experience.

Engineering

Engineering Systems Availability Scalability

2019 Open Source Database Report: Top Databases, Public Cloud vs. On-Premise, Polyglot Persistence

Scalegrid

JUNE 11, 2019

Wondering whether an on-premise vs. public cloud vs. hybrid cloud infrastructure is best for your database strategy? Cloud Infrastructure Analysis : Public Cloud vs. On-Premise vs. Hybrid Cloud. This comes as no surprise, as MySQL has held this position consistently for many years according to DB-Engines. Commercial Databases.

Open Source

Open Source Database Cloud Infrastructure

Mastering Kubernetes with Dynatrace

Dynatrace

AUGUST 24, 2020

But there are other related components and processes (for example, cloud provider infrastructure) that can cause problems in applications running on Kubernetes. Dynatrace AWS monitoring gives you an overview of the resources that are used in your AWS infrastructure along with their historical usage. Monitoring your i nfrastructure.

Analytics

Analytics Infrastructure AWS Operating System

Simplified observability for your SNMP devices

Dynatrace

MARCH 22, 2021

While today’s IT world continues the shift toward treating everything as a service, many organizations need to keep their environments under strict control while managing their infrastructure themselves on-premises. Some SNMP-enabled devices are designed to report events on their own with so-called SNMP traps. SNMP observability.

Metrics

Metrics Network Infrastructure Traffic

Dynatrace supports Amazon Linux 2023 as an AWS launch partner

Dynatrace

MARCH 14, 2023

Amazon’s new general-purpose Linux for AWS is designed to provide a secure, stable, and high-performance execution environment to develop and run cloud applications. Saving your cloud operations and SRE teams hours of guesswork and manual tagging, the Davis AI engine analyzes billions of events in real time. How does Dynatrace help?

AWS

AWS Lambda Serverless Virtualization

Auth0 Architecture: Running In Multiple Cloud Providers And Regions

High Scalability

AUGUST 27, 2018

This is article was written by Dirceu Pereira Tiegs, Site Reliability Engineer at Auth0, and originally was originally published in Auth0. We designed Auth0 from the beginning so that it could run anywhere: on our cloud, on your cloud, or even on your own private infrastructure. A lot has changed since then in Auth0.

Architecture

Architecture Cloud Traffic Infrastructure

AWS serverless services: Exploring your options

Dynatrace

OCTOBER 7, 2021

Instead of worrying about infrastructure management functions, such as capacity provisioning and hardware maintenance, teams can focus on application design, deployment, and delivery. Serverless architecture offers several benefits for enterprises. Simplicity. The first benefit is simplicity. Let’s explore each in more detail.

Serverless

Serverless AWS Lambda Storage

Python at Netflix

The Netflix TechBlog

APRIL 29, 2019

An easy, though imprecise, way of thinking about Netflix infrastructure is that everything that happens before you press Play on your remote control (e.g., Various software systems are needed to design, build, and operate this CDN infrastructure, and a significant number of them are written in Python. are you logged in?

Open Source

Open Source Network Infrastructure Big Data

ShiftLeft on Refactoring a Live SaaS Environment

High Scalability

NOVEMBER 2, 2020

This is guest a post by Preetam Jinka , Senior Infrastructure Engineer at ShiftLeft. NG SAST was initially designed only for vulnerabilities. The analogy is that it’s like changing the engine on an airplane in flight without the passengers noticing. Originally published here.

Storage

Storage Design Infrastructure Engineering

Growth Engineering at Netflix- Creating a Scalable Offers Platform

The Netflix TechBlog

FEBRUARY 9, 2021

The Growth Engineering team is responsible for executing growth initiatives that help us anticipate and adapt to this change. In particular, it’s our job to design and build the systems and protocols that enable customers from all over the world to sign up for Netflix with the plan features and incentives that best suit their needs.

Engineering

Engineering Scalability Architecture Innovation

New SNMP platform extensions provide observability at scale for network devices

Dynatrace

NOVEMBER 24, 2021

The success of an organization often depends on the quality of the on-premises or physical IT infrastructure, among other things. Constantly monitoring infrastructure health state and making ongoing optimizations are essential for Ops teams, SREs (site-reliability engineers), and IT admins. Monitor Cisco and any other devices.

Network

Network Infrastructure Virtualization Metrics

How Netflix Scales its API with GraphQL Federation (Part 2)

The Netflix TechBlog

DECEMBER 11, 2020

Our Journey so Far Over the past year, we’ve implemented the core infrastructure pieces necessary for a federated GraphQL architecture as described in our previous post: Studio Edge Architecture The first Domain Graph Service (DGS) on the platform was the former GraphQL monolith that we discussed in our first post (Studio API).

Architecture

Architecture Best Practices Engineering Open Source

Building a Rule-Based Platform to Manage Netflix Membership SKUs at Scale

The Netflix TechBlog

FEBRUARY 16, 2021

Membership Engineering at Netflix is responsible for the plan and pricing configurations for every market worldwide. However, with our rapid product innovation speed, the whole approach experienced significant challenges: Business Complexity: The existing SKU management solution was designed years ago when the engagement rules were simple?—?three

Mobile

Mobile Engineering Infrastructure Scalability

Dynatrace again named a Leader in 2021 Gartner Magic Quadrant for APM, received highest scores in 4 of 5 use cases in 2021 Gartner Critical Capabilities for APM

Dynatrace

APRIL 19, 2021

In the Magic Quadrant report, Gartner defines APM as, “software that enables the observation of application behavior and its infrastructure dependencies, users, and business key performance indicators (KPIs) throughout the application’s life cycle.” Extend our best-in-class observability with unparalleled AIOps and automation.

DevOps

DevOps Innovation Speed Infrastructure

Driving your FinOps strategy with observability best practices

Dynatrace

MARCH 18, 2024

Following FinOps practices, engineering, finance, and business teams take responsibility for their cloud usage, making data-driven spending decisions in a scalable and sustainable manner. Suboptimal architecture design. Poorly designed cloud solutions can become costly over time.

Best Practices

Best Practices Strategy Cloud AWS

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

Vidhya Arvind , Rajasekhar Ummadisetty , Joey Lynch , Vinay Chella Introduction At Netflix our ability to deliver seamless, high-quality, streaming experiences to millions of users hinges on robust, global backend infrastructure. To overcome these challenges, we developed a holistic approach that builds upon our Data Gateway Platform.

Latency

Latency Storage Cache Efficiency

Dynatrace accelerates business transformation with new AI observability solution

Dynatrace

JANUARY 31, 2024

Data dependencies and framework intricacies require observing the lifecycle of an AI-powered application end to end, from infrastructure and model performance to semantic caches and workflow orchestration. Estimates show that NVIDIA, a semiconductor manufacturer, could release 1.5 million AI server units annually by 2027, consuming 75.4+

Cache

Cache Azure Infrastructure Monitoring

Dynatrace adds monitoring support for Microsoft Azure Kubernetes Service deployments using Azure Linux container host

Dynatrace

MAY 24, 2023

Microsoft initially designed the OS for internal use to develop and manage Azure services. Microsoft designed the kernel and other aspects of the OS with an emphasis on security due to its focused role in executing container workloads. This design approach helps eliminate the need to patch and maintain essential packages.

Azure

Azure Monitoring Operating System Virtualization

Auto-adaptive thresholds for AI-driven quality gating

Dynatrace

JUNE 4, 2024

Build an umbrella for Development and Operations In modern software engineering, the discipline of platform engineering delivers DevSecOps practices to developers to bridge the gaps between development, security, and operations and enhance the developer experience.

Metrics

Metrics Engineering Code Tuning

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

Key insights from this shiftinclude: A Data-Centric Approach : Shifting focus from model-centric strategies, which heavily rely on feature engineering, to a data-centric one. This approach prioritizes the accumulation of large-scale, high-quality data and, where feasible, aims for end-to-end learning.

Tuning

Tuning Efficiency Latency Strategy

SRE vs DevOps: What you need to know

Dynatrace

FEBRUARY 24, 2021

SRE is the transformation of traditional operations practices by using software engineering and DevOps principles to improve the availability, performance, and scalability of releases by building resiliency into apps and infrastructure. Designating and managing Service Level Objectives (SLOs) as availability targets for a service.

DevOps

DevOps Software Engineering Speed Google

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

It’s architecture was specially designed to manage large-scale data warehouses and business intelligence workloads by giving you the ability to spread your data out across a multitude of servers. Greenplum uses an MPP database design that can help you develop a scalable, high performance deployment. Greenplum Architectural Design.

Big Data

Big Data Database Artificial Intelligence Open Source

AIOps and observability: The sense-think-act model for modern observability

Dynatrace

JULY 7, 2022

But as IT teams increasingly design and manage cloud-native technologies, the tasks IT pros need to accomplish are equally variable and complex. The framework forms the basis of the SAE (Society of Automotive Engineers) automation levels 1 through 5 for cars. The sense-think-act model for AIOps and observability. Act’ with AIOps.

Artificial Intelligence

Artificial Intelligence Automotive DevOps Infrastructure

Scaling Media Machine Learning at Netflix

The Netflix TechBlog

FEBRUARY 13, 2023

Our goal in building a media-focused ML infrastructure is to reduce the time from ideation to productization for our media ML practitioners. Amber is a suite of multiple infrastructure components that offers triggering capabilities to initiate the computation of algorithms with recursive dependency resolution.

Media

Media Storage Infrastructure Systems

Sustainability: Thoughts from a software engineer

Designing Instagram

Trending Sources

What is platform engineering?

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Building Netflix’s Distributed Tracing Infrastructure

Dynatrace joins the Microsoft Intelligent Security Association

DevOps engineer tools: Deploy, test, evaluate, repeat

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

Dynatrace achieves Amazon RDS Service Ready designation

Why applying chaos engineering to data-intensive applications matters

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Demystifying Interviewing for Backend Engineers @ Netflix

How Netflix Content Engineering makes a federated graph searchable

Data Engineers of Netflix?—?Interview with Pallavi Phadnis

Introducing Configurable Metaflow

How to Prepare for Your DevOps Interview

OpenPipeline: Simplify access to critical business data

Dynatrace observability now available for Red Hat OpenShift on IBM Z and LinuxONE mainframes

Title Launch Observability at Netflix Scale

Site reliability engineering: Six SRE trends to unleash DevOps innovation

Engineering dependability and fault tolerance in a distributed system

2019 Open Source Database Report: Top Databases, Public Cloud vs. On-Premise, Polyglot Persistence

Mastering Kubernetes with Dynatrace

Simplified observability for your SNMP devices

Dynatrace supports Amazon Linux 2023 as an AWS launch partner

Auth0 Architecture: Running In Multiple Cloud Providers And Regions

AWS serverless services: Exploring your options

Python at Netflix

ShiftLeft on Refactoring a Live SaaS Environment

Growth Engineering at Netflix- Creating a Scalable Offers Platform

New SNMP platform extensions provide observability at scale for network devices

How Netflix Scales its API with GraphQL Federation (Part 2)

Building a Rule-Based Platform to Manage Netflix Membership SKUs at Scale

Dynatrace again named a Leader in 2021 Gartner Magic Quadrant for APM, received highest scores in 4 of 5 use cases in 2021 Gartner Critical Capabilities for APM

Driving your FinOps strategy with observability best practices

Introducing Netflix’s Key-Value Data Abstraction Layer

Dynatrace accelerates business transformation with new AI observability solution

Dynatrace adds monitoring support for Microsoft Azure Kubernetes Service deployments using Azure Linux container host

Auto-adaptive thresholds for AI-driven quality gating

Foundation Model for Personalized Recommendation

SRE vs DevOps: What you need to know

What is Greenplum Database? Intro to the Big Data Database

AIOps and observability: The sense-think-act model for modern observability

Scaling Media Machine Learning at Netflix

Stay Connected