Engineering and Processing - Technology Performance Pulse

Part 2: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

JANUARY 2, 2025

This article is the second in a multi-part series sharing a breadth of Analytics Engineering work at Netflix, recently presented as part of our annual internal Analytics Engineering conference. Each format has a different production process and different patterns of cash spend, called our Content Forecast. Need to catch up?

Analytics

Analytics Engineering Games Entertainment

Sustainability: Thoughts from a software engineer

Dynatrace

MARCH 17, 2025

For example, reporting jobs can process monthly data without running exactly at the end of the month. Platform engineers can set defaults for development teams, such as the number of replicas a service should have or whether it scales automatically. For instance, optimizing a frontend library can save resources for every website.

Software Engineering

Software Engineering Engineering Software Software

What Is Platform Engineering?

DZone

FEBRUARY 6, 2024

Platform engineering is the creation and management of foundational infrastructure and automated processes, incorporating principles like abstraction, automation, and self-service, to empower development teams, optimize resource utilization, ensure security, and foster collaboration for efficient and scalable software development.

Engineering

Engineering Scalability Infrastructure Efficiency

Close Site Search Indexing via Kubernetes HAProxy Ingress

DZone

OCTOBER 22, 2024

Ingress is essential for routing incoming traffic to your service; however, there may be scenarios in which you want to prevent search engines from indexing your service's content: it might be a development environment or something else.

Traffic

Traffic Engineering Processing Development

Breaking AWS Lambda: Chaos Engineering for Serverless Devs

DZone

MARCH 24, 2025

Our "serverless" order processing system built on AWS Lambda and API Gateway was humming along, handling 1,000 transactions/minute. The Day Our Serverless Dream Turned into a Nightmare It was 3 PM on a Tuesday. Then, disaster struck.

Lambda

Lambda Serverless AWS Engineering

What is platform engineering?

Dynatrace

NOVEMBER 3, 2023

In response to this shift, platform engineering is growing in popularity. The practice of platform engineering has evolved alongside the increasing complexity of cloud environments. Platform engineers design and implement these platforms, as well as ensure their security, scalability, and reliability.

Engineering

Engineering DevOps Software Engineering Scalability

Financial Data Engineering in SAS

DZone

JANUARY 8, 2024

Financial data engineering in SAS involves the management, processing, and analysis of financial data using the various tools and techniques provided by the SAS software suite. Here are some key aspects of financial data engineering in SAS: 1.

Data Engineering

Data Engineering Engineering Database Software

Our First Netflix Data Engineering Summit

The Netflix TechBlog

DECEMBER 14, 2023

Engineers from across the company came together to share best practices on everything from Data Processing Patterns to Building Reliable Data Pipelines. The result was a series of talks which we are now sharing with the rest of the Data Engineering community!

Data Engineering

Data Engineering Engineering Software Engineering Best Practices

Life of a Netflix Partner Engineer?—?The case of extra 40 ms

The Netflix TechBlog

DECEMBER 14, 2020

Life of a Netflix Partner Engineer?—?The The case of the extra 40 ms By: John Blair , Netflix Partner Engineering The Netflix application runs on hundreds of smart TVs, streaming sticks and pay TV set top boxes. The role of a Partner Engineer at Netflix is to help device manufacturers launch the Netflix application on their devices.

Engineering

Engineering Code Open Source Hardware

Dynatrace KSPM: Transforming Kubernetes security and compliance

Dynatrace

DECEMBER 9, 2024

Processes are time-intensive. Slow processes introduce risk. Continuous visibility and assessment provide platform engineering, DevSecOps, DevOps, and SRE teams with the ability to track, validate, and remediate potential compliance-relevant findings and create the necessary evidence for the auditing process. Reactivity.

Best Practices

Best Practices Benchmarking DevOps Efficiency

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Stream processing One approach to such a challenging scenario is stream processing, a computing paradigm and software architectural style for data-intensive software systems that emerged to cope with requirements for near real-time processing of massive amounts of data.

Engineering

Engineering Tuning Latency Open Source

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

This approach enhances key DORA metrics and enables early detection of failures in the release process, allowing SREs more time for innovation. To enhance reliability, testing the software under these conditions is crucial to prepare for potential issues by leveraging chaos engineering or similar tools. Why reliability?

Engineering

Engineering Systems Latency Metrics

A Kubernetes platform engineering strategy tames Kubernetes complexity

Dynatrace

JULY 25, 2024

I spoke with Martin Spier, PicPay’s VP of Engineering, about the challenges PicPay experienced and the Kubernetes platform engineering strategy his team adopted in response. In addition, their logs-heavy approach to analysis made scaling processes complex and costly. “And these layers tend to be similar. .

Strategy

Strategy Engineering Open Source Java

DevOps engineer tools: Deploy, test, evaluate, repeat

Dynatrace

DECEMBER 8, 2022

As cloud-native, distributed architectures proliferate, the need for DevOps technologies and DevOps platform engineers has increased as well. DevOps engineer tools can help ease the pressure as environment complexity grows. ” What does a DevOps platform engineer do? .” What are DevOps engineer tools and platforms.

DevOps

DevOps Engineering Testing Open Source

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dynatrace

NOVEMBER 7, 2023

Platform engineering is on the rise. According to leading analyst firm Gartner, “80% of software engineering organizations will establish platform teams as internal providers of reusable services, components, and tools for application delivery…” by 2026.

Engineering

Engineering DevOps Best Practices Infrastructure

Dare to debug production with Dynatrace Live Debugger

Dynatrace

FEBRUARY 4, 2025

A production bug is the worst; besides impacting customer experience, you need special access privileges, making the process far more time-consuming. It also makes the process risky as production servers might be more exposed, leading to the need for real-time production data. This cumbersome process should not be the norm.

Open Source

Open Source Code Engineering Best Practices

Platform engineering: Empowering key Kubernetes use cases with Dynatrace

Dynatrace

OCTOBER 30, 2023

Today, speed and DevOps automation are critical to innovating faster, and platform engineering has emerged as an answer to some of the most significant challenges DevOps teams are facing. It needs to be engineered properly as a product or service, and it needs automation, observability, and security in itself.”

Engineering

Engineering DevOps Innovation Storage

Accelerate and empower Site Reliability Engineering with Dynatrace observability

Dynatrace

OCTOBER 10, 2023

Planned effort Site Reliability Engineering (SRE) effort and time allocation planning typically fall into two domains: Operations Management (50%) Operations Management includes on-call responsibilities, post-mortem assessments, addressing other interruptions, and buffer time. Streamlining the CI/CD process to ensure optimal efficiency.

Engineering

Engineering DevOps Innovation Strategy

The platform engineer role: A game-changer or just hype?

Dynatrace

SEPTEMBER 21, 2023

Site reliability engineering first emerged to address cloud computing’s new performance needs. Today, the platform engineer role is gaining speed as the newest byproduct of scaling DevOps in the emerging but complex cloud-native world. Understanding the platform engineer role DevOps is a constantly evolving discipline.

Games

Games Engineering DevOps Education

Enhancing Kubernetes cluster management key to platform engineering success

Dynatrace

MARCH 29, 2024

Five of the most common include cluster instability, resource and cost management, security, observability, and stress on engineering teams. Engineering teams are overwhelmed with stuff to do.” The post Enhancing Kubernetes cluster management key to platform engineering success appeared first on Dynatrace news.

Engineering

Engineering DevOps Operating System Cloud

How observability, application security, and AI enhance DevOps and platform engineering maturity

Dynatrace

APRIL 18, 2024

DevOps and platform engineering are essential disciplines that provide immense value in the realm of cloud-native technology and software delivery. Observability of applications and infrastructure serves as a critical foundation for DevOps and platform engineering, offering a comprehensive view into system performance and behavior.

DevOps

DevOps Engineering Artificial Intelligence Infrastructure

Rebuilding Netflix Video Processing Pipeline with Microservices

The Netflix TechBlog

JANUARY 10, 2024

Future blogs will provide deeper dives into each service, sharing insights and lessons learned from this process. The Netflix video processing pipeline went live with the launch of our streaming service in 2007. The Netflix video processing pipeline went live with the launch of our streaming service in 2007.

Processing

Processing Media Latency Innovation

1. Streamlining Membership Data Engineering at Netflix with Psyberg

The Netflix TechBlog

NOVEMBER 14, 2023

By Abhinaya Shetty , Bharath Mummadisetty At Netflix, our Membership and Finance Data Engineering team harnesses diverse data related to plans, pricing, membership life cycle, and revenue to fuel analytics, power various dashboards, and make data-informed decisions. Let’s dive in! What is late-arriving data? Some techniques we used were: 1.

Data Engineering

Data Engineering Engineering Processing Games

Low-Maintenance Backend Architectures for Scalable Applications

DZone

JANUARY 10, 2025

After years of working in the intricate world of software engineering, I learned that the most beautiful solutions are often those unseen: backends that hum along, scaling with grace and requiring very little attention. Developers could understand and manage the entire systems intricacies.

Architecture

Architecture Scalability Software Engineering Cloud

Automating Success: Building a better developer experience with platform engineering

Dynatrace

FEBRUARY 12, 2024

When it comes to platform engineering, not only does observability play a vital role in the success of organizations’ transformation journeys—it’s key to successful platform engineering initiatives. The various presenters in this session aligned platform engineering use cases with the software development lifecycle.

Engineering

Engineering Development Infrastructure Cloud

How platform engineering and IDP observability can accelerate developer velocity

Dynatrace

MARCH 6, 2024

As organizations look to expand DevOps maturity, improve operational efficiency, and increase developer velocity, they are embracing platform engineering as a key driver. Platform engineering: Build for self-service Self-service deployment is a key attribute of platform engineering. “It makes them more productive.

Engineering

Engineering Development DevOps Infrastructure

Incremental Processing using Netflix Maestro and Apache Iceberg

The Netflix TechBlog

NOVEMBER 20, 2023

by Jun He , Yingyi Zhang , and Pawan Dixit Incremental processing is an approach to process new or changed data in workflows. The key advantage is that it only incrementally processes data that are newly added or updated to a dataset, instead of re-processing the complete dataset.

Processing

Processing Big Data Efficiency Engineering

Dynatrace Observability for Developers saves time with real-time data

Dynatrace

FEBRUARY 4, 2025

Enterprise adoption with self-service: To facilitate enterprise adoption while minimizing tool sprawl and data silos, Dynatrace allows observability teams and platform engineers to implement a self-service model for developers. When a large-scale incident occurs, Davis AI identifies the root cause and connects all relevant log lines.

Development

Development Analytics Code Architecture

2. Diving Deeper into Psyberg: Stateless vs Stateful Data Processing

The Netflix TechBlog

NOVEMBER 14, 2023

By Abhinaya Shetty , Bharath Mummadisetty In the inaugural blog post of this series, we introduced you to the state of our pipelines before Psyberg and the challenges with incremental processing that led us to create the Psyberg framework within Netflix’s Membership and Finance data engineering team.

Processing

Processing Data Engineering Efficiency Analytics

The state of site reliability engineering: SRE challenges and best practices in 2023

Dynatrace

NOVEMBER 14, 2023

Site reliability engineering (SRE) has become increasingly important to organizations looking to keep up with the rapid pace of digital transformation. Effective site reliability engineering requires enterprise-wide transformation Without a unified understanding of SRE practices, organizational silos can quickly form between departments.

Best Practices

Best Practices Engineering DevOps Software Engineering

Leverage logs for an end-to-end view of your business processes via Dynatrace OpenPipeline

Dynatrace

SEPTEMBER 27, 2024

Unrealized optimization potential of business processes due to monitoring gaps Imagine a retail company facing gaps in its business process monitoring due to disparate data sources. Due to separated systems that handle different parts of the process, the view of the process is fragmented.

Processing

Processing Retail Analytics Monitoring

How automated workflows and multicloud automation can reduce engineering toil

Dynatrace

JUNE 5, 2023

Multicloud automation challenge: Manual processes don’t scale Manual processes pose multiple problems for organizations looking for increased application performance and efficiency. First, manual processes are naturally error prone because they rely on humans to input, review, and confirm data.

Engineering

Engineering Speed Monitoring Efficiency

A Recap of the Data Engineering Open Forum at Netflix

The Netflix TechBlog

JUNE 20, 2024

A summary of sessions at the first Data Engineering Open Forum at Netflix on April 18th, 2024 The Data Engineering Open Forum at Netflix on April 18th, 2024. At Netflix, we aspire to entertain the world, and our data engineering teams play a crucial role in this mission by enabling data-driven decision-making at scale.

Data Engineering

Data Engineering Engineering Entertainment Software Engineering

Observability is expanding: Transforming complexity into business opportunity

Dynatrace

MARCH 5, 2025

From developers leveraging platform engineering tools to optimize application performance, to Site Reliability Engineers (SREs) ensuring resilience, and executives gaining critical business insights, observability increases the velocity of innovation across every level of an organization.

Innovation

Innovation Speed Efficiency Engineering

Catching up with OpenTelemetry in 2025

Dynatrace

FEBRUARY 27, 2025

To get a better idea of OpenTelemetry trends in 2025 and how to get the most out of it in your observability strategy, some of our Dynatrace open-source engineers and advocates picked out the innovations they find most interesting. Because its constantly evolving, staying up to date with the latest in OpenTelemetry is no small feat.

Tuning

Tuning Open Source Innovation Monitoring

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

For busy site reliability engineers, ensuring system reliability, scalability, and overall health is an imperative that’s getting harder to achieve in ever-expanding, cloud-native, container-based environments. Because of its adaptability, Prometheus has become an essential tool for observability engineering.

Metrics

Metrics Engineering Energy Tuning

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

As a result, requests are uniformly handled, and responses are processed cohesively. This standardization enhances adoption within the personalization stack, simplifies the system, and improves understanding and debuggability for engineers. The request schema for the observability endpoint.

Traffic

Traffic Strategy Entertainment Innovation

5 powerful use cases beyond debugging for Dynatrace Live Debugger

Dynatrace

MARCH 25, 2025

This powerful tool can be leveraged across various environments, including production, to enhance development processes and ensure robust application performance. Following are some of the coolest things weve seen engineers do with Live Debugger. Live snapshot includes variables, process, stack trace, and tracing information.

Benchmarking

Benchmarking Code Open Source Engineering

Build resilient IT systems and manage regulatory requirements with compliance and resilience capabilities from Dynatrace

Dynatrace

JANUARY 15, 2025

Smartscape topology visualizes the relationships between applications, services, processes, hosts, and data centers, highlighting problems and vulnerabilities. This enables DevOps platform engineers to make the right release decisions for new versions and empowers SREs to apply Service-Level Objectives (SLOs) for their critical services.

Systems

Systems DevOps Monitoring Analytics

Title Launch Observability at Netflix Scale

The Netflix TechBlog

JANUARY 6, 2025

This process involves: Identifying Stakeholders: Determine who is impacted by the issue and whose input is crucial for a successful resolution. In this context, were focused on developing systems that ensure successful title launches, build trust between content creators and our brand, and reduce engineering operational overhead.

Scalability

Scalability Engineering Cache Systems

Reliability indicators that matter to your business: SLOs for all data types

Dynatrace

OCTOBER 31, 2024

Hence, having a dedicated dashboard tile visualizing the key parameters of each SLO simplifies the process of evaluating them. Additionally, the Dynatrace Automation Engine will leverage SLO alerts to create event-triggered workflows to inform relevant stakeholders, provide reports, or automatically kick off remediation activities.

Metrics

Metrics Availability Monitoring Scalability

OpenPipeline: Simplify access to critical business data

Dynatrace

NOVEMBER 4, 2024

Dynatrace OpenPipeline is a new stream processing technology that ingests and contextualizes data from any source. Business process monitoring and optimization. All of these steps are critical components of the process, likely to be implemented using different systems. Business event ingestion and analysis with log files.

Analytics

Analytics Airlines Metrics Monitoring

How to Scale Elasticsearch to Solve Your Scalability Issues

DZone

FEBRUARY 26, 2025

With the evolution of modern applications serving increasing needs for real-time data processing and retrieval, scalability does, too. One such open-source, distributed search and analytics engine is Elasticsearch, which is very efficient at handling data in large sets and high-velocity queries.

Scalability

Scalability Open Source Latency Architecture

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

Every image you hover over isnt just a visual placeholder; its a critical data point that fuels our sophisticated personalization engine. It requires a state-of-the-art system that can track and process these impressions while maintaining a detailed history of each profiles exposure.

Tuning

Tuning Latency Efficiency Storage

Part 2: A Survey of Analytics Engineering Work at Netflix

Sustainability: Thoughts from a software engineer

Trending Sources

What Is Platform Engineering?

Close Site Search Indexing via Kubernetes HAProxy Ingress

Breaking AWS Lambda: Chaos Engineering for Serverless Devs

What is platform engineering?

Financial Data Engineering in SAS

Our First Netflix Data Engineering Summit

Life of a Netflix Partner Engineer?—?The case of extra 40 ms

Dynatrace KSPM: Transforming Kubernetes security and compliance

Why applying chaos engineering to data-intensive applications matters

Build systems more reliably with Dynatrace: Chaos Engineering

A Kubernetes platform engineering strategy tames Kubernetes complexity

DevOps engineer tools: Deploy, test, evaluate, repeat

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dare to debug production with Dynatrace Live Debugger

Platform engineering: Empowering key Kubernetes use cases with Dynatrace

Accelerate and empower Site Reliability Engineering with Dynatrace observability

The platform engineer role: A game-changer or just hype?

Enhancing Kubernetes cluster management key to platform engineering success

How observability, application security, and AI enhance DevOps and platform engineering maturity

Rebuilding Netflix Video Processing Pipeline with Microservices

1. Streamlining Membership Data Engineering at Netflix with Psyberg

Low-Maintenance Backend Architectures for Scalable Applications

Automating Success: Building a better developer experience with platform engineering

How platform engineering and IDP observability can accelerate developer velocity

Incremental Processing using Netflix Maestro and Apache Iceberg

Dynatrace Observability for Developers saves time with real-time data

2. Diving Deeper into Psyberg: Stateless vs Stateful Data Processing

The state of site reliability engineering: SRE challenges and best practices in 2023

Leverage logs for an end-to-end view of your business processes via Dynatrace OpenPipeline

How automated workflows and multicloud automation can reduce engineering toil

A Recap of the Data Engineering Open Forum at Netflix

Observability is expanding: Transforming complexity into business opportunity

Catching up with OpenTelemetry in 2025

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Title Launch Observability at Netflix Scale

5 powerful use cases beyond debugging for Dynatrace Live Debugger

Build resilient IT systems and manage regulatory requirements with compliance and resilience capabilities from Dynatrace

Title Launch Observability at Netflix Scale

Reliability indicators that matter to your business: SLOs for all data types

OpenPipeline: Simplify access to critical business data

How to Scale Elasticsearch to Solve Your Scalability Issues

Introducing Impressions at Netflix

Stay Connected