Engineering and Network - Technology Performance Pulse

Part 1: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

DECEMBER 17, 2024

This article is the first in a multi-part series sharing a breadth of Analytics Engineering work at Netflix, recently presented as part of our annual internal Analytics Engineering conference. Subsequent posts will detail examples of exciting analytic engineering domain applications and aspects of the technical craft.

Analytics

Analytics Engineering Entertainment Metrics

Chaos Engineering With Litmus: A CNCF Incubating Project

DZone

FEBRUARY 6, 2025

Degraded service availability and the resulting revenue impact stem mainly from Kubernetes pod crashes, resource exhaustion, and network disruptions that hit during peak shopping seasons.

Engineering

Engineering Traffic Architecture Network

How Netflix uses eBPF flow logs at scale for network insight

The Netflix TechBlog

JUNE 7, 2021

By Alok Tiagi, Hariharan Ananthakrishnan, Ivan Porto Carrero and Keerti Lakshminarayan. Netflix has developed a network observability sidecar called Flow Exporter that uses eBPF tracepoints to capture TCP flows at near real time. Without having network visibility, it’s difficult to improve our reliability, security and capacity posture.

Network

Network Transportation AWS Cloud

For your eyes only: improving Netflix video quality with neural networks

The Netflix TechBlog

NOVEMBER 17, 2022

Recently, we added another powerful tool to our arsenal: neural networks for video downscaling. In this tech blog, we describe how we improved Netflix video quality with neural networks, the challenges we faced and what lies ahead. How can neural networks fit into Netflix video encoding?

Network

Network Media Innovation Efficiency

How Kubernetes Changed the Networking Model and What Developers Should Know about eBPF and Cilium

DZone

JULY 30, 2024

Enterprise networking is a radically different discipline in today’s microservices, containers, and Kubernetes paradigm than what it used to be in the old three-tier architecture world. Q: How Did Kubernetes Change the Networking Model? A: In many ways, Kubernetes networking is similar to our traditional networking.

Network

Network Open Source Virtualization Development

Measuring Network Performance in Mobile Safari

CSS Wizardry

FEBRUARY 25, 2021

Google has a pretty tight grip on the tech industry: it makes by far the most popular browser with the best DevTools, and the most popular search engine, which means that web developers spend most of their time in Chrome, most of their visitors are in Chrome, and a lot of their search traffic will be coming from Google.

Network

Network Mobile Performance Traffic

JMeter Netconf Plug-in and Network Service Automation

DZone

JUNE 30, 2020

Network service automation-related requirements are usually realized by means of a commercial or open-source network orchestrator or controller software system. The JMeter Netconf plug-in implementation includes two modules.

Network

Network Open Source Engineering Software

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

These releases often assumed ideal conditions such as zero latency, infinite bandwidth, and no network loss, as highlighted in Peter Deutsch’s eight fallacies of distributed systems. Chaos engineering is a practice that extends beyond traditional failure testing by identifying unpredictable issues.

Engineering

Engineering Systems Latency Metrics

AI-powered DNS request tracking extends infrastructure observability for high quality network traffic

Dynatrace

OCTOBER 1, 2020

With all the data collected and powered by our Davis AI-driven causation engine, Dynatrace automatically identifies slowdowns in your applications and services and points you to their root cause. Ensure high quality network traffic by tracking DNS requests out-of-the-box. Network services visibility (DNS, NTP, ActiveDirectory).

Traffic

Traffic Network Infrastructure Artificial Intelligence

Transform data into insights with Dynatrace Dashboards and Notebooks

Dynatrace

OCTOBER 16, 2024

These ready-made dashboards offer your platform engineers, who oversee Kubernetes environments, immediate and comprehensive data visibility. This allows platform engineers to focus on high-value tasks like resolving issues and optimizing performance rather than spending time on data discovery and exploration.

Social Media

Social Media Metrics Network Analytics

Embracing Resilience: The Power of Chaos Engineering

DZone

OCTOBER 20, 2023

But chaos engineering stands out for its exceptional capacity to identify weaknesses and proactively fortify systems. Businesses rely heavily on intricate systems and networks to run effectively in today's technology-driven world.

Engineering

Engineering Strategy Network Technology

New SNMP platform extensions provide observability at scale for network devices

Dynatrace

NOVEMBER 24, 2021

Constantly monitoring infrastructure health state and making ongoing optimizations are essential for Ops teams, SREs (site-reliability engineers), and IT admins. Quick and easy network infrastructure monitoring. Begin network monitoring by simply deploying an extension with just a few clicks. Start monitoring in minutes.

Network

Network Infrastructure Virtualization Metrics

DevOps engineer tools: Deploy, test, evaluate, repeat

Dynatrace

DECEMBER 8, 2022

As cloud-native, distributed architectures proliferate, the need for DevOps technologies and DevOps platform engineers has increased as well. DevOps engineer tools can help ease the pressure as environment complexity grows. What does a DevOps platform engineer do? What are DevOps engineer tools and platforms?

DevOps

DevOps Engineering Testing Open Source

Network performance monitoring top of mind for CloudOps teams

Dynatrace

MAY 19, 2023

For cloud operations teams, network performance monitoring is central in ensuring application and infrastructure performance. If the network is sluggish, an application may also be slow, frustrating users. Worse, a malicious attacker may gain access to the network, compromising sensitive application data.

Network

Network Monitoring Performance Traffic

Engineering a Studio Quality Experience With High-Quality Audio at Netflix

The Netflix TechBlog

MAY 1, 2019

Our engineering team and Creative Technologies sound expert joined forces to quickly solve the issue, but a larger conversation about higher quality audio continued. This approach selects the audio bitrate based on network conditions at the start of playback. channel stream, as well as audible degradation of high frequencies.

Engineering

Engineering Network Media Entertainment

Building Resilience With Chaos Engineering and Litmus

DZone

JUNE 15, 2023

Various factors, such as network communication, inter-service dependencies, external dependencies, and scalability issues, can contribute to outages. The scalability, agility, and continuous delivery offered by microservices architecture make it a popular option for businesses today.

Engineering

Engineering Architecture Scalability Google

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Dynatrace

JULY 15, 2024

Why browser and HTTP monitors might not be sufficient: In modern IT environments, which are complex and dynamically changing, you often need deeper insights into the Transport or Network layers. Is it a bug in the codebase, a malfunctioning backend service, an overloaded hosting infrastructure, or perhaps a misconfigured network?

Availability

Availability Network Monitoring Infrastructure

Accelerate resolution of network issues with AI-powered event reporting based on SNMP traps

Dynatrace

NOVEMBER 30, 2022

Monitoring modern IT infrastructure is difficult, sometimes impossible, without advanced network monitoring tools. While the market is saturated with many Network Administrator support solutions, Dynatrace can help you analyze the impact on your organization in an automated manner. Sample SNMP-enabled device configuration. What’s next?

Network

Network Infrastructure Metrics Monitoring

Native App Network Performance Analysis

DZone

APRIL 7, 2021

When 54 percent of the internet traffic share is accounted for by mobile, it's certainly nontrivial to acknowledge how your app can make a difference to that of the competitor!

Network

Network Performance Cache Internet

1. Streamlining Membership Data Engineering at Netflix with Psyberg

The Netflix TechBlog

NOVEMBER 14, 2023

By Abhinaya Shetty, Bharath Mummadisetty. At Netflix, our Membership and Finance Data Engineering team harnesses diverse data related to plans, pricing, membership life cycle, and revenue to fuel analytics, power various dashboards, and make data-informed decisions. Our audits would detect this and alert the on-call data engineer (DE).

Data Engineering

Data Engineering Engineering Processing Games

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

Dynatrace

APRIL 10, 2025

Business-focused, unified platform approach: A unified platform approach enables platform engineering and self-service portals, simplifying operations and reducing costs. Davis, the causal AI engine, instantly identifies root causes and predicts service degradation before it impacts users.

Strategy

Strategy Storage Network Architecture

How Netflix Accurately Attributes eBPF Flow Logs

The Netflix TechBlog

APRIL 8, 2025

By Cheng Xie, Bryan Shultz, and Christine Xu. In a previous blog post, we described how Netflix uses eBPF to capture TCP flow logs at scale for enhanced network insights. Conclusion: With misattribution solved, eBPF flow logs now deliver dependable, fleet-wide insights into Netflix's service topology and network health.

AWS

AWS Traffic Network Programming

HTTP monitors on the latest Dynatrace platform extend insights into the health of your API endpoints and simplify test management

Dynatrace

DECEMBER 18, 2024

It now fully supports not only Network Availability Monitors but also HTTP synthetic monitors. Thanks to the power of Grail, those details are available for all executions stored for the entire retention period during which synthetic results are kept. The new Dynatrace Synthetic app allows you to analyze these results.

Monitoring

Monitoring Testing Metrics Analytics

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

For busy site reliability engineers, ensuring system reliability, scalability, and overall health is an imperative that’s getting harder to achieve in ever-expanding, cloud-native, container-based environments. Because of its adaptability, Prometheus has become an essential tool for observability engineering. Jolly good!

Metrics

Metrics Engineering Energy Tuning

How to Scale Elasticsearch to Solve Your Scalability Issues

DZone

FEBRUARY 26, 2025

One such open-source, distributed search and analytics engine is Elasticsearch, which is very efficient at handling data in large sets and high-velocity queries. This extra network overhead will easily result in increased latency compared to a single-node architecture where data access is straightforward.

Scalability

Scalability Open Source Latency Architecture

The keys to selecting a platform for end-to-end observability

Dynatrace

DECEMBER 2, 2024

Such fragmented approaches fall short of giving teams the insights they need to run IT and site reliability engineering operations effectively. Get to the root cause of issues: Most AI today uses machine learning models like neural networks that find correlations and make predictions based on them.

Artificial Intelligence

Artificial Intelligence DevOps Architecture Cloud

A BGP Guide for the Non-Network Engineer

DZone

AUGUST 10, 2022

BGP provides network stability as it guarantees routers can rapidly adapt to send packets via a different connection if one Internet pathway goes down. The original function of BGP was to carry internet reachability information between edge routers (it is sometimes described as a reachability protocol).

Network

Network Engineering Internet

Growth Engineering at Netflix - Automated Imagery Generation

The Netflix TechBlog

FEBRUARY 9, 2021

In the Growth Engineering team, we refer to this as the top of the signup funnel. For more background on the signup funnel and Growth Engineering’s role in the signup funnel, please read our initial post on the topic.

Engineering

Engineering Storage Latency Entertainment

Leveraging Infrastructure as Code for Data Engineering Projects: A Comprehensive Guide

DZone

JULY 3, 2023

Data engineering projects often require the setup and management of complex infrastructures that support data processing, storage, and analysis. In this article, we will explore the benefits of leveraging IaC for data engineering projects and provide detailed implementation steps to get started.

Data Engineering

Data Engineering Infrastructure Engineering Code

It’s time to migrate from NAM to Dynatrace

Dynatrace

APRIL 2, 2020

For two decades, Dynatrace NAM (Network Application Monitoring, formerly known as DC RUM) has been successfully monitoring the user experience of our customers’ enterprise applications. SNMP managed the costs of network links well, but not the sources of those costs. Performance has always mattered.

Network

Network Traffic Monitoring Java

Designing Instagram

High Scalability

JANUARY 11, 2022

This is a guest post by Ankit Sirmorya. The author is a Machine Learning Engineer at Amazon and has led several machine-learning initiatives across the Amazon ecosystem. FUN FACT: In this talk, Rodrigo Schmidt, director of engineering at Instagram, talks about the different challenges they have faced in scaling the data infrastructure at Instagram.

Design

Design Media Storage Logistics

Engineering dependability and fault tolerance in a distributed system

High Scalability

FEBRUARY 19, 2021

This means a system that is not merely available but is also engineered with extensive redundant measures to continue to work as its users expect. reliability situations, where continuity of service is essential, with redundant elements continuously in-service, such as with airplane engines. This ensures reliability.

Engineering

Engineering Systems Availability Scalability

Python at Netflix

The Netflix TechBlog

APRIL 29, 2019

Open Connect is Netflix’s content delivery network (CDN). video streaming) takes place in the Open Connect network. The network devices that underlie a large portion of the CDN are mostly managed by Python applications. If any of this interests you, check out the jobs site or find us at PyCon.

Open Source

Open Source Network Infrastructure Big Data

How to Prepare for Your DevOps Interview

DZone

SEPTEMBER 5, 2019

For system administrators, operations engineers, and others with strong systems and software backgrounds, there’s perhaps no better time than the present to transition into DevOps. Interviews can range from standard software engineer coding questions to questions on system design, Linux debugging, and DevOps tools.

DevOps

DevOps Software Engineering Infrastructure Engineering

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Dynatrace

JANUARY 21, 2025

For example, if you’re monitoring network traffic and the average over the past 7 days is 500 Mbps, the threshold will adapt to this baseline. An anomaly will be identified if traffic suddenly drops below 200 Mbps or rises above 800 Mbps, helping you identify unusual spikes or drops.
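The baseline-and-band idea in this excerpt can be illustrated with a minimal sketch. This is not Dynatrace's algorithm: the function names, the sample history, and the fixed 300 Mbps allowed deviation are all assumptions chosen only to reproduce the 500 / 200 / 800 Mbps figures quoted above.

```python
from statistics import mean


def adaptive_band(history_mbps, max_deviation_mbps=300):
    """Return (baseline, low, high) computed from recent traffic history.

    max_deviation_mbps is a hypothetical tolerance, not a Dynatrace setting.
    """
    baseline = mean(history_mbps)
    return baseline, baseline - max_deviation_mbps, baseline + max_deviation_mbps


def is_anomaly(value_mbps, low, high):
    """Flag a reading that falls outside the [low, high] band."""
    return value_mbps < low or value_mbps > high


# Seven made-up daily averages whose mean is exactly 500 Mbps.
history = [480, 510, 495, 520, 505, 490, 500]
baseline, low, high = adaptive_band(history)

for reading in (150, 500, 850):
    print(reading, "anomaly" if is_anomaly(reading, low, high) else "normal")
```

Because the band is recomputed from the history, a sustained shift in traffic moves the baseline and the thresholds with it, which is the behavior the excerpt describes as adapting to the baseline.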

Traffic

Traffic Metrics Analytics Monitoring

How to Configure Istio, Prometheus and Grafana for Monitoring

DZone

AUGUST 29, 2023

Intro to Istio Observability Using Prometheus: Istio service mesh abstracts the network from the application layers using sidecar proxies. You can implement security and advanced networking policies to all the communication across your infrastructure using Istio. But another important feature of Istio is observability.

Monitoring

Monitoring Latency Network Infrastructure

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Dynatrace

FEBRUARY 6, 2025

Host Monitoring dashboards offer real-time visibility into the health and performance of servers and network infrastructure, enabling proactive issue detection and resolution. This information is crucial for identifying network issues, troubleshooting connectivity problems, and ensuring reliable domain name resolution.

Metrics

Metrics Infrastructure Monitoring Best Practices

Simplified observability for your SNMP devices

Dynatrace

MARCH 22, 2021

As a Network Engineer, you need to ensure the operational functionality, availability, efficiency, backup/recovery, and security of your company’s network. Exceeded throughput levels can be a sign that some changes to the network configuration might be required. Synthetic network monitoring. Events and alerts.

Metrics

Metrics Network Infrastructure Traffic

Platform Engineering Teams Done Right…

Adrian Cockcroft

FEBRUARY 9, 2023

There are three current underlying reasons for the platform engineering meme today. The virtualization and networking platform could be datacenter based, with something like VMware, or cloud based using one of the cloud providers such as AWS EC2. The second is that some companies with tools to sell are marketing the term.

Engineering

Engineering Serverless Lambda AWS

Dynatrace adds support for AWS Transit Gateway with VPC Flow Logs

Dynatrace

JULY 25, 2022

This new service enhances the user visibility of network details with direct delivery of Flow Logs for Transit Gateway to your desired endpoint via Amazon Simple Storage Service (S3) bucket or Amazon CloudWatch Logs. AWS Transit Gateway is a service offering from Amazon Web Services that connects network resources via a centralized hub.

AWS

AWS Transportation Network Traffic

Kubernetes vs Docker: What’s the difference?

Dynatrace

SEPTEMBER 29, 2021

Running containers: Docker Engine is a container runtime that runs in almost any environment: Mac and Windows PCs, Linux and Windows servers, the cloud, and on edge devices. Docker Engine is built on top of containerd, the leading open-source container runtime, a project of the Cloud Native Computing Foundation (CNCF). Networking.

Open Source

Open Source Traffic DevOps Cloud

Optimized shot-based encodes for 4K: Now streaming!

The Netflix TechBlog

AUGUST 28, 2020

As the number of 4K titles in our catalog continues to grow and more devices support the premium features, we expect these video streams to have an increasing impact on our members and the network. The fixed-bitrate ladder starts at 560 kbps which may be too high for some cellular networks. shot-optimized encoding and 4K VMAF model - and

Network

Network Storage Innovation Mobile

What is container orchestration?

Dynatrace

MARCH 24, 2023

But managing the deployment, modification, networking, and scaling of multiple containers can quickly outstrip the capabilities of development and operations teams. This orchestration includes provisioning, scheduling, networking, ensuring availability, and monitoring container lifecycles. How does container orchestration work?

Infrastructure

Infrastructure Open Source Operating System Cloud

Scale your enterprise cloud environment with enhanced AI-powered observability of all AWS services

Dynatrace

AUGUST 27, 2020

The latest batch of services cover databases, networks, machine learning and computing. Each service comes with zero-configuration, automatic instance detection, continuous data capture in context, and what’s most important – thanks to our AI engine Davis – is each service provides answers, not just data.

AWS

AWS Cloud IoT Database
