Engineering, Infrastructure and Network - Technology Performance Pulse

Part 1: A Survey of Analytics Engineering Work at Netflix

The Netflix TechBlog

DECEMBER 17, 2024

This article is the first in a multi-part series sharing a breadth of Analytics Engineering work at Netflix, recently presented as part of our annual internal Analytics Engineering conference. Subsequent posts will detail examples of exciting analytic engineering domain applications and aspects of the technical craft.

Analytics

Analytics Engineering Entertainment Metrics

AI-powered DNS request tracking extends infrastructure observability for high quality network traffic

Dynatrace

OCTOBER 1, 2020

The Dynatrace Software Intelligence Platform gives you a complete Infrastructure Monitoring solution for the monitoring of cloud platforms and virtual infrastructure, along with log monitoring and AIOps. Ensure high quality network traffic by tracking DNS requests out-of-the-box. Average query response time.

Traffic

Traffic Network Infrastructure Artificial Intelligence

How Netflix uses eBPF flow logs at scale for network insight

The Netflix TechBlog

JUNE 7, 2021

By Alok Tiagi , Hariharan Ananthakrishnan , Ivan Porto Carrero and Keerti Lakshminarayan Netflix has developed a network observability sidecar called Flow Exporter that uses eBPF tracepoints to capture TCP flows at near real time. Without having network visibility, it’s difficult to improve our reliability, security and capacity posture.

Network

Network Transportation AWS Cloud

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

Dynatrace

JULY 15, 2024

Why browser and HTTP monitors might not be sufficient In modern IT environments, which are complex and dynamically changing, you often need deeper insights into the Transport or Network layers. Is it a bug in the codebase, a malfunctioning backend service, an overloaded hosting infrastructure, or perhaps a misconfigured network?

Availability

Availability Network Monitoring Infrastructure

For your eyes only: improving Netflix video quality with neural networks

The Netflix TechBlog

NOVEMBER 17, 2022

Recently, we added another powerful tool to our arsenal: neural networks for video downscaling. In this tech blog, we describe how we improved Netflix video quality with neural networks, the challenges we faced and what lies ahead. How can neural networks fit into Netflix video encoding?

Network

Network Media Innovation Serverless

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

OCTOBER 19, 2020

a Netflix member via Twitter This is an example of a question our on-call engineers need to answer to help resolve a member issue?—?which Now let’s look at how we designed the tracing infrastructure that powers Edgar. We needed to increase engineering productivity via distributed request tracing.

Infrastructure

Infrastructure Transportation Storage Open Source

New SNMP platform extensions provide observability at scale for network devices

Dynatrace

NOVEMBER 24, 2021

The success of an organization often depends on the quality of the on-premises or physical IT infrastructure, among other things. Constantly monitoring infrastructure health state and making ongoing optimizations are essential for Ops teams, SREs (site-reliability engineers), and IT admins. Start monitoring in minutes.

Network

Network Infrastructure Virtualization Metrics

Measuring Network Performance in Mobile Safari

CSS Wizardry

FEBRUARY 25, 2021

Google has a pretty tight grip on the tech industry: it makes by far the most popular browser with the best DevTools, and the most popular search engine, which means that web developers spend most of their time in Chrome, most of their visitors are in Chrome, and a lot of their search traffic will be coming from Google. somewhere sensible.

Network

Network Mobile Performance Traffic

Infrastructure Monitoring tools: 3 steps to evolve ITOps into AIOps

Dynatrace

JUNE 28, 2021

Infrastructure monitoring is the process of collecting critical data about your IT environment, including information about availability, performance and resource efficiency. Many organizations respond by adding a proliferation of infrastructure monitoring tools, which in many cases, just adds to the noise. Dynatrace news.

Infrastructure

Infrastructure Monitoring Artificial Intelligence Open Source

Build systems more reliably with Dynatrace: Chaos Engineering

Dynatrace

AUGUST 21, 2024

These releases often assumed ideal conditions such as zero latency, infinite bandwidth, and no network loss, as highlighted in Peter Deutsch’s eight fallacies of distributed systems. Chaos engineering is a practice that extends beyond traditional failure testing by identifying unpredictable issues.

Engineering

Engineering Systems Latency Metrics

Network performance monitoring top of mind for CloudOps teams

Dynatrace

MAY 19, 2023

For cloud operations teams, network performance monitoring is central in ensuring application and infrastructure performance. If the network is sluggish, an application may also be slow, frustrating users. Worse, a malicious attacker may gain access to the network, compromising sensitive application data.

Network

Network Monitoring Performance Traffic

The keys to selecting a platform for end-to-end observability

Dynatrace

DECEMBER 2, 2024

On average, organizations use 10 different tools to monitor applications, infrastructure, and user experiences across these environments. Such fragmented approaches fall short of giving teams the insights they need to run IT and site reliability engineering operations effectively.

Artificial Intelligence

Artificial Intelligence DevOps Architecture Cloud

Building Resilience With Chaos Engineering and Litmus

DZone

JUNE 15, 2023

Various factors, such as network communication, inter-service dependencies, external dependencies, and scalability issues, can contribute to outages. The scalability, agility, and continuous delivery offered by microservices architecture make it a popular option for businesses today.

Engineering

Engineering Architecture Scalability Google

Accelerate resolution of network issues with AI-powered event reporting based on SNMP traps

Dynatrace

NOVEMBER 30, 2022

Complexity and data volume for IT infrastructure soars to new heights. The volume of data and events grows in tandem with the rising complexity of IT infrastructure. Monitoring modern IT infrastructure is difficult, sometimes impossible, without advanced network monitoring tools.

Network

Network Infrastructure Metrics Monitoring

Vulnerability assessment: key to protecting applications and infrastructure

Dynatrace

OCTOBER 13, 2021

Protecting IT infrastructure, applications, and data requires that you understand security weaknesses attackers can exploit. Examples of such weaknesses are errors in application code, misconfigured network devices, and overly permissive access controls in a database. NMAP is an example of a well-known open-source network scanner.

Infrastructure

Infrastructure Open Source Virtualization Operating System

Power Dashboarding, Part I: Start your exploration journey with Dashboards

Dynatrace

FEBRUARY 6, 2025

With Dashboards , you can monitor business performance, user interactions, security vulnerabilities, IT infrastructure health, and so much more, all in real time. Even if infrastructure metrics aren’t your thing, you’re welcome to join us on this creative journey simply swap out the suggested metrics for ones that interest you.

Metrics

Metrics Infrastructure Monitoring Best Practices

DevOps engineer tools: Deploy, test, evaluate, repeat

Dynatrace

DECEMBER 8, 2022

As cloud-native, distributed architectures proliferate, the need for DevOps technologies and DevOps platform engineers has increased as well. DevOps engineer tools can help ease the pressure as environment complexity grows. ” What does a DevOps platform engineer do? .” What are DevOps engineer tools and platforms.

DevOps

DevOps Engineering Testing Open Source

Infrastructure monitoring for enterprise cloud – 4 key requirements

Dynatrace

JANUARY 8, 2020

If you’re doing it right, cloud represents a fundamental change in how you build, deliver and operate your applications and infrastructure. And that includes infrastructure monitoring. This also implies a fundamental change to the role of infrastructure and operations teams. Able to provide answers, not just data.

Infrastructure

Infrastructure Cloud Monitoring Metrics

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

Dynatrace

APRIL 10, 2025

Business-focused, unified platform approach : A unified platform approach enables platform engineering and self-service portals, simplifying operations and reducing costs. Davis, the causal AI engine, instantly identifies root causes and predicts service degradation before it impacts users.

Strategy

Strategy Storage Network Architecture

Five-nines availability: Always-on infrastructure delivers system availability during the holidays’ peak loads

Dynatrace

NOVEMBER 22, 2022

Five-nines availability has long been the goal of site reliability engineers (SREs) to provide system availability that is “always on.” Site reliability engineering teams often measure system availability in percentages in the pursuit of 100% uptime. What is always-on infrastructure?

Infrastructure

Infrastructure Availability Systems Retail

AI-powered infrastructure monitoring for your SAP HANA database (Preview)

Dynatrace

DECEMBER 9, 2020

However, if you’re an operations engineer who’s been tasked with migrating to HANA from a legacy database system, you’ll need to get up to speed quickly. Enable the Davis AI causation engine to automatically analyze every metric. Enable the Davis AI causation engine to automatically analyze every metric.

Infrastructure

Infrastructure Database Monitoring Metrics

Hybrid cloud infrastructure explained: Weighing the pros, cons, and complexities

Dynatrace

JUNE 29, 2022

More than 90% of enterprises now rely on a hybrid cloud infrastructure to deliver innovative digital services and capture new markets. That’s because cloud platforms offer flexibility and extensibility for an organization’s existing infrastructure. Dynatrace news. With public clouds, multiple organizations share resources.

Infrastructure

Infrastructure Cloud Azure AWS

Who will watch the watchers? Extended infrastructure observability for WSO2 API Manager

Dynatrace

SEPTEMBER 18, 2020

Sure, cloud infrastructure requires comprehensive performance visibility, as Dynatrace provides , but the services that leverage cloud infrastructures also require close attention. Extend infrastructure observability to WSO2 API Manager. Cloud-based application architectures commonly leverage microservices. What’s next?

Infrastructure

Infrastructure Latency Metrics Cloud

Leveraging Infrastructure as Code for Data Engineering Projects: A Comprehensive Guide

DZone

JULY 3, 2023

Data engineering projects often require the setup and management of complex infrastructures that support data processing, storage, and analysis. In this article, we will explore the benefits of leveraging IaC for data engineering projects and provide detailed implementation steps to get started.

Data Engineering

Data Engineering Infrastructure Code Engineering

Extend the AI and automation core of Dynatrace with host extensions to resolve infrastructure problems

Dynatrace

MAY 13, 2020

OneAgent gives you all the operational and business performance metrics you need, from the front end to the back end and everything in between—cloud instances, hosts, network health, processes, and services. Let the Davis AI causation engine analyze additional metrics. Example 1: Gain visibility into your NVIDIA GPUs.

Infrastructure

Infrastructure Metrics Monitoring Software Engineering

How Data Inspires Building a Scalable, Resilient and Secure Cloud Infrastructure At Netflix

The Netflix TechBlog

MARCH 5, 2019

Netflix’s engineering culture is predicated on Freedom & Responsibility, the idea that everyone (and every team) at Netflix is entrusted with a core responsibility and they are free to operate with freedom to satisfy their mission. All these micro-services are currently operated in AWS cloud infrastructure.

Infrastructure

Infrastructure Cloud Scalability AWS

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Dynatrace

JANUARY 21, 2025

Ensuring smooth operations is no small feat, whether you’re in charge of application performance, IT infrastructure, or business processes. For example, if you’re monitoring network traffic and the average over the past 7 days is 500 Mbps, the threshold will adapt to this baseline.

Traffic

Traffic Metrics Analytics Monitoring

Uber Infrastructure in 2019: Improving Reliability, Driving Customer Satisfaction

Uber Engineering

DECEMBER 19, 2019

Every day around the world, millions of trips take place across the Uber network, giving users more reliable transportation through ridesharing, bikes, and scooters, drivers and truckers additional opportunities to earn, employees and employers more convenient business travel, and hungry … The post Uber Infrastructure in 2019: Improving Reliability, (..)

Infrastructure

Infrastructure Transportation Network Engineering

How to Prepare for Your DevOps Interview

DZone

SEPTEMBER 5, 2019

Over the past decade, DevOps has emerged as a new tech culture and career that marries the rapid iteration desired by software development with the rock-solid stability of the infrastructure operations team. As of August 2019, there are currently over 50,000 LinkedIn DevOps job listings in the United States alone.

DevOps

DevOps Software Engineering Infrastructure Engineering

Hyper Scale VPC Flow Logs enrichment to provide Network Insight

The Netflix TechBlog

MAY 26, 2020

Without having network visibility, it’s not possible to improve our reliability, security and capacity posture. Network Availability: The expected continued growth of our ecosystem makes it difficult to understand our network bottlenecks and potential limits we may be reaching. 43416 5001 52.213.180.42

Network

Network Tuning AWS Traffic

Power dashboarding part 2: Dynatrace dashboard tutorial to gain better, faster answers using AI and formatting

Dynatrace

MARCH 31, 2025

You can either continue with the custom infrastructure metrics dashboard you created in Part I or use the dashboard we prepared here (Dynatrace login required). In our Dynatrace Dashboard tutorial, we want to add a chart that shows the bytes in and out per host over time to enhance visibility into network traffic.

Metrics

Metrics Infrastructure Network Best Practices

Python at Netflix

The Netflix TechBlog

APRIL 29, 2019

Open Connect Open Connect is Netflix’s content delivery network (CDN). An easy, though imprecise, way of thinking about Netflix infrastructure is that everything that happens before you press Play on your remote control (e.g., video streaming) takes place in the Open Connect network. are you logged in? what plan do you have?

Open Source

Open Source Network Infrastructure Big Data

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

Dynatrace

DECEMBER 18, 2023

For busy site reliability engineers, ensuring system reliability, scalability, and overall health is an imperative that’s getting harder to achieve in ever-expanding, cloud-native, container-based environments. Because of its adaptability, Prometheus has become an essential tool for observability engineering. Jolly good!

Metrics

Metrics Engineering Energy Tuning

How to Configure Istio, Prometheus and Grafana for Monitoring

DZone

AUGUST 29, 2023

Intro to Istio Observability Using Prometheus Istio service mesh abstracts the network from the application layers using sidecar proxies. You can implement security and advance networking policies to all the communication across your infrastructure using Istio. But another important feature of Istio is observability.

Monitoring

Monitoring Latency Network Infrastructure

Simplified observability for your SNMP devices

Dynatrace

MARCH 22, 2021

While today’s IT world continues the shift toward treating everything as a service, many organizations need to keep their environments under strict control while managing their infrastructure themselves on-premises. Exceeded throughput levels can be a sign that some changes to the network configuration might be required.

Metrics

Metrics Network Infrastructure Traffic

It’s time to migrate from NAM to Dynatrace

Dynatrace

APRIL 2, 2020

For two decades, Dynatrace NAM—Network Application Monitoring, formerly known as DC RUM—has been successfully monitoring the user experience of our customers’ enterprise applications. SNMP managed the costs of network links well, but not the sources of those costs (i.e., Dynatrace news. Performance has always mattered.

Network

Network Traffic Monitoring Java

What is container orchestration?

Dynatrace

MARCH 24, 2023

Containers enable developers to package microservices or applications with the libraries, configuration files, and dependencies needed to run on any infrastructure, regardless of the target system environment. This orchestration includes provisioning, scheduling, networking, ensuring availability, and monitoring container lifecycles.

Infrastructure

Infrastructure Open Source Operating System Cloud

Mastering Kubernetes with Dynatrace

Dynatrace

AUGUST 24, 2020

But there are other related components and processes (for example, cloud provider infrastructure) that can cause problems in applications running on Kubernetes. Dynatrace AWS monitoring gives you an overview of the resources that are used in your AWS infrastructure along with their historical usage. Monitoring your i nfrastructure.

Analytics

Analytics Infrastructure AWS Operating System

From syslog to AWS Firehose: Dynatrace log management innovations that enhance observability

Dynatrace

SEPTEMBER 5, 2024

Native support for Syslog messages Syslog messages are generated by default in Linux and Unix operating systems, security devices, network devices, and applications such as web servers and databases. Native support for syslog messages extends our infrastructure log support to all Linux/Unix systems and network devices.

Innovation

Innovation AWS Analytics Storage

Kubernetes vs Docker: What’s the difference?

Dynatrace

SEPTEMBER 29, 2021

Think of containers as the packaging for microservices that separate the content from its environment – the underlying operating system and infrastructure. Running containers : Docker Engine is a container runtime that runs in almost any environment: Mac and Windows PCs, Linux and Windows servers, the cloud, and on edge devices.

Open Source

Open Source DevOps Traffic Cloud

Why business resiliency depends on unified observability and security

Dynatrace

SEPTEMBER 3, 2024

Software performance can be compromised in many ways, including software bugs, cyberattacks, overwhelming demand, backup failures, network issues, and human error. Teams can use this information to optimize infrastructure and application performance, ensuring that systems can handle increased traffic without compromising user experience.

Infrastructure

Infrastructure Innovation Monitoring Software Performance

Bring syslog into Dynatrace using OpenTelemetry to get open source value with enterprise support

Dynatrace

MARCH 15, 2024

Getting insights into the health and disruptions of your networking or infrastructure is fundamental to enterprise observability. Even for a supported component, delivering logs from applications and infrastructure to DevSecBizOps workflows requires significant manual configuration.

Open Source

Open Source Infrastructure Network Government

Designing Instagram

High Scalability

JANUARY 11, 2022

Machine Learning Engineer at Amazon and has led several machine-learning initiatives across the Amazon ecosystem. FUN FACT : In this talk , Rodrigo Schmidt, director of engineering at Instagram talks about the different challenges they have faced in scaling the data infrastructure at Instagram. System Components. References.

Design

Design Media Storage Logistics

What is IT operations analytics? Extract more data insights from more sources

Dynatrace

MAY 1, 2023

IT operations analytics is the process of unifying, storing, and contextually analyzing operational data to understand the health of applications, infrastructure, and environments and streamline everyday operations. Here are the six steps of a typical ITOA process : Define the data infrastructure strategy. Apache Spark.

Analytics

Analytics Artificial Intelligence Big Data Open Source

Part 1: A Survey of Analytics Engineering Work at Netflix

AI-powered DNS request tracking extends infrastructure observability for high quality network traffic

Trending Sources

How Netflix uses eBPF flow logs at scale for network insight

Dynatrace extends Synthetic Monitoring capabilities with Network Availability Monitors to validate the availability of infrastructure and services

For your eyes only: improving Netflix video quality with neural networks

Building Netflix’s Distributed Tracing Infrastructure

New SNMP platform extensions provide observability at scale for network devices

Measuring Network Performance in Mobile Safari

Infrastructure Monitoring tools: 3 steps to evolve ITOps into AIOps

Build systems more reliably with Dynatrace: Chaos Engineering

Network performance monitoring top of mind for CloudOps teams

The keys to selecting a platform for end-to-end observability

Building Resilience With Chaos Engineering and Litmus

Accelerate resolution of network issues with AI-powered event reporting based on SNMP traps

Vulnerability assessment: key to protecting applications and infrastructure

Power Dashboarding, Part I: Start your exploration journey with Dashboards

DevOps engineer tools: Deploy, test, evaluate, repeat

Infrastructure monitoring for enterprise cloud – 4 key requirements

Cut costs and complexity: 5 strategies for reducing tool sprawl with Dynatrace

Five-nines availability: Always-on infrastructure delivers system availability during the holidays’ peak loads

AI-powered infrastructure monitoring for your SAP HANA database (Preview)

Hybrid cloud infrastructure explained: Weighing the pros, cons, and complexities

Who will watch the watchers? Extended infrastructure observability for WSO2 API Manager

Leveraging Infrastructure as Code for Data Engineering Projects: A Comprehensive Guide

Extend the AI and automation core of Dynatrace with host extensions to resolve infrastructure problems

How Data Inspires Building a Scalable, Resilient and Secure Cloud Infrastructure At Netflix

Better dashboarding with Dynatrace Davis AI: Instant meaningful insights

Uber Infrastructure in 2019: Improving Reliability, Driving Customer Satisfaction

How to Prepare for Your DevOps Interview

Hyper Scale VPC Flow Logs enrichment to provide Network Insight

Power dashboarding part 2: Dynatrace dashboard tutorial to gain better, faster answers using AI and formatting

Python at Netflix

Observability engineering: Getting Prometheus metrics right for Kubernetes with Dynatrace and Kepler

How to Configure Istio, Prometheus and Grafana for Monitoring

Simplified observability for your SNMP devices

It’s time to migrate from NAM to Dynatrace

What is container orchestration?

Mastering Kubernetes with Dynatrace

From syslog to AWS Firehose: Dynatrace log management innovations that enhance observability

Kubernetes vs Docker: What’s the difference?

Why business resiliency depends on unified observability and security

Bring syslog into Dynatrace using OpenTelemetry to get open source value with enterprise support

Designing Instagram

What is IT operations analytics? Extract more data insights from more sources

Stay Connected