Data Engineering and Speed - Technology Performance Pulse

Data Engineers of Netflix?—?Interview with Kevin Wylie

The Netflix TechBlog

JULY 15, 2021

Data Engineers of Netflix?—?Interview Interview with Kevin Wylie This post is part of our “Data Engineers of Netflix” series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. Kevin, what drew you to data engineering?

Data Engineering

Data Engineering Engineering Entertainment Big Data

Data Engineers of Netflix?—?Interview with Samuel Setegne

The Netflix TechBlog

JUNE 1, 2021

Data Engineers of Netflix?—?Interview Interview with Samuel Setegne Samuel Setegne This post is part of our “Data Engineers of Netflix” interview series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. What drew you to Netflix?

Data Engineering

Data Engineering Engineering Big Data Healthcare

Automated Testing in Data Engineering: An Imperative for Quality and Efficiency

DZone

JANUARY 9, 2024

This holds true for the critical field of data engineering as well. As organizations gather and process astronomical volumes of data, manual testing is no longer feasible or reliable. Automated testing methodologies are now imperative to deliver speed, accuracy, and integrity.

Data Engineering

Data Engineering Efficiency Engineering Testing

Secrets Detection: Optimizing Filter Processes

DZone

FEBRUARY 8, 2022

While increasing both the precision and the recall of our secrets detection engine, we felt the need to keep a close eye on speed. In a gearbox, if you want to increase torque, you need to decrease speed. So it wasn’t a surprise to find that our engine had the same problem: more power, less speed.

Processing

Processing Benchmarking Speed Engineering

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

Our Flink configuration includes 8 task managers per region, each equipped with 8 CPU cores and 32GB of memory, operating at a parallelism of 48, allowing us to handle the necessary scale and speed for seamless performance delivery.

Tuning

Tuning Latency Efficiency Storage

Ready-to-go sample data pipelines with Dataflow

The Netflix TechBlog

DECEMBER 3, 2022

Having a well-documented starting point removes some of the struggle that comes with starting from scratch and considerably speeds up the first iteration of the development cycle. Onboarding Ramping up on a new team or a business vertical always takes some effort, especially in a “highly aligned, loosely coupled” culture.

Best Practices

Best Practices Code Testing Data Engineering

How Data Inspires Building a Scalable, Resilient and Secure Cloud Infrastructure At Netflix

The Netflix TechBlog

MARCH 5, 2019

While our engineering teams have and continue to build solutions to lighten this cognitive load (better guardrails, improved tooling, …), data and its derived products are critical elements to understanding, optimizing and abstracting our infrastructure. Give us a holler if you are interested in a thought exchange.

Infrastructure

Infrastructure Cloud Scalability AWS

Incremental Processing using Netflix Maestro and Apache Iceberg

The Netflix TechBlog

NOVEMBER 20, 2023

These challenges are currently addressed in suboptimal and less cost efficient ways by individual local teams to fulfill the needs, such as Lookback: This is a generic and simple approach that data engineers use to solve the data accuracy problem. Users configure the workflow to read the data in a window (e.g.

Processing

Processing Big Data Efficiency Engineering

Pushy to the Limit: Evolving Netflix’s WebSocket proxy for the future

The Netflix TechBlog

SEPTEMBER 10, 2024

The folks on the Cloud Data Engineering (CDE) team, the ones building the paved path for internal data at Netflix, graciously helped us scale it up and make adjustments, but it ended up being an involved process as we kept growing. As Pushy’s portfolio grew, we experienced some pain points with Dynomite.

Latency

Latency Cache Tuning Efficiency

Hyper Scale VPC Flow Logs enrichment to provide Network Insight

The Netflix TechBlog

MAY 26, 2020

Spark could look up and retrieve the data in the s3 files that the Mouthful represented. This intermediate step of persisting Mouthfuls allowed us to easily “eat” through S3 event SQS messages at great speed, converting them to far fewer Mouthful SQS Messages which would each be consumed by a single Spark app instance.

Network

Network Tuning AWS Traffic

Expanding the Cloud: Introducing Amazon QuickSight

All Things Distributed

OCTOBER 7, 2015

While BI solutions have existed for decades, customers have told us that it takes an enormous amount of time, engineering effort, and money to bridge this gap. These solutions lack interactive data exploration and visualization capabilities, limiting most business users to canned reports and pre-selected queries.

Cloud

Cloud Big Data AWS Analytics

5 data integration trends that will define the future of ETL in 2018

Abhishek Tiwari

DECEMBER 27, 2017

Data solution vendors like SnapLogic and Informatica are already developing machine learning and artificial intelligence (AI) based smart data integration assistants. These assistants can recommend next-best-action or suggest datasets, transforms, and rules to a data engineer working on a data integration project.

Big Data

Big Data Artificial Intelligence Storage Hardware

Organise your engineering teams around the work by reteaming

Abhishek Tiwari

JULY 20, 2019

Depending on work you can choose a smaller team of similar expertise (for example a team with mostly frontend engineers) or a smaller team of diverse expertise (team with balanced frontend, backend, data engineers). Thirdly, let engineers themselves choose the delivery teams and organise them around the initiative.

Engineering

Engineering Retail Airlines Healthcare

Sustainability at AWS re:Invent 2022 All the talks and videos I could find…

Adrian Cockcroft

FEBRUARY 13, 2023

STP213 Scaling global carbon footprint management — Blake Blackwell Persefoni Manager Data Engineering and Michael Floyd AWS Head of Sustainability Solutions. Partner oriented session getting everyone up to speed on what AWS sees as the customer needs, motivations, business outcomes and architectures around sustainability.

AWS

AWS Energy Architecture Programming

Top 20 Websites For Online Automation Testing Courses and Certifications

Testsigma

NOVEMBER 28, 2019

Udacity Udacity provides nanodegree programs on all automation languages like C++, Machine Learning, Data engineer, Robotics and more. Alternatively, you could also take the help of some amazing services to multiply the speed of testing by evaluating easy to use test automation tools like Testsigma with zero upfront investments.

Website

Website Testing Programming Automotive

Reimagining Experimentation Analysis at Netflix

The Netflix TechBlog

SEPTEMBER 10, 2019

This enables us to optimize their experience at speed. Our data scientists faced numerous challenges in our previous infrastructure. Complex business logic was embedded directly into the ETL pipelines by data engineers. In order to replicate results, scientists had to delve deep into the data, code, and documentation.

Metrics

Metrics Architecture Infrastructure Innovation

Spice up your Analytics: Amazon QuickSight Now Generally Available in N. Virginia, Oregon, and Ireland.

All Things Distributed

NOVEMBER 15, 2016

They require teams of data engineers to spend months building complex data models and synthesizing the data before they can generate their first report. The cost and complexity to implement, scale, and use BI makes it difficult for most companies to make data analysis ubiquitous across their organizations.

Analytics

Analytics Availability Media Social Media

Educating a New Generation of Workers

O'Reilly

NOVEMBER 26, 2024

Entirely new paradigms rise quickly: cloud computing, data engineering, machine learning engineering, mobile development, and large language models. It’s less risky to hire adjunct professors with industry experience to fill teaching roles that have a vocational focus: mobile development, data engineering, and cloud computing.

Education

Education Azure AWS Java

Technology Performance Pulse

Data Engineers of Netflix?—?Interview with Kevin Wylie

Data Engineers of Netflix?—?Interview with Samuel Setegne

Trending Sources

Automated Testing in Data Engineering: An Imperative for Quality and Efficiency

Secrets Detection: Optimizing Filter Processes

Introducing Impressions at Netflix

Ready-to-go sample data pipelines with Dataflow

How Data Inspires Building a Scalable, Resilient and Secure Cloud Infrastructure At Netflix

Incremental Processing using Netflix Maestro and Apache Iceberg

Pushy to the Limit: Evolving Netflix’s WebSocket proxy for the future

Hyper Scale VPC Flow Logs enrichment to provide Network Insight

Expanding the Cloud: Introducing Amazon QuickSight

5 data integration trends that will define the future of ETL in 2018

Organise your engineering teams around the work by reteaming

Sustainability at AWS re:Invent 2022 All the talks and videos I could find…

Top 20 Websites For Online Automation Testing Courses and Certifications

Reimagining Experimentation Analysis at Netflix

Spice up your Analytics: Amazon QuickSight Now Generally Available in N. Virginia, Oregon, and Ireland.

Educating a New Generation of Workers

Stay Connected