Engineering, Scalability and Software Engineering

Low-Maintenance Backend Architectures for Scalable Applications

DZone

JANUARY 10, 2025

After years of working in the intricate world of software engineering, I learned that the most beautiful solutions are often those unseen: backends that hum along, scaling with grace and requiring very little attention.

Architecture

Architecture Scalability Software Engineering Cloud

What is platform engineering?

Dynatrace

NOVEMBER 3, 2023

With growing multicloud complexity and the need for organization-wide scalability, self-service and automation capabilities have become increasingly essential for developer productivity. In response to this shift, platform engineering is growing in popularity. The result is a cloud-native approach to software delivery.

Engineering

Engineering DevOps Software Engineering Scalability

SRE Best Practices for Java Applications

DZone

MARCH 12, 2025

Site reliability engineering (SRE) plays a vital role in ensuring Java applications' high availability, performance, and scalability. This discipline merges software engineering and operations, aiming to create a robust infrastructure that supports seamless user experiences.

Best Practices

Best Practices Java Software Engineering Scalability

Key Elements of Site Reliability Engineering (SRE)

DZone

MARCH 14, 2023

Site Reliability Engineering (SRE) is a systematic and data-driven approach to improving the reliability, scalability, and efficiency of systems. It combines principles of software engineering, operations, and quality assurance to ensure that systems meet performance goals and business objectives.

Engineering

Engineering Software Engineering Scalability Efficiency

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dynatrace

NOVEMBER 7, 2023

Platform engineering is on the rise. According to leading analyst firm Gartner, “80% of software engineering organizations will establish platform teams as internal providers of reusable services, components, and tools for application delivery…” by 2026.

Engineering

Engineering DevOps Best Practices Infrastructure

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

One approach to such a challenging scenario is stream processing, a computing paradigm and software architectural style for data-intensive software systems that emerged to cope with requirements for near real-time processing of massive amounts of data. We designed experimental scenarios inspired by chaos engineering.

Engineering

Engineering Tuning Latency Open Source

Site reliability engineering: 5 things you need to know

Dynatrace

FEBRUARY 4, 2021

What is site reliability engineering? Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. SRE focuses on automation.

Engineering

Engineering DevOps Government Latency

Site Reliability Engineering

DZone

JANUARY 19, 2024

In the dynamic world of online services, the concept of site reliability engineering (SRE) has risen as a pivotal discipline, ensuring that large-scale systems maintain their performance and reliability.

Engineering

Engineering Tuning Software Engineering Internet

Demystifying Interviewing for Backend Engineers @ Netflix

The Netflix TechBlog

FEBRUARY 1, 2022

By Karen Casella, Director of Engineering, Access & Identity Management. Have you ever experienced one of the following scenarios while looking for your next role? Most backend engineering teams follow a process very similar to what is shown below. If so, we invite you to begin the interview process.

Engineering

Engineering Games Entertainment Innovation

A Recap of the Data Engineering Open Forum at Netflix

The Netflix TechBlog

JUNE 20, 2024

A summary of sessions at the first Data Engineering Open Forum at Netflix on April 18th, 2024. At Netflix, we aspire to entertain the world, and our data engineering teams play a crucial role in this mission by enabling data-driven decision-making at scale.

Data Engineering

Data Engineering Engineering Entertainment Software Engineering

Site reliability engineering: 5 things you need to know

Dynatrace

FEBRUARY 4, 2021

Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. Organizations can then integrate these skilled engineers at key points in the DevOps life cycle.

Engineering

Engineering DevOps Government Latency

Scaling Is Not Just About Products – It’s About Teams, Too

DZone

SEPTEMBER 19, 2021

We are well aware of what is meant by system scalability. System scalability is about maintaining the SLA of the system as the user base continues to grow and as the user activity continues to rise. However, to build highly successful products, this is not the only type of scalability that we should worry about.

Scalability

Scalability Software Engineering Engineering Systems

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

This standardization enhances adoption within the personalization stack, simplifies the system, and improves understanding and debuggability for engineers. They must also provide enough information for partner engineers to identify the problem with the underlying service in cases of system-level issues.

Traffic

Traffic Strategy Entertainment Innovation

Growth Engineering at Netflix- Creating a Scalable Offers Platform

The Netflix TechBlog

FEBRUARY 9, 2021

The Growth Engineering team is responsible for executing growth initiatives that help us anticipate and adapt to this change. For more background on Growth Engineering and the signup funnel, please have a look at our previous blog post that covers the basics. We need to be constantly adapting and innovating as a result of this change.

Engineering

Engineering Scalability Architecture Innovation

Up your quality and agility factor – using automation to build “performance-as-a-self-service”

Dynatrace

MARCH 3, 2020

For software engineering teams, this demand means not only delivering new features faster but ensuring quality, performance, and scalability too. One way to apply improvements is transforming the way application performance engineering and testing is done.

Performance

Performance Education Innovation Software Architecture

Computational Causal Inference at Netflix

The Netflix TechBlog

AUGUST 11, 2020

These methods can provide rich information for decision making, such as in experimentation platforms (“XP”) or in algorithmic policy engines. We want to amplify the effectiveness of our researchers by providing them software that can estimate causal effects models efficiently, and can integrate causal effects into large engineering systems.

Software Engineering

Software Engineering Scalability Engineering Strategy

SRE vs DevOps: What you need to know

Dynatrace

FEBRUARY 24, 2021

SRE is the transformation of traditional operations practices by using software engineering and DevOps principles to improve the availability, performance, and scalability of releases by building resiliency into apps and infrastructure. Investing in automation and tooling to avoid toil. SRE vs DevOps?

DevOps

DevOps Software Engineering Speed Google

Stuff The Internet Says On Scalability For October 5th, 2018

High Scalability

OCTOBER 5, 2018

antirez : "After 20 years as a software engineer, I've started commenting heavily. Don't miss all that the Internet has to say on Scalability, click below and become eventually consistent with all scalability knowledge (which means this post has many more items to read so please keep on reading). So many more quotes.

Internet

Internet Scalability Software Engineering

Beyond “Prompt and Pray”

O'Reilly

JANUARY 21, 2025

These workflows are then implemented as traditional software, which can be tested, versioned, and maintained. This approach is well understood in software engineering and contrasts sharply with building agents that rely on runtime decisions, an inherently less reliable and harder-to-maintain model.

Software Engineering

Software Engineering Efficiency Engineering Systems

Designing Instagram

High Scalability

JANUARY 11, 2022

This is a guest post by Ankit Sirmorya, a Machine Learning Engineer at Amazon who has led several machine-learning initiatives across the Amazon ecosystem. Fun fact: in this talk, Rodrigo Schmidt, director of engineering at Instagram, talks about the different challenges they have faced in scaling the data infrastructure at Instagram.

Design

Design Media Storage Logistics

Data Engineers of Netflix — Interview with Dhevi Rajendran

The Netflix TechBlog

JUNE 1, 2021

This post is part of our “Data Engineers of Netflix” interview series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. The culture was also something that piqued my interest.

Data Engineering

Data Engineering Engineering Software Engineering Big Data

Stuff The Internet Says On Scalability For July 13th, 2018

High Scalability

JULY 13, 2018

billion : made by Pokémon GO; $13 billion : Netflix's new content budget; Quotable Quotes: @davidbrunelle : The best developers and engineering leaders I've personally worked with do *not* have a notable presence on GitHub or public bodies of speaking or writing work. Margaret Hamilton started the field of software engineering.

Internet

Internet Scalability AWS

Starting an SRE Team? Stay Away From Uptime.

DZone

DECEMBER 8, 2021

A good SRE engineer will tell you your service is never down. A great SRE engineer will tell you that’s not what you should be measuring. In fact, they’ll tell you their job is customer service.

Engineering

Engineering Scalability Systems Traffic

Mastering System Design: A Comprehensive Guide to System Scaling for Millions (Part 1)

DZone

JANUARY 19, 2024

A transformative journey into the realm of system design with our tutorial, tailored for software engineers aspiring to architect solutions that seamlessly scale to serve millions of users.

Systems

Systems Design Software Engineering Scalability

Scaling Appsec at Netflix (Part 2)

The Netflix TechBlog

JUNE 6, 2022

By Astha Singhal, Lakshmi Sudheer, Julia Knecht. The Application Security teams at Netflix are responsible for securing the software footprint that we create to run the Netflix product, the Netflix studio, and the business. Our customers are product and engineering teams at Netflix that build these software services and platforms.

Software Engineering

Software Engineering Scalability Education Engineering

What is DevOps orchestration? And why invest in orchestration tools?

Dynatrace

DECEMBER 5, 2022

One key advantage of this integration is a single point of access to monitoring, logging, and other information needed to keep software development operations running efficiently. Orchestration leverages DevOps tools that allow for rapid updates and releases, version control, and other best practices for software engineering.

DevOps

DevOps Virtualization Best Practices Innovation

Conducting log analysis with an observability platform and full data context

Dynatrace

APRIL 20, 2023

Causal AI—which brings AI-enabled actionable insights to IT operations—and a data lakehouse, such as Dynatrace Grail, can help break down silos among ITOps, DevSecOps, site reliability engineering, and business analytics teams. “It’s quite a big scale,” said an engineer at the financial services group.

Analytics

Analytics Infrastructure Storage Architecture

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

The Netflix TechBlog

MARCH 25, 2019

Now, imagine yourself in the role of a software engineer responsible for a microservice which publishes data consumed by a few critical customer-facing services (e.g. …). In this model, we scan system logs and metadata generated by various compute engines to collect corresponding lineage data.

Infrastructure

Infrastructure Big Data Transportation Architecture

How and Why the Developer-First Approach Is Changing the Observability Landscape

DZone

DECEMBER 11, 2024

In our quest for greater scalability, resilience, and flexibility within the digital infrastructure of our organization, there has been a strategic pivot away from traditional monolithic application architectures towards embracing modern software engineering practices such as microservices architecture coupled with cloud-native applications.

Development

Development Software Engineering Architecture Scalability

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

By Shefali Vyas Dalal. AWS re:Invent is a couple of weeks away and our engineers & leaders are thrilled to be in attendance yet again this year! Over the years, this platform took on support for both elastic online services and fully featured batch workloads supporting use cases across Netflix engineering.

AWS

AWS Entertainment Open Source Benchmarking

DevOps observability: A guide for DevOps and DevSecOps teams

Dynatrace

JANUARY 18, 2023

From site reliability engineering to service-level objectives and DevSecOps, these resources focus on how organizations are using these best practices to innovate at speed without sacrificing quality, reliability, or security. SRE applies software engineering principles to operations and infrastructure processes.

DevOps

DevOps Best Practices Innovation Strategy

What is application security? And why it needs a new approach

Dynatrace

MARCH 17, 2021

Application security is a software engineering term that refers to several different types of security practices designed to ensure applications do not contain vulnerabilities that could allow illicit access to sensitive data, unauthorized code modification, or resource hijacking.

Open Source

Open Source Cloud Games Java

Automated observability, security, and reliability at scale

Dynatrace

JULY 18, 2023

To handle this challenge, enterprises need to automate and streamline the onboarding and lifecycle of tool configurations in the software development processes, including aspects of observability, security, alerting, and remediation. Development teams must set up tailored configurations for each tool and component they’re responsible for.

Best Practices

Best Practices Code Infrastructure Latency

Orchestrating Data/ML Workflows at Scale With Netflix Maestro

The Netflix TechBlog

OCTOBER 18, 2022

As Big data and ML became more prevalent and impactful, the scalability, reliability, and usability of the orchestrating ecosystem have increasingly become more important for our data scientists and the company. Motivation Scalability and usability are essential to enable large-scale workflows and support a wide range of use cases.

Java

Java Scalability Traffic Architecture

MLOps and DevOps: Why Data Makes It Different

O'Reilly

OCTOBER 19, 2021

This is both frustrating for companies that would prefer making ML an ordinary, fuss-free value-generating function like software engineering, as well as exciting for vendors who see the opportunity to create buzz around a new category of enterprise software. The new category is often called MLOps. This approach is not novel.

DevOps

DevOps Software Engineering Infrastructure Open Source

Dynatrace Perform 2024 Guide: Deriving business value from AI data analysis

Dynatrace

JANUARY 23, 2024

‘Composite’ AI, platform engineering, AI data analysis through custom apps. This focus on data reliability and data quality also highlights the need for organizations to bring a “composite AI” approach to IT operations, security, and DevOps. To learn more about platform engineering, explore the following resources.

Performance

Performance DevOps Innovation Energy

AWS observability: AWS monitoring best practices for resiliency

Dynatrace

NOVEMBER 22, 2021

To gain insight into these problems, software engineers typically deploy application instrumentation frameworks that provide insight into applications and code. While this provides greater scalability than on-site instrumentation, it also introduces complexity. AWS monitoring best practices. Automate monitoring tasks.

Best Practices

Best Practices AWS Monitoring Serverless

The Show Must Go On: Securing Netflix Studios At Scale

The Netflix TechBlog

SEPTEMBER 13, 2021

Supporting developers through those checklists for edge cases, and then validating that each team’s choices resulted in an architecture with all the desired security properties, was similarly not scalable for our security engineers. Netflix engineers talk a lot about the concept of a “Paved Road”.

Internet

Internet Cloud Traffic

Dynatrace extends distributed tracing for serverless on AWS Lambda (GA)

Dynatrace

JANUARY 19, 2021

The Dynatrace AI engine, Davis, automatically… The new Dynatrace AWS Lambda extension further improves enterprise-grade scalability with low memory overhead, effortless manageability, continuous automation, and granular access-permission controls that support the structures of cloud-native applications teams within large organizations.

Lambda

Lambda Serverless AWS Mobile

Best PostgreSQL GUI [2024]

Scalegrid

OCTOBER 18, 2024

They are lightweight and scalable, and do not require a significant financial investment. Team Size Small Teams: For smaller teams or solo developers, free and open-source tools such as pgAdmin or OmniDB offer more than enough functionality for routine database management.

Open Source

Open Source Database Cloud Operating System

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

DECEMBER 3, 2019

mainly because of mundane reasons related to software engineering. They know that feature engineering is critical for many models, so they want to stay in control of model inputs and feature engineering logic. The user can benefit from infinitely scalable compute clusters by adding a single line in their code: @batch.

Open Source

Open Source AWS Infrastructure Energy
