Engineering, Infrastructure and Software Engineering

Sustainability: Thoughts from a software engineer

Dynatrace

MARCH 17, 2025

How to achieve sustainable IT practices: use observability tools. The first step in driving improvements is to obtain a comprehensive view of your IT infrastructure’s climate impact. Platform engineers can set defaults for development teams, such as the number of replicas a service should have or whether it scales automatically.

Software Engineering

Software Engineering Engineering Software Software

Bringing Software Engineering Rigor to Data

DZone

FEBRUARY 20, 2023

In software engineering, we've learned that building robust and stable applications has a direct correlation with overall organization performance. The data community is striving to incorporate the core concepts of engineering rigor found in software communities but still has further to go.

Software Engineering

Software Engineering Engineering Software Software

What is platform engineering?

Dynatrace

NOVEMBER 3, 2023

In response to this shift, platform engineering is growing in popularity. The practice of platform engineering has evolved alongside the increasing complexity of cloud environments. The result is a cloud-native approach to software delivery. Why is platform engineering important?

Engineering

Engineering DevOps Software Engineering Scalability

SRE Best Practices for Java Applications

DZone

MARCH 12, 2025

Site reliability engineering (SRE) plays a vital role in ensuring Java applications' high availability, performance, and scalability. This discipline merges software engineering and operations, aiming to create a robust infrastructure that supports seamless user experiences.

Best Practices

Best Practices Java Software Engineering Scalability

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dynatrace

NOVEMBER 7, 2023

Platform engineering is on the rise. According to leading analyst firm Gartner, “80% of software engineering organizations will establish platform teams as internal providers of reusable services, components, and tools for application delivery…” by 2026. Automation, automation, automation.

Engineering

Engineering DevOps Best Practices Infrastructure

Key Elements of Site Reliability Engineering (SRE)

DZone

MARCH 14, 2023

Site Reliability Engineering (SRE) is a systematic and data-driven approach to improving the reliability, scalability, and efficiency of systems. It combines principles of software engineering, operations, and quality assurance to ensure that systems meet performance goals and business objectives.

Engineering

Engineering Software Engineering Scalability Efficiency

How platform engineering and IDP observability can accelerate developer velocity

Dynatrace

MARCH 6, 2024

As organizations look to expand DevOps maturity, improve operational efficiency, and increase developer velocity, they are embracing platform engineering as a key driver. The goal is to abstract away the underlying infrastructure’s complexities while providing a streamlined and standardized environment for development teams.

Engineering

Engineering Development DevOps Infrastructure

Site reliability engineering: 5 things you need to know

Dynatrace

FEBRUARY 4, 2021

What is site reliability engineering? Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. SRE focuses on automation.

Engineering

Engineering DevOps Government Latency

Why applying chaos engineering to data-intensive applications matters

Dynatrace

MAY 23, 2024

Stream processing: One approach to such a challenging scenario is stream processing, a computing paradigm and software architectural style for data-intensive software systems that emerged to cope with requirements for near real-time processing of massive amounts of data. We designed experimental scenarios inspired by chaos engineering.

Engineering

Engineering Tuning Latency Open Source

Automating Success: Building a better developer experience with platform engineering

Dynatrace

FEBRUARY 12, 2024

When it comes to platform engineering, not only does observability play a vital role in the success of organizations’ transformation journeys—it’s key to successful platform engineering initiatives. The various presenters in this session aligned platform engineering use cases with the software development lifecycle.

Engineering

Engineering Development Infrastructure Cloud

Site reliability engineering: 5 things you need to know

Dynatrace

FEBRUARY 4, 2021

Site reliability engineering (SRE) is the practice of applying software engineering principles to operations and infrastructure processes to help organizations create highly reliable and scalable software systems. SRE focuses on automation. SRE drives a “shift left” mindset.

Engineering

Engineering DevOps Government Latency

Site Reliability Engineering

DZone

JANUARY 19, 2024

In the dynamic world of online services, the concept of site reliability engineering (SRE) has risen as a pivotal discipline, ensuring that large-scale systems maintain their performance and reliability.

Engineering

Engineering Tuning Software Engineering Internet

Demystifying Interviewing for Backend Engineers @ Netflix

The Netflix TechBlog

FEBRUARY 1, 2022

By Karen Casella, Director of Engineering, Access & Identity Management. Have you ever experienced one of the following scenarios while looking for your next role? Most backend engineering teams follow a process very similar to what is shown below. If so, we invite you to begin the interview process.

Engineering

Engineering Games Entertainment Innovation

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

The Netflix TechBlog

MARCH 25, 2019

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and Efficiency. By: Di Lin, Girish Lingappa, Jitender Aswani. Imagine yourself in the role of a data-inspired decision maker staring at a metric on a dashboard about to make a critical business decision but pausing to ask a question — “Can

Infrastructure

Infrastructure Big Data Transportation Architecture

Data Engineers of Netflix — Interview with Pallavi Phadnis

The Netflix TechBlog

OCTOBER 28, 2021

Data Engineers of Netflix — Interview with Pallavi Phadnis. This post is part of our “Data Engineers of Netflix” series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. Pallavi Phadnis is a Senior Software Engineer at Netflix.

Data Engineering

Data Engineering Engineering Big Data Software Engineering

Podcast: Interview with Software Engineering Daily

Sutter's Mill

JUNE 7, 2024

Also in April, I was interviewed by Jordi Mon Companys for Software Engineering Daily, and that interview was just published on the SE Daily podcast. government recently released a report calling on the technical community to proactively reduce the attack surface area of software infrastructure.

Software Engineering

Software Engineering Engineering Software Software

How to Prepare for Your DevOps Interview

DZone

SEPTEMBER 5, 2019

Over the past decade, DevOps has emerged as a new tech culture and career that marries the rapid iteration desired by software development with the rock-solid stability of the infrastructure operations team. As of August 2019, there are currently over 50,000 LinkedIn DevOps job listings in the United States alone.

DevOps

DevOps Software Engineering Infrastructure Engineering

Extend the AI and automation core of Dynatrace with host extensions to resolve infrastructure problems

Dynatrace

MAY 13, 2020

Let the Davis AI causation engine analyze additional metrics. All the data bound to hosts is analyzed by the Davis AI causation engine and made available on custom dashboards and events pages.

Infrastructure

Infrastructure Metrics Monitoring Software Engineering

Software engineering for machine learning: a case study

The Morning Paper

JULY 7, 2019

Software engineering for machine learning: a case study, Amershi et al., ICSE’19. More specifically, we’ll be looking at the results of an internal study with over 500 participants designed to figure out how product development and software engineering is changing at Microsoft with the rise of AI and ML.

Software Engineering

Software Engineering Engineering Software Software

SRE vs DevOps: What you need to know

Dynatrace

FEBRUARY 24, 2021

SRE is the transformation of traditional operations practices by using software engineering and DevOps principles to improve the availability, performance, and scalability of releases by building resiliency into apps and infrastructure. Investing in automation and tooling to avoid toil. SRE vs DevOps?

DevOps

DevOps Software Engineering Speed Google

How Red Hat and Dynatrace intelligently automate your production environment

Dynatrace

MAY 6, 2024

Problem remediation is too time-consuming: According to the DevOps Automation Pulse Survey 2023, on average, a software engineer takes nine hours to remediate a problem within a production application. With that, software engineers, SREs, and DevOps can define a broad automation and remediation mapping.

DevOps

DevOps Software Engineering Games Java

Nurturing Design in Your Software Engineering Culture

Strategic Tech

MARCH 16, 2021

There are a few qualities that differentiate average from high performing software engineering organisations. In my experience, the culture is better and the results are better in orgs where engineers and architects obsess over the design of code and architecture. My experience is the opposite.

Software Engineering

Software Engineering Design Engineering Software

Auto-adaptive thresholds for AI-driven quality gating

Dynatrace

JUNE 4, 2024

Build an umbrella for Development and Operations: In modern software engineering, the discipline of platform engineering delivers DevSecOps practices to developers to bridge the gaps between development, security, and operations and enhance the developer experience.

Metrics

Metrics Engineering Code Tuning

Site reliability engineering: Six SRE trends to unleash DevOps innovation

Dynatrace

JUNE 2, 2022

Site reliability engineering (SRE) continues to gain popularity as organizations embrace hybrid cloud strategies and IT automation at scale. By applying software engineering principles to operations and infrastructure practices, SRE enables organizations to streamline and automate IT processes.

DevOps

DevOps Innovation Engineering Benchmarking

Conducting log analysis with an observability platform and full data context

Dynatrace

APRIL 20, 2023

With more automated approaches to log monitoring and log analysis, however, organizations can gain visibility into their applications and infrastructure efficiently and with greater precision—even as cloud environments grow. They enable IT teams to identify and address the precise cause of application and infrastructure issues.

Analytics

Analytics Infrastructure Storage Architecture

Site reliability done right: 5 SRE best practices that deliver on business objectives

Dynatrace

MAY 31, 2023

Site reliability engineering (SRE) has become a critical discipline in recent years as the world has shifted in favor of web-based interactions. This shift is leading more organizations to hire site reliability engineers to guarantee the reliability and resiliency of their services. Mobile retail e-commerce spending in the U.

Best Practices

Best Practices DevOps Latency Metrics

What is DevOps orchestration? And why invest in orchestration tools?

Dynatrace

DECEMBER 5, 2022

Today, DevOps orchestration is necessary to gain a comprehensive view and means of control over infrastructure, services, and software development practices. One key advantage of this integration is a single point of access to monitoring, logging, and other information needed to keep software development operations running efficiently.

DevOps

DevOps Virtualization Best Practices Innovation

Automated observability, security, and reliability at scale

Dynatrace

JULY 18, 2023

While infrastructure has historically been treated as a bottleneck where proper scaling and compute power are applied to improve performance, these aspects are now typically addressed by hyperscalers that offer cloud-based infrastructure and infrastructure as a service.

Best Practices

Best Practices Code Infrastructure Latency

Designing Instagram

High Scalability

JANUARY 11, 2022

Machine Learning Engineer at Amazon and has led several machine-learning initiatives across the Amazon ecosystem. FUN FACT: In this talk, Rodrigo Schmidt, director of engineering at Instagram, talks about the different challenges they have faced in scaling the data infrastructure at Instagram. System Components.

Design

Design Media Storage Logistics

Netflix at AWS re:Invent 2019

The Netflix TechBlog

NOVEMBER 22, 2019

By Shefali Vyas Dalal. AWS re:Invent is a couple weeks away and our engineers & leaders are thrilled to be in attendance yet again this year! Netflix shares how Amazon EC2 Auto Scaling allows its infrastructure to automatically adapt to changing traffic patterns in order to keep its audience entertained and its costs on target.

AWS

AWS Entertainment Open Source Benchmarking

Scaling Appsec at Netflix (Part 2)

The Netflix TechBlog

JUNE 6, 2022

By Astha Singhal, Lakshmi Sudheer, Julia Knecht. The Application Security teams at Netflix are responsible for securing the software footprint that we create to run the Netflix product, the Netflix studio, and the business. Our customers are product and engineering teams at Netflix that build these software services and platforms.

Software Engineering

Software Engineering Scalability Education Engineering

AWS observability: AWS monitoring best practices for resiliency

Dynatrace

NOVEMBER 22, 2021

Because of its matrix of cloud services across multiple environments, AWS and other multicloud environments can be more difficult to manage and monitor compared with traditional on-premises infrastructure. EC2 is Amazon’s Infrastructure-as-a-service (IaaS) compute platform designed to handle any workload at scale. Amazon EC2.

Best Practices

Best Practices AWS Monitoring Serverless

The Show Must Go On: Securing Netflix Studios At Scale

The Netflix TechBlog

SEPTEMBER 13, 2021

Supporting developers through those checklists for edge cases, and then validating that each team’s choices resulted in an architecture with all the desired security properties, was similarly not scalable for our security engineers. Netflix engineers talk a lot about the concept of a “Paved Road”.

Internet

Internet Internet Cloud Traffic

All of Netflix’s HDR video streaming is now dynamically optimized

The Netflix TechBlog

NOVEMBER 29, 2023

Join us and be a part of the amazing team that brought you this tech-blog; open positions: Software Engineer, Cloud Gaming Software Engineer, Live Streaming

Open Source

Open Source Software Engineering Internet Internet

Growth Engineering at Netflix- Creating a Scalable Offers Platform

The Netflix TechBlog

FEBRUARY 9, 2021

The Growth Engineering team is responsible for executing growth initiatives that help us anticipate and adapt to this change. For more background on Growth Engineering and the signup funnel, please have a look at our previous blog post that covers the basics. We need to be constantly adapting and innovating as a result of this change.

Engineering

Engineering Scalability Architecture Innovation

Connect your software with the right people: Ownership drives effective collaboration

Dynatrace

MARCH 28, 2023

Incident management with clearly defined responsibilities: Site Reliability Engineers (SRE) are challenged not only to detect problems and identify the root cause quickly but also to remediate problems immediately. Any software engineer can search for monitored entities that relate to specific deployments and their respective teams.

Software

Software Software Monitoring Software Engineering

How and Why the Developer-First Approach Is Changing the Observability Landscape

DZone

DECEMBER 11, 2024

In our quest for greater scalability, resilience, and flexibility within the digital infrastructure of our organization, there has been a strategic pivot away from traditional monolithic application architectures towards embracing modern software engineering practices such as microservices architecture coupled with cloud-native applications.

Development

Development Software Engineering Architecture Scalability

Open-Sourcing Metaflow, a Human-Centric Framework for Data Science

The Netflix TechBlog

DECEMBER 3, 2019

About two years ago, we, at our newly formed Machine Learning Infrastructure team started asking our data scientists a question: “What is the hardest thing for you as a data scientist at Netflix?” mainly because of mundane reasons related to software engineering. like they would do in a Jupyter notebook.

Open Source

Open Source AWS Infrastructure Energy

The State of DevOps Automation assessment: How automated are you?

Dynatrace

APRIL 22, 2024

In response to the scale and complexity of modern cloud-native technology, organizations are increasingly reliant on automation to properly manage their infrastructure and workflows. Operations automation: The operations section addresses the level of automation organizations use in maintaining and managing existing software.

DevOps

DevOps Government Artificial Intelligence Innovation

MLOps and DevOps: Why Data Makes It Different

O'Reilly

OCTOBER 19, 2021

As with many burgeoning fields and disciplines, we don’t yet have a shared canonical infrastructure stack or best practices for developing and deploying data-intensive applications. In effect, the engineer designs and builds the world wherein the software operates. The new category is often called MLOps.

DevOps

DevOps Software Engineering Infrastructure Open Source

Application observability meets developer observability: Unlock a 360º view of your environment

Dynatrace

NOVEMBER 6, 2023

In a recent webinar, Dynatrace DevOps activist Andi Grabner and senior software engineer Yarden Laifenfeld explored developer observability. Why is developer observability important for engineers? They also care about infrastructure: SREs require system visibility and incident management.

Development

Development DevOps Programming Cloud

Autonomous Cloud Enablement aka Scaling NoOps via Self-Service

Dynatrace

FEBRUARY 6, 2020

To do that, Anita’s team drove innovation around a common delivery pipeline to enable developers to automate operational tasks such as runbook execution to solve infrastructure problems. At Dynatrace our ACE team is enabling our engineers to deliver better software faster. Autonomous Cloud with Dynatrace.

Cloud

Cloud DevOps Engineering Speed

Dynatrace Perform 2024 Guide: Deriving business value from AI data analysis

Dynatrace

JANUARY 23, 2024

‘Composite’ AI, platform engineering, AI data analysis through custom apps: This focus on data reliability and data quality also highlights the need for organizations to bring a “composite AI” approach to IT operations, security, and DevOps. To learn more about platform engineering, explore the following resources.

Performance

Performance DevOps Innovation Energy

ConsoleMe: A Central Control Plane for AWS Permissions and Access

The Netflix TechBlog

MARCH 10, 2021

Motivation: Growth in the cloud has exploded, and it is now easier than ever to create infrastructure on the fly. Groups beyond software engineering teams are standing up their own systems and automation. At many companies, managing cloud hygiene and security usually falls under the infrastructure or security teams.

AWS

AWS Cloud Games Infrastructure
