Until recently, improvements in data center power efficiency compensated almost entirely for the increasing demand for computing resources. However, this trend is now reversing: with the rise of big data, cryptocurrencies, and AI, the IT sector contributes significantly to global greenhouse gas emissions.
In today’s digital world, software is everywhere, and it is behind most of our human and business interactions. This, in turn, accelerates the need for businesses to adopt software automation to improve and streamline processes. What is software automation? What is software analytics?
The shortcomings and drawbacks of batch-oriented data processing were widely recognized by the big data community quite a long time ago. Incremental computation over sliding windows is a family of techniques widely used in digital signal processing, in both software and hardware; one widely used engine in this space is Apache Spark [10].
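To make the idea concrete, here is a minimal Python sketch of incremental computation over a sliding window: the running sum is updated in O(1) per element instead of recomputing each window from scratch. The function name and window size are illustrative, not from the article.

```python
# Incremental sliding-window mean: update a running sum in O(1) per
# element rather than re-summing the whole window each time.
from collections import deque

def sliding_means(stream, window_size=5):
    window = deque()
    running_sum = 0.0
    for value in stream:
        window.append(value)
        running_sum += value
        if len(window) > window_size:
            running_sum -= window.popleft()  # retract the expired element
        yield running_sum / len(window)

# Example: smooth a signal without reprocessing each full window.
print(list(sliding_means([1, 2, 3, 4, 5, 6, 7], window_size=3)))
```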
The introduction of innovative technologies has brought the newest updates to software testing, development, design, and delivery. Nowadays, big data testing mainly centers on data validation, paving the way for the Internet of Things to take center stage. Meanwhile, AI and ML appear to be reaching a new level.
IT operations analytics is the process of unifying, storing, and contextually analyzing operational data to understand the health of applications, infrastructure, and environments and streamline everyday operations. ITOA collects operational data to identify patterns and anomalies for faster incident management and near-real-time insights.
Kubernetes has emerged as the go-to container orchestration platform for data engineering teams. In 2018, widespread adoption of Kubernetes for big data processing is anticipated. Organisations are already using Kubernetes for a variety of workloads [1] [2], and data workloads are up next. Key challenges include performance.
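As a hedged illustration of running a data workload on Kubernetes, the sketch below submits a batch Job using the official Python client (pip install kubernetes). The namespace, image name, and command are placeholders, not from the article.

```python
# Submit a one-off batch data job to Kubernetes via the official client.
from kubernetes import client, config

def submit_batch_job(name: str, image: str, command: list[str]) -> None:
    config.load_kube_config()  # use load_incluster_config() inside a pod
    job = client.V1Job(
        api_version="batch/v1",
        kind="Job",
        metadata=client.V1ObjectMeta(name=name),
        spec=client.V1JobSpec(
            template=client.V1PodTemplateSpec(
                spec=client.V1PodSpec(
                    restart_policy="Never",
                    containers=[client.V1Container(
                        name=name, image=image, command=command)],
                )
            ),
            backoff_limit=2,  # retry failed pods a couple of times
        ),
    )
    client.BatchV1Api().create_namespaced_job(namespace="data-jobs", body=job)

# Hypothetical usage; image and script are placeholders.
submit_batch_job("daily-aggregation", "example.org/etl:latest",
                 ["python", "aggregate.py"])
```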
Interview with Pallavi Phadnis. This post is part of our “Data Engineers of Netflix” series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. Pallavi Phadnis is a Senior Software Engineer at Netflix. Pallavi, what’s your journey to data engineering at Netflix?
The study analyzes factual Kubernetes production data from thousands of organizations worldwide that are using the Dynatrace Software Intelligence Platform to keep their Kubernetes clusters secure, healthy, and high performing. Open-source software drives a vibrant Kubernetes ecosystem. Java, Go, and Node.js
This blog will explore these two systems and how they perform auto-diagnosis and remediation across our Big Data Platform and Real-time infrastructure. This has led to a dramatic reduction in the time it takes to detect issues in hardware or bugs in recently rolled out data platform software.
NoOps is a concept in software development that seeks to automate processes and eliminate the need for an extensive IT operations team. Organizations adopt DevOps, where developers and operations work together in a continuous loop, so they can develop software and resolve issues efficiently before they affect users. What is NoOps?
AIOps combines big data and machine learning to automate key IT operations processes, including anomaly detection and identification, event correlation, and root-cause analysis. A truly modern AIOps solution also serves the entire software development lifecycle to address the volume, velocity, and complexity of multicloud environments.
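To illustrate the anomaly-detection step that AIOps platforms automate, here is a minimal Python sketch that flags metric samples deviating strongly from the sample mean. The z-score threshold and the latency values are invented for the example; production systems use far more sophisticated baselining.

```python
# Flag samples whose z-score against the batch mean exceeds a threshold.
import statistics

def detect_anomalies(samples, z_threshold=2.0):
    mean = statistics.fmean(samples)
    stdev = statistics.stdev(samples)
    return [(i, x) for i, x in enumerate(samples)
            if stdev > 0 and abs(x - mean) / stdev > z_threshold]

latencies_ms = [102, 98, 105, 99, 101, 97, 480, 103]  # one obvious outlier
print(detect_anomalies(latencies_ms))  # -> [(6, 480)]
```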
Applications used in the field of big data process huge amounts of information, and this often happens in real time. Naturally, such applications must be highly reliable so that no error in the code can interfere with data processing. It is an open-source framework for distributed processing of large amounts of data.
This kind of automation can support key IT operations, such as infrastructure, digital processes, business processes, and big data automation. Big data automation tools provide the means to collect, transfer, and process the large volumes of data that are increasingly common in analytics applications.
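A minimal sketch of the collect, transfer, and process pattern such tools implement is below; the in-memory source and sink stand in for real connectors such as APIs, queues, or warehouses, and all names are hypothetical.

```python
# Collect -> transform -> load, composed as generators so records stream
# through without materializing intermediate datasets.
def collect(source):
    yield from source  # a real tool would read from an API, log, or queue

def transform(records):
    for r in records:
        if r.get("status") == "ok":  # drop malformed events
            yield {**r, "value": float(r["value"])}

def load(records, sink):
    sink.extend(records)  # a real tool would write to a warehouse or lake

warehouse: list[dict] = []
events = [{"status": "ok", "value": "1.5"}, {"status": "error"}]
load(transform(collect(events)), warehouse)
print(warehouse)  # -> [{'status': 'ok', 'value': 1.5}]
```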
BPAY is in the midst of its digital transformation journey, in which it is discovering the critical importance of developing “contemporary ways of designing, operating, and using” its software. At the same time, every step you take toward intelligently observing data across your organization brings increasingly greater rewards.
As cloud and big data complexity scales beyond the ability of traditional monitoring tools to handle, next-generation cloud monitoring and observability are becoming necessities for IT teams. With agent monitoring, third-party software collects data and reports from the component the agent is attached to.
Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability and Efficiency. By Di Lin, Girish Lingappa, Jitender Aswani. Imagine yourself in the role of a data-inspired decision maker staring at a metric on a dashboard, about to make a critical business decision, but pausing to ask a question: “Can
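A toy sketch of the lineage idea is below: jobs are recorded as edges from input datasets to output datasets, and a traversal answers “which sources feed this metric?” All dataset names are hypothetical; this is not Netflix’s actual lineage model.

```python
# Dataset lineage as a graph: each job adds edges from inputs to output;
# upstream() walks the graph to find every source a dataset depends on.
from collections import defaultdict

edges = defaultdict(set)  # dataset -> datasets it is derived from

def record_job(inputs, output):
    edges[output].update(inputs)

def upstream(dataset, seen=None):
    seen = seen if seen is not None else set()
    for parent in edges[dataset]:
        if parent not in seen:
            seen.add(parent)
            upstream(parent, seen)
    return seen

record_job({"raw_plays", "raw_users"}, "daily_engagement")
record_job({"daily_engagement"}, "dashboard_metric")
print(upstream("dashboard_metric"))
# -> {'daily_engagement', 'raw_plays', 'raw_users'}
```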
The variables that can impact the performance of an application range from coding errors or ‘bugs’ in the software, database slowdowns, and hosting and network performance to operating system and device type support. The Dynatrace Software Intelligence Platform provides all-in-one advanced observability. What sets Dynatrace apart?
Over the past decade, the industry moved from paper-based to electronic health records (EHRs)—digitizing the backbone of patient data. As patient care continues to evolve, IT teams have accelerated this shift from legacy, on-premises systems to cloud technology in order to build, test, and deploy software faster and fuel healthcare innovation.
While data lakehouses combine the flexibility and cost-efficiency of data lakes with the querying capabilities of data warehouses, it’s important to understand how these storage environments differ. Data warehouses were the original big data storage option.
Data Productivity at Scale. Speaker: Iaroslav Zeigerman (Co-Founder and Chief Architect at Tobiko Data). Summary: The development and evolution of data pipelines are hindered by outdated tooling compared to software development.
By embracing public cloud and hybrid cloud computing environments, IT teams can further accelerate development and automate software deployment and management. A container is a small, self-contained, fully functional software package that can run an application or service, isolated from other applications running on the same host.
Various software systems are needed to design, build, and operate this CDN infrastructure, and a significant number of them are written in Python. Orchestration: The Big Data Orchestration team is responsible for providing all of the services and tooling to schedule and execute ETL and ad hoc pipelines.
Stop worrying about log data ingest and storage — start creating value instead. Dynatrace® Grail, an additional core technology for the Dynatrace® Software Intelligence platform, is the world’s first data lakehouse with massively parallel processing (MPP) for context-rich observability, business, and security analytics.
A hybrid cloud, however, combines public infrastructure and services with on-premises resources or a private data center to create a flexible, interconnected IT environment. Hybrid environments provide more options for storing and analyzing ever-growing volumes of big data and for deploying digital services.
At Dynatrace Perform 2023, Maciej Pawlowski, senior director of product management for infrastructure monitoring at Dynatrace, and a senior software engineer at a U.K.-based financial services group discussed how the bank uses log monitoring on the Dynatrace platform, with an emphasis on observability and security data.
We at Netflix, as a streaming service running on millions of devices, have a tremendous amount of data about device capabilities and characteristics, as well as runtime data, in our big data platform. With large data comes the opportunity to leverage it for predictive and classification-based analysis.
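As a toy illustration of classification-based analysis over device data, here is a nearest-centroid classifier in plain Python. The features (memory, CPU cores) and the performance-tier labels are invented for the example, not taken from Netflix’s platform.

```python
# Nearest-centroid classification: assign a device to the tier whose
# training centroid is closest in feature space.
def centroid(rows):
    return [sum(col) / len(rows) for col in zip(*rows)]

def train(samples):  # samples: label -> list of feature vectors
    return {label: centroid(rows) for label, rows in samples.items()}

def predict(model, x):
    dist = lambda a, b: sum((p - q) ** 2 for p, q in zip(a, b))
    return min(model, key=lambda label: dist(model[label], x))

# Features: [memory_gb, cpu_cores]; labels: hypothetical device tiers.
model = train({"low_end": [[1, 2], [2, 2]], "high_end": [[8, 8], [16, 12]]})
print(predict(model, [12, 10]))  # -> 'high_end'
```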
Application vulnerabilities remain a key concern. Application vulnerabilities—weaknesses or flaws in software applications that malicious attackers can use to exploit IT systems—exist in any type of software, including web and mobile applications. Together they equal better software. Shift-right ensures reliability in production.
This can include the use of cloud computing, artificial intelligence, big data analytics, the Internet of Things (IoT), and other digital tools. One of the significant challenges that come with digital transformation is ensuring that software systems remain reliable and secure. This is where software testing comes in.
Artificial intelligence for IT operations, or AIOps, combines big data and machine learning to provide actionable insight for IT teams to shape and automate their operational strategy. With modern multicloud environments, AIOps must evolve to include the full software delivery lifecycle. Taking AIOps to the next level.
Besides the traditional system hardware, storage, routers, and software, ITOps also includes virtual components of the network and cloud infrastructure. AIOps (artificial intelligence for IT operations) combines big data, AI algorithms, and machine learning for actionable, real-time insights that help ITOps continuously improve operations.
Gartner defines AIOps as the combination of “big data and machine learning to automate IT operations processes, including event correlation, anomaly detection, and causality determination.” But what is AIOps, exactly? And how can it support your organization? What is AIOps? Why is AIOps needed?
During the Performance Clinic episode, I asked Stefano to tell us more about this changing world and how we can leverage automation, AI and machine learning to optimize modern software stacks despite the increased complexity.
With the launch of the AWS Europe (London) Region, AWS can enable many more UK enterprise, public sector, and startup customers to reduce IT costs, address data locality needs, and embark on rapid transformations in critical new areas, such as big data analysis and the Internet of Things. Fraud.net is a good example of this.
Netflix software infrastructure is a large distributed ecosystem that consists of specialized functional tiers operated on AWS and on Netflix-owned services.
How behavior analytics works: User behavior analytics works by first collecting, then analyzing, user behavior data. To collect user behavior data, organizations typically use analytics software to gather a large volume of data on user behavior from relevant sources.
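A minimal sketch of that two-step flow is below: collect raw user events, then analyze them into per-user behavior summaries. The event fields and actions are illustrative placeholders.

```python
# Aggregate raw user events into per-user action counts.
from collections import Counter

events = [
    {"user": "u1", "action": "click"},
    {"user": "u1", "action": "click"},
    {"user": "u2", "action": "page_view"},
]

def analyze(events):
    by_user: dict[str, Counter] = {}
    for e in events:  # the collect step would normally stream from a tracker
        by_user.setdefault(e["user"], Counter())[e["action"]] += 1
    return by_user

print(analyze(events))
# -> {'u1': Counter({'click': 2}), 'u2': Counter({'page_view': 1})}
```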
As teams try to gain insight into this data deluge, they have to balance the need for speed, data fidelity, and scale with capacity constraints and cost. To solve this problem, Dynatrace launched Grail, its causational data lakehouse , in 2022.
Utilizing cloned real traffic, we can exercise the diversity of inputs from a wide range of devices and device application software versions in production. Additionally, for mismatches, we record the normalized and unnormalized responses from both sides to another big data table, along with other relevant parameters, such as the diff.
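A hedged sketch of the replay-and-compare idea follows: normalize both responses by stripping volatile fields, then record mismatches with both the normalized and raw payloads for offline analysis. The field names and log structure are hypothetical, not Netflix’s actual schema.

```python
# Compare a production response against a candidate response after
# stripping fields that legitimately differ between runs.
VOLATILE_FIELDS = {"timestamp", "request_id"}

def normalize(response: dict) -> dict:
    return {k: v for k, v in response.items() if k not in VOLATILE_FIELDS}

def compare(prod_resp: dict, candidate_resp: dict, mismatch_log: list) -> bool:
    if normalize(prod_resp) == normalize(candidate_resp):
        return True
    mismatch_log.append({
        "normalized": (normalize(prod_resp), normalize(candidate_resp)),
        "raw": (prod_resp, candidate_resp),  # kept for diff inspection
    })
    return False

log: list = []
compare({"title": "A", "timestamp": 1}, {"title": "B", "timestamp": 2}, log)
print(len(log))  # -> 1 mismatch recorded
```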
Dhevi joined Netflix in July 2020 and is one of many Data Engineers who have onboarded remotely during the pandemic. In this post, Dhevi talks about her passion for data engineering and taking on a new role during the pandemic. One great thing about working with data is the impact you can create as an engineer.
The variables that can impact the performance of an application range from coding errors or ‘bugs’ in the software, database slowdowns, and hosting and network performance to operating system and device type support. And I’m sure we’ve all experienced frustration when an application crashes, is slow to load, or doesn’t load at all.
Backfill: backfilling datasets is a common operation in big data processing. For example, a job might reprocess aggregates for the past 3 days because it assumes there will be late-arriving data, while data older than 3 days isn’t worth the cost of reprocessing.
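The lookback rule is easy to sketch: recompute only the trailing N days so late-arriving data is picked up. The 3-day window matches the excerpt’s example; the per-partition job body is a placeholder.

```python
# Compute the set of daily partitions to reprocess for a given run date.
from datetime import date, timedelta

def backfill_dates(run_date: date, lookback_days: int = 3) -> list[date]:
    return [run_date - timedelta(days=d) for d in range(lookback_days)]

for day in backfill_dates(date(2024, 1, 10)):
    # a real job would overwrite that day's aggregate partition here
    print(f"reprocessing partition dt={day.isoformat()}")
```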
Helios also serves as a reference architecture for how Microsoft envisions its next generation of distributed big data processing systems being built. What follows is a discussion of where big data systems might be heading, heavily inspired by the remarks in this paper, but with several of my own thoughts mixed in.
Seer: leveraging big data to navigate the complexity of performance debugging in cloud microservices (Gan et al., ASPLOS’19). There are multiple sources of queueing in both hardware and software, and Seer works best when using deep instrumentation to capture these.
Interview with Samuel Setegne. This post is part of our “Data Engineers of Netflix” interview series, where our very own data engineers talk about their journeys to Data Engineering @ Netflix. Samuel Setegne is a Senior Software Engineer on the Core Data Science and Engineering team.
Speedier access to stored information in distributed storage is achieved by leveraging software-defined storage solutions and strategies like sharding, which distributes sections of large databases across many servers and improves scalability by dividing work among them.
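A minimal sketch of hash-based sharding follows: a stable hash of the record key picks which server owns it, spreading load across the cluster. The shard names are placeholders. Note that simple modulo hashing reshuffles most keys when the shard count changes; production systems typically use consistent hashing to limit that movement.

```python
# Route each key to a shard via a stable hash, so the same key always
# lands on the same server.
import hashlib

SHARDS = ["db-0", "db-1", "db-2", "db-3"]

def shard_for(key: str) -> str:
    digest = hashlib.sha256(key.encode()).digest()
    return SHARDS[int.from_bytes(digest[:8], "big") % len(SHARDS)]

print(shard_for("user:42"))   # deterministic routing per key
print(shard_for("user:1337"))
```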