Design, Event and Scalability - Technology Performance Pulse

Designing and Maintaining Event-Driven Architectures

DZone

MARCH 19, 2025

Event-driven architecture (EDA) gives your system the ability to receive and respond to changes in real time, making it easier to scale. Decoupling components is the core theme of EDA, which makes it flexible, allowing it to scale asynchronously based on events. This approach makes systems reactive, scalable, and resilient to failures.

Architecture

Architecture Design Scalability Monitoring

New integrations announced at AWS re:Invent enhance cloud performance, security, and automation

Dynatrace

DECEMBER 3, 2024

This year’s AWS re:Invent will showcase a suite of new AWS and Dynatrace integrations designed to enhance cloud performance, security, and automation. Gaining precise insights with Dynatrace integration for AWS EventBridge Now supporting a deeper integration with AWS EventBridge, Dynatrace is able to act as a consumer of AWS events.

AWS

AWS Cloud Performance Innovation

How To Design For High-Traffic Events And Prevent Your Website From Crashing

Smashing Magazine

JANUARY 7, 2025

How To Design For High-Traffic Events And Prevent Your Website From Crashing How To Design For High-Traffic Events And Prevent Your Website From Crashing Saad Khan 2025-01-07T14:00:00+00:00 2025-01-07T22:04:48+00:00 This article is sponsored by Cloudways Product launches and sales typically attract large volumes of traffic.

Traffic

Traffic Website Design Cache

Designing Instagram

High Scalability

JANUARY 11, 2022

Design a photo-sharing platform similar to Instagram where users can upload their photos and share it with their followers. High Level Design. Component Design. API Design. We have provided the API design of posting an image on Instagram below. API Design. Problem Statement. Architecture. Data Models.

Design

Design Media Storage Logistics

Rapid Event Notification System at Netflix

The Netflix TechBlog

FEBRUARY 18, 2022

To this end, we developed a Rapid Event Notification System (RENO) to support use cases that require server initiated communication with devices in a scalable and extensible manner. In this blog post, we will give an overview of the Rapid Event Notification System at Netflix and share some of the learnings we gained along the way.

Systems

Systems Traffic Architecture Mobile

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

By: Rajiv Shringi , Oleksii Tkachuk , Kartik Sathyanarayanan Introduction In our previous blog post, we introduced Netflix’s TimeSeries Abstraction , a distributed service designed to store and query large volumes of temporal event data with low millisecond latencies. Today, we’re excited to present the Distributed Counter Abstraction.

Latency

Latency Cache Infrastructure Strategy

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

RabbitMQ is designed for flexible routing and message reliability, while Kafka handles high-throughput event streaming and real-time data processing. Kafka is optimized for high-throughput event streaming , excelling in real-time analytics and large-scale data ingestion. What is RabbitMQ?

Latency

Latency Analytics Architecture Storage

Mastering Scalability and Performance: A Deep Dive Into Azure Load Balancing Options

DZone

JANUARY 8, 2024

As organizations increasingly migrate their applications to the cloud, efficient and scalable load balancing becomes pivotal for ensuring optimal performance and high availability. Each of these services addresses specific use cases, offering diverse functionalities to meet the demands of modern applications. What Is Load Balancing?

Azure

Azure Scalability Traffic Performance

DevOps automation: From event-driven automation to answer-driven automation [with causal AI]

Dynatrace

JULY 24, 2023

In the world of DevOps and SRE, DevOps automation answers the undeniable need for efficiency and scalability. They need event-driven automation that not only responds to events and triggers but also analyzes and interprets the context to deliver precise and proactive actions.

DevOps

DevOps Traffic Efficiency Servers

Foundation Model for Personalized Recommendation

The Netflix TechBlog

MARCH 28, 2025

These insights have shaped the design of our foundation model, enabling a transition from maintaining numerous small, specialized models to building a scalable, efficient system. To harness this data effectively, we employ a process of interaction tokenization, ensuring meaningful events are identified and redundancies are minimized.

Tuning

Tuning Efficiency Latency Strategy

Title Launch Observability at Netflix Scale

The Netflix TechBlog

DECEMBER 17, 2024

How can we design systems that recognize these nuances and empower every title to shine and bring joy to ourmembers? The complexity of these operational demands underscored the urgent need for a scalable solution. Using the source of truth: Logs serve as a reliable source of truth by providing a comprehensive record of system events.

Traffic

Traffic Scalability Strategy Monitoring

Best Practices for Scaling RabbitMQ

Scalegrid

FEBRUARY 24, 2025

Key Takeaways RabbitMQ improves scalability and fault tolerance in distributed systems by decoupling applications, enabling reliable message exchanges. This decoupling is crucial in modern architectures where scalability and fault tolerance are paramount. This setup prioritizes data safety, with most replicas online at any given time.

Best Practices

Best Practices Traffic Strategy Efficiency

AWS serverless services: Exploring your options

Dynatrace

OCTOBER 7, 2021

Instead of worrying about infrastructure management functions, such as capacity provisioning and hardware maintenance, teams can focus on application design, deployment, and delivery. Scalability. Finally, there’s scalability. The first benefit is simplicity. Let’s explore each in more detail. Compute services.

Serverless

Serverless AWS Lambda Storage

Google Cloud Next 2024: AI innovation for Google Cloud

Dynatrace

MARCH 29, 2024

This year, Google’s event will take place from April 9 to 11 in Las Vegas. As organizations continue to expand within cloud-native environments using Google Cloud, ensuring scalability becomes a top priority. The annual Google Cloud Next conference explores the latest innovations for cloud technology and Google Cloud.

Google

Google Innovation Cloud Analytics

Title Launch Observability at Netflix Scale

The Netflix TechBlog

MARCH 4, 2025

In order to allow for this mimicking, many systems implement an event handling, where they convert our request into a call to the real service with properties enabled to log when titles are filtered out of their response and why. Conclusion Throughout this series, weve explored the journey of enhancing title launch observability at Netflix.

Traffic

Traffic Strategy Entertainment Innovation

Efficient Message Distribution Using AWS SNS Fanout

DZone

FEBRUARY 29, 2024

In the world of cloud computing and event-driven applications, efficiency and flexibility are absolute necessities. A smooth flow of messages in an event-driven application is the key to its performance and efficiency. A critical component of such an application is message distribution.

Efficiency

Efficiency AWS Scalability Architecture

Distributed tracing with Dynatrace just got even better

Dynatrace

MARCH 11, 2025

The Dynatrace platform now enables comprehensive data exploration and interactive analytics across data sets (trace, logs, events, and metrics)empowering you to solve complex use cases, handle any observability scenario, and gain unprecedented visibility into your systems.

Analytics

Analytics Games Innovation Metrics

Dynatrace supports Amazon Linux 2023 as an AWS launch partner

Dynatrace

MARCH 14, 2023

Amazon’s new general-purpose Linux for AWS is designed to provide a secure, stable, and high-performance execution environment to develop and run cloud applications. Saving your cloud operations and SRE teams hours of guesswork and manual tagging, the Davis AI engine analyzes billions of events in real time.

AWS

AWS Lambda Serverless Virtualization

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

Dynatrace

JANUARY 31, 2024

The exponential growth of data volume—including observability, security, software lifecycle, and business data—forces organizations to deal with cost increases while providing flexible, robust, and scalable ingest. Figure 2: Configuration and ingest throughput for each source, grouped by type Protect your sensitive data Privacy by design.

Analytics

Analytics Processing Transportation Storage

Embrace event-driven computing: Amazon expands DynamoDB with streams, cross-region replication, and database triggers

All Things Distributed

JULY 14, 2015

A more scalable option is to decouple these systems and build a pipe that connects these engines and feeds all change records from the source database to the data warehouse (e.g., DynamoDB Streams simplifies and improves this design pattern with a distributed systems approach. Amazon Redshift) and Elasticsearch machines.

Database

Database Lambda AWS IoT

Introducing Netflix TimeSeries Data Abstraction Layer

The Netflix TechBlog

OCTOBER 8, 2024

The Key-Value Abstraction offers a flexible, scalable solution for storing and accessing structured key-value data, while the Data Gateway Platform provides essential infrastructure for protecting, configuring, and deploying the data tier.

Latency

Latency Storage Traffic Tuning

What is function as a service? App development gets FaaS and furious

Dynatrace

AUGUST 11, 2022

Before an organization moves to function as a service, it’s important to understand how it works, its benefits and challenges, its effect on scalability, and why cloud-native observability is essential for attaining peak performance. How does function as a service affect scalability? But what is FaaS? What is FaaS?

Development

Development Serverless Best Practices Lambda

Six causes of major software outages–And how to avoid them

Dynatrace

AUGUST 8, 2024

As recent events have demonstrated, major software outages are an ever-present threat in our increasingly digital world. High demand Sudden spikes in demand can overwhelm systems that are not designed to handle such loads, leading to outages. This often occurs during major events, promotions, or unexpected surges in usage.

Software

Software Software Infrastructure Network

Accelerate and empower Site Reliability Engineering with Dynatrace observability

Dynatrace

OCTOBER 10, 2023

Process Improvements (50%) The allocation for process improvements is devoted to automation and continuous improvement SREs help to ensure that systems are scalable, reliable, and efficient. SREs invest significant effort in enhancing software reliability, scalability, and dependability. However, this is highly unlikely.

Engineering

Engineering DevOps Innovation Strategy

What is observability? Not just logs, metrics and traces

Dynatrace

OCTOBER 1, 2021

Many organizations also adopt an observability solution to help them detect and analyze the significance of events to their operations, software development life cycles, application security, and end-user experiences. The architects and developers who create the software must design it to be observed. Benefits of observability.

Metrics

Metrics Open Source Monitoring Cloud

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

It’s architecture was specially designed to manage large-scale data warehouses and business intelligence workloads by giving you the ability to spread your data out across a multitude of servers. Greenplum uses an MPP database design that can help you develop a scalable, high performance deployment. At a glance – TLDR.

Big Data

Big Data Database Artificial Intelligence Open Source

Ten Tips For The Aspiring Designer Beginners (Part 1)

Smashing Magazine

JANUARY 5, 2022

Ten Tips For The Aspiring Designer Beginners (Part 1). Ten Tips For The Aspiring Designer Beginners (Part 1). In this article, I want to share ten tips that helped me grow and become a better designer, and I hope these tips will also help you while you’re trying to find more solid ground under your feet. Luis Ouriach.

Design

Design Website Social Media Best Practices

Expanded Grail data lakehouse and new Dynatrace user experience unlock boundless analytics

Dynatrace

FEBRUARY 15, 2023

Grail – the foundation of exploratory analytics Grail can already store and process log and business events. Introducing Metrics on Grail Despite their many advantages, modern cloud-native architectures can result in scalability and fragmentation challenges. Grail solves this scalability issue!

Analytics

Analytics Social Media Metrics Design

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Dynatrace

NOVEMBER 7, 2023

The old saying in the software development community, “You build it, you run it,” no longer works as a scalable approach in the modern cloud-native world. The ability to effectively manage multi-cluster infrastructure is critical to consistent and scalable service delivery. Automation, automation, automation.

Engineering

Engineering DevOps Best Practices Infrastructure

Architecture Patterns: Publish/Subscribe

DZone

JANUARY 2, 2024

The Publish/Subscribe (Pub/Sub) pattern is a widely-used software architecture paradigm, particularly relevant in the design of distributed, messaging-driven systems. The communication framework is decoupled, scalable, and dynamic, making it useful for addressing complex software requirements in modern application development.

Architecture

Architecture Software Architecture Scalability Design

Enhance data management with Grail: Ultimate guide to custom buckets and security policies

Dynatrace

OCTOBER 6, 2023

Grail: Enterprise-ready data lakehouse Grail, the Dynatrace causational data lakehouse, was explicitly designed for observability and security data, with artificial intelligence integrated into its foundation. Buckets are similar to folders, a physical storage location. There is a default bucket for each table.

Artificial Intelligence

Artificial Intelligence Metrics Analytics Storage

From syslog to AWS Firehose: Dynatrace log management innovations that enhance observability

Dynatrace

SEPTEMBER 5, 2024

Dynatrace supports scalable data ingestion, ensuring your observability infrastructure grows with your cloud environment. Dynatrace enhances Fluent Bit’s log management by integrating observability signals like traces, events, and metrics, providing a complete view of cloud-native application performance.

Innovation

Innovation AWS Analytics Storage

What is security analytics?

Dynatrace

JUNE 10, 2024

Improved compliance A better understanding of data security across multiple applications and environments provides a unified view of events and information. Security analytics vs. SIEM Security information and event management (SIEM) tools are staples of enterprise security. This offers two advantages for compliance.

Analytics

Analytics Network Open Source Hardware

Why growing AI adoption requires an AI observability strategy

Dynatrace

JANUARY 17, 2024

By adopting a cloud- and edge-based AI approach, teams can benefit from the flexibility, scalability, and pay-per-use model of the cloud while also reducing the latency, bandwidth, and cost of sending AI data to cloud-based operations. Causal AI is a technique that determines the precise root causes and effects of events or behaviors.

Strategy

Strategy Artificial Intelligence Storage Cloud

Scalable Solutions with Percona Distribution for PostgreSQL (Part 2): Using Citus

Percona

OCTOBER 20, 2023

You can read part one here: Scalable Solutions With Percona Distribution for PostgreSQL: Set Up Three PostgreSQL Database Instances. Check the “ Scalable Solutions with Percona Distribution for PostgreSQL (Part 1) ” blog post to set up three nodes. event_time: A timestamp to record when the event occurred.

Scalability

Scalability Database Open Source Systems

Accelerate your cloud journey with Dynatrace observability for AWS S3 logs

Dynatrace

JUNE 27, 2023

Logs complement out-of-the-box metrics and enable automated actions for responding to availability, security, and other service events. Centralized log management for scalable ingestion into Grail As AWS S3 proves to be the preferred way of storing cloud logs, enterprise customers face mounting challenges in putting S3 log data to use.

AWS

AWS Cloud Lambda Analytics

Answer-driven DevOps automation: Automation use cases that accelerate insights

Dynatrace

MARCH 9, 2023

In turn, manual approaches to identifying code issues and troubleshooting are not scalable. Event-driven automation is typically the next stage in DevOps automation maturity, adding functions as a service to handle problem remediation and threat protection.

DevOps

DevOps Infrastructure Software Software

Monitoring of Kubernetes Infrastructure for day 2 operations

Dynatrace

JULY 8, 2020

One of the promises of container orchestration platforms is to make i t easier for the developers to accelerate the deployment of their app lication s without having to worry about scalability and infrastructure dependencies. Kubernetes events are a type of object providing context on what ’s happening inside a cluster.

Infrastructure

Infrastructure Monitoring Cloud Metrics

SRE vs DevOps: What you need to know

Dynatrace

FEBRUARY 24, 2021

The events of 2020 accelerated the trend of organizations shifting to cloud-native technologies in response to the dramatic increase in demand for online services. As Google’s Ben Treynor explains , “Fundamentally, it’s what happens when you ask a software engineer to design an operations function.”

DevOps

DevOps Software Engineering Speed Google

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

The Netflix TechBlog

MARCH 25, 2019

You are designing a learning system to forecast Service Level Agreement (SLA) violations and would want to factor in all upstream dependencies and corresponding historical states. Design a flexible data model ? —?Represent Therefore, the ingestion approach for data lineage is designed to work with many disparate data sources.

Infrastructure

Infrastructure Big Data Transportation Architecture

Black Hat 2024: Observability for DevSecOps and scaled security posture management

Dynatrace

JULY 29, 2024

Security analysts are drowning, with 70% of security events left unexplored , crucial months or even years can pass before breaches are understood. After a security event, many organizations often don’t know for months—or even years—when, why, or how it happened. Read now and learn more!

Analytics

Analytics Government DevOps Efficiency

Towards a Reliable Device Management Platform

The Netflix TechBlog

AUGUST 30, 2021

In the Device Management Platform, this is achieved by having device updates be event-sourced through the control plane to the cloud so that NTS will always have the most up-to-date information about the devices available for testing. The RAE is configured to be effectively a router that devices under test (DUTs) are connected to.

Latency

Latency Traffic Transportation Cloud

The history of Grail: Why you need a data lakehouse

Dynatrace

OCTOBER 4, 2022

A data lakehouse addresses these limitations and introduces an entirely new architectural design. From the beginning, Grail was built to be fast and scalable to manage massive volumes of data. Consider a log event in which the event itself has fields such as error code, severity, or time stamp.

Artificial Intelligence

Artificial Intelligence Analytics Storage Architecture

What is Cloud Computing? According to ChatGPT.

High Scalability

DECEMBER 16, 2022

This model of computing has become increasingly popular in recent years, as it offers a number of benefits, including cost savings, flexibility, scalability, and increased efficiency. I'm sorry, but as a large language model trained by OpenAI, I don't have the ability to browse the internet or keep up-to-date with current events.

Cloud

Cloud Serverless Internet Internet

Designing and Maintaining Event-Driven Architectures

New integrations announced at AWS re:Invent enhance cloud performance, security, and automation

Trending Sources

How To Design For High-Traffic Events And Prevent Your Website From Crashing

Designing Instagram

Rapid Event Notification System at Netflix

Netflix’s Distributed Counter Abstraction

RabbitMQ vs. Kafka: Key Differences

Mastering Scalability and Performance: A Deep Dive Into Azure Load Balancing Options

DevOps automation: From event-driven automation to answer-driven automation [with causal AI]

Foundation Model for Personalized Recommendation

Title Launch Observability at Netflix Scale

Best Practices for Scaling RabbitMQ

AWS serverless services: Exploring your options

Google Cloud Next 2024: AI innovation for Google Cloud

Title Launch Observability at Netflix Scale

Efficient Message Distribution Using AWS SNS Fanout

Distributed tracing with Dynatrace just got even better

Dynatrace supports Amazon Linux 2023 as an AWS launch partner

Dynatrace OpenPipeline: Stream processing data ingestion converges observability, security, and business data at massive scale for analytics and automation in context

Embrace event-driven computing: Amazon expands DynamoDB with streams, cross-region replication, and database triggers

Introducing Netflix TimeSeries Data Abstraction Layer

What is function as a service? App development gets FaaS and furious

Six causes of major software outages–And how to avoid them

Accelerate and empower Site Reliability Engineering with Dynatrace observability

What is observability? Not just logs, metrics and traces

What is Greenplum Database? Intro to the Big Data Database

Ten Tips For The Aspiring Designer Beginners (Part 1)

Expanded Grail data lakehouse and new Dynatrace user experience unlock boundless analytics

Unlock the Power of DevSecOps with Newly Released Kubernetes Experience for Platform Engineering

Architecture Patterns: Publish/Subscribe

Enhance data management with Grail: Ultimate guide to custom buckets and security policies

From syslog to AWS Firehose: Dynatrace log management innovations that enhance observability

What is security analytics?

Why growing AI adoption requires an AI observability strategy

Scalable Solutions with Percona Distribution for PostgreSQL (Part 2): Using Citus

Accelerate your cloud journey with Dynatrace observability for AWS S3 logs

Answer-driven DevOps automation: Automation use cases that accelerate insights

Monitoring of Kubernetes Infrastructure for day 2 operations

SRE vs DevOps: What you need to know

Building and Scaling Data Lineage at Netflix to Improve Data Infrastructure Reliability, and…

Black Hat 2024: Observability for DevSecOps and scaled security posture management

Towards a Reliable Device Management Platform

The history of Grail: Why you need a data lakehouse

What is Cloud Computing? According to ChatGPT.

Stay Connected