Performance and Storage - Technology Performance Pulse

Data Storage Formats for Big Data Analytics: Performance and Cost Implications of Parquet, Avro, and ORC

DZone

SEPTEMBER 9, 2024

One key factor that significantly affects the performance of data processing is the storage format of the data. This article explores the impact of different storage formats, specifically Parquet, Avro, and ORC on query performance and costs in big data environments on Google Cloud Platform (GCP).

Big Data

Big Data Storage Analytics Benchmarking

Storage Types Used on Cloud Computing Platforms

DZone

JANUARY 24, 2024

Because of the emergence of cloud services, a broad range of storage choices are now easily available to fulfill the different demands of both organizations and people. These storage alternatives have been designed to meet a range of requirements, including performance, scalability, durability, and price.

Storage

Storage Cloud Scalability Design

Dynatrace + Metis: Helping developers & SREs solve Database issues with AI

Dynatrace

MARCH 5, 2025

Site Reliability Engineers (SREs) also face significant challenges in maintaining database reliability, ensuring performance, and preventing disruptions in highly dynamic and distributed environments. Why this matters Databases are the backbone of modern applications, but they can also be a major source of performance bottlenecks.

Database

Database Development Tuning DevOps

How We Built a High-Performance Storage Layer for Our Ultra-Heterogeneous Computing Cluster

DZone

AUGUST 7, 2023

Finding a storage solution for our ultra-heterogeneous computing cluster was challenging. We tried two solutions: object storage with s3fs + network-attached storage (NAS) and Alluxio + Fluid + object storage , but they had limitations and performance issues.

Storage

Storage Performance Innovation Network

Implementing LSM Trees in Golang: A Comprehensive Guide

DZone

OCTOBER 30, 2024

They offer significant performance benefits through batching writes and optimizing reads with sorted data structures. We’ll also dive deeper into SSTables , MemTables , and compaction strategies for optimizing performance in high-load environments.

Strategy

Strategy Storage Efficiency Database

Block Size and Its Impact on Storage Performance

DZone

JUNE 21, 2024

This article analyzes the correlation between block sizes and their impact on storage performance. This paper deals with definitions and understanding of structured data vs unstructured data, how various storage segments react to block size changes, and differences between I/O-driven and throughput-driven workloads.

Storage

Storage Performance Benchmarking Processing

Optimizing data warehouse storage

The Netflix TechBlog

DECEMBER 21, 2020

At this scale, we can gain a significant amount of performance and cost benefits by optimizing the storage layout (records, objects, partitions) as the data lands into our warehouse. We built AutoOptimize to efficiently and transparently optimize the data and metadata storage layout while maximizing their cost and performance benefits.

Storage

Storage Latency Efficiency Data Engineering

Catching up with OpenTelemetry in 2025

Dynatrace

FEBRUARY 27, 2025

OpenTelemetry is enhancing GenAI observability : By defining semantic conventions for GenAI and implementing Python-based instrumentation for OpenAI, OpenTel is moving towards addressing GenAI monitoring and performance tuning needs. The Collector is expected to be ready for prime time in 2025, reaching the v1.0

Tuning

Tuning Open Source Innovation Monitoring

How To Debug Mobile App Database Problems and Optimize Data Storage Performance

DZone

JUNE 28, 2023

However, lurking beneath the surface lies a complex web of data storage and retrieval. That's why knowing how to debug mobile app database problems and optimize data storage performance is essential for developers seeking excellence. When database problems arise, they can disrupt even the most well-crafted applications.

Storage

Storage Mobile Database Performance

Storage handling improvements increase retention of transaction data for Dynatrace Managed

Dynatrace

JULY 29, 2021

Using existing storage resources optimally is key to being able to capture the right data over time. This decompression of data is achieved with minimal impact on performance and costs. Increased storage space availability. Storage quotas defined for your Dynatrace Managed deployment and its environments.

Storage

Storage Virtualization Infrastructure Availability

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

NOVEMBER 12, 2024

This counting service, built on top of the TimeSeries Abstraction, enables distributed counting at scale while maintaining similar low latency performance. After selecting a mode, users can interact with APIs without needing to worry about the underlying storage mechanisms and counting methods.

Latency

Latency Cache Infrastructure Strategy

Empowering Developers With Scalable, Secure, and Customizable Storage Solutions

DZone

MARCH 22, 2024

As a developer, engineer, or architect, finding the right storage solution that seamlessly integrates with your infrastructure while providing the necessary scalability, security, and performance can be a daunting task. Whether you're a small startup or a large enterprise, StoneFly's storage solutions can grow with your business.

Storage

Storage Scalability Development Network

Partitioning Hot and Cold Data Tier in Apache Kafka Cluster for Optimal Performance

DZone

JUNE 28, 2024

At first, data tiering was a tactic used by storage systems to reduce data storage costs. This involved grouping data that was not accessed as often into more affordable, if less effective, storage array choices. Even though they are quite costly, SSDs and flash can be categorized as high-performance storage classes.

Azure

Azure Storage Performance Cloud

Microsoft Ignite 2024 guide: Cloud observability for AI transformation

Dynatrace

NOVEMBER 18, 2024

The Grail™ data lakehouse provides fast, auto-indexed, schema-on-read storage with massively parallel processing (MPP) to deliver immediate, contextualized answers from all data at scale. By prioritizing observability, organizations can ensure the availability, performance, and security of business-critical applications.

Cloud

Cloud Azure Artificial Intelligence Innovation

Choosing the Right Storage for PostgreSQL on Kubernetes: A Benchmark Analysis

Percona

APRIL 1, 2025

As more organizations move their PostgreSQL databases onto Kubernetes, a common question arises: Which storage solution best handles its demands? Picking the right option is critical, directly impacting performance, reliability, and scalability.

Storage

Storage Benchmarking Scalability Database

Performing Sentiment Analysis Video

DZone

SEPTEMBER 13, 2021

This video talks about an end-to-end flow, wherein an email content having a specific subject line will be read, the email body would be analyzed using Azure Cognitive Services (Sentiment analysis), analysis results would be saved in Azure Table Storage and finally, the chart would be drawn in Excel.

Azure

Azure Performance Storage Code

Optimize your environment: Unveiling Dynatrace Hyper-V extension for enhanced performance and efficient troubleshooting

Dynatrace

OCTOBER 23, 2023

Secondly, determining the correct allocation of resources (CPU, memory, storage) to each virtual machine to ensure optimal performance without over-provisioning can be difficult. This presents a challenge for IT operations teams, specifically in identifying and addressing performance issues or planning how to prevent future issues.

Efficiency

Efficiency Virtualization Hardware Performance

RabbitMQ vs. Kafka: Key Differences

Scalegrid

FEBRUARY 6, 2025

This article outlines the key differences in architecture, performance, and use cases to help determine the best fit for your workload. Message brokers handle validation, routing, storage, and delivery, ensuring efficient and reliable communication. What is RabbitMQ?

Latency

Latency Analytics Architecture Storage

Battle of the RabbitMQ Queues: Performance Insights on Classic and Quorum

DZone

SEPTEMBER 19, 2024

RabbitMQ is a powerful and widely used message broker that facilitates communication between distributed applications by handling the transmission, storage, and delivery of messages.

Storage

Storage Performance Scalability Architecture

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

Dynatrace

OCTOBER 3, 2024

Dynatrace OTel Collector Understand your applications with ease Due to a lack of contextual insights and actionable intelligence, application teams often find themselves overwhelmed by data, unable to quickly identify the root causes of performance issues. There is no need to think about schema and indexes, re-hydration, or hot/cold storage.

Performance

Performance Architecture Innovation Latency

Comparing PostgreSQL DigitalOcean Performance & Pricing – ScaleGrid vs. DigitalOcean Managed Databases

Scalegrid

JUNE 4, 2020

In this post, we are going to compare the performance and pricing of DigitalOcean PostgreSQL vs. ScaleGrid PostgreSQL to help you determine the best PostgreSQL hosting service on DigitalOcean. On average, ScaleGrid provides over 30% more storage vs. DigitalOcean for PostgreSQL at the same affordable price. Compare Pricing. Single Node.

Database

Database Latency Benchmarking Performance

Perform 2023 Guide: Organizations mine efficiencies with automation, causal AI

Dynatrace

FEBRUARY 10, 2023

These are just some of the topics being showcased at Perform 2023 in Las Vegas. Perform 2023 news At Perform 2023 in Las Vegas, the headliner theme is IT automation. What’s more, organizations are no longer concerned only about application performance and sales numbers. We’ll post news here as it happens!

Efficiency

Efficiency Performance Analytics DevOps

The Power of Caching: Boosting API Performance and Scalability

DZone

AUGUST 16, 2023

Caching is the process of storing frequently accessed data or resources in a temporary storage location, such as memory or disk, to improve retrieval speed and reduce the need for repetitive processing.

Cache

Cache Scalability Performance Latency

What is Greenplum Database? Intro to the Big Data Database

Scalegrid

MAY 13, 2020

Greenplum uses an MPP database design that can help you develop a scalable, high performance deployment. High performance, query optimization, open source and polymorphic data storage are the major Greenplum advantages. The Greenplum Architecture. Greenplum Advantages. Major Use Cases. Query Optimization. over Greenplum 5.

Big Data

Big Data Database Artificial Intelligence Open Source

CDNs: Speed Up Performance by Reducing Latency

DZone

MAY 3, 2023

In the previous posts, we covered things we had to do to upload files on the front end, things we had to do on the back end, and optimizing costs by moving file uploads to object storage.

Latency

Latency Speed Performance Storage

Mastering Disk Space Management with MongoDB® Storage Engines

Scalegrid

MAY 11, 2024

MongoDB offers several storage engines that cater to various use cases. The default storage engine in earlier versions was MMAPv1, which utilized memory-mapped files and document-level locking. This allowed for sequential access and indexed access, but random writes could cause performance issues.

Storage

Storage Engineering Cache Database

Best MySQL DigitalOcean Performance – ScaleGrid vs. DigitalOcean Managed Databases

Scalegrid

JUNE 22, 2020

ScaleGrid provides 30% more storage on average vs. DigitalOcean for MySQL at the same affordable price. MySQL DigitalOcean Performance Benchmark. We are going to use a common, popular plan size using the below configurations for this performance benchmark: Comparison Overview. Compare Pricing. DigitalOcean. Instance Type.

Database

Database Benchmarking Latency Performance

OpenPipeline: Simplify access to critical business data

Dynatrace

NOVEMBER 4, 2024

Track business metrics, key performance indicators (KPIs), and service level objectives (SLOs) — automatically and in context with IT infrastructure and services — to promote collaboration between business and IT teams. Reduced storage and query overhead for business use cases. Improved data management.

Analytics

Analytics Airlines Metrics Monitoring

What is a Distributed Storage System

Scalegrid

FEBRUARY 8, 2024

A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. Understanding distributed storage is imperative as data volumes and the need for robust storage solutions rise.

Storage

Storage Systems Big Data Azure

Storage Strategies for PostgreSQL on Kubernetes

Percona

DECEMBER 11, 2023

There are a wealth of options on how you can approach storage configuration in Percona Operator for PostgreSQL , and in this blog post, we review various storage strategies — from basics to more sophisticated use cases. For example, you can choose the public cloud storage type – gp3, io2, etc, or set file system.

Storage

Storage Strategy Cloud Network

Optimizing IoT Performance in Industrial Environments

DZone

OCTOBER 9, 2024

A good starting point is to examine the storage, memory, and processing performance and verify that it aligns with the proposed use cases.

IoT

IoT Hardware Performance Internet

Best practices and key metrics for improving mobile app performance

Dynatrace

DECEMBER 13, 2023

Mobile applications (apps) are an increasingly important channel for reaching customers, but the distributed nature of mobile app platforms and delivery networks can cause performance problems that leave users frustrated, or worse, turning to competitors. What is mobile app performance? Issue remediation.

Best Practices

Best Practices Mobile Metrics Performance

Introducing Impressions at Netflix

The Netflix TechBlog

FEBRUARY 14, 2025

The enriched data is seamlessly accessible for both real-time applications via Kafka and historical analysis through storage in an Apache Iceberg table. Automating Performance Tuning with Autoscalers Tuning the performance of our Apache Flink jobs is currently a manual process.

Tuning

Tuning Latency Efficiency Storage

Indexed View for Aggregating Metrics

DZone

FEBRUARY 10, 2025

Microsoft Azure SQL is a robust, fully managed database platform designed for high-performance querying, relational data storage, and analytics. For a typical web application with a backend, it is a good choice when we want to consider a managed database that can scale both vertically and horizontally.

Metrics

Metrics Azure Analytics Storage

Network performance monitoring top of mind for CloudOps teams

Dynatrace

MAY 19, 2023

For cloud operations teams, network performance monitoring is central in ensuring application and infrastructure performance. Network performance monitoring core to observability For these reasons, network activity becomes a key data source in IT observability. Teams also don’t have to maintain normalized schemas to query data.

Network

Network Monitoring Performance Traffic

IT Operations: A Use Case in the 2023 Gartner Critical Capabilities for Application Performance Monitoring and Observability

Dynatrace

OCTOBER 17, 2023

In the recently published Gartner® “ Critic al Capabilities for Application Performance Monitoring and Observability,” Dynatrace scored highest for the IT Operations Use Case (4.15/5) This is accomplished by using service monitoring and anomaly detection for early-warning notifications of performance issues.” 5) in the Gartner report.

Monitoring

Monitoring Artificial Intelligence Performance Analytics

How to Perform Load Testing Against Nebula Graph With K6

DZone

DECEMBER 17, 2021

The load testing for the database needs to be conducted usually so that the impact on the system can be monitored in different scenarios, such as query language rule optimization, storage engine parameter adjustment, etc. The operating system in this article is the x86 CentOS 7.8.

Testing

Testing Operating System Storage Performance

Optimize Citrix platform performance and user experience with Dynatrace (GA)

Dynatrace

JANUARY 15, 2020

This extends Dynatrace visibility into Citrix user experience and Citrix platform performance. Therefore, it requires multidimensional and multidisciplinary monitoring: Infrastructure health —automatically monitor the compute, storage, and network resources available to the Citrix system to ensure a stable platform. Citrix VDA.

Latency

Latency Performance Virtualization Infrastructure

Jaeger and ScyllaDB Integration: High Performance at Scale

DZone

OCTOBER 10, 2023

With the rise of microservices and cloud-native applications, Jaeger has become a crucial tool for developers and system administrators to gain insights into the performance and behavior of their applications. Use the best-performing Jaeger storage backend that you can find.

Performance

Performance Storage Traffic Programming

Designing Instagram

High Scalability

JANUARY 11, 2022

from a client it performs two parallel operations: i) persisting the action in the data store ii) publish the action in a streaming data store for a pub-sub model. User Feed Service, Media Counter Service) read the actions from the streaming data store and performs their specific tasks. After that, the various services (e.g.

Design

Design Media Storage Logistics

Introducing Netflix’s Key-Value Data Abstraction Layer

The Netflix TechBlog

SEPTEMBER 18, 2024

Firstly, developers struggled to reason about consistency, durability and performance in this complex global deployment across multiple stores. This flexibility allows our Data Platform to route different use cases to the most suitable storage system based on performance, durability, and consistency needs.

Latency

Latency Storage Cache Servers

How We Optimized Read Performance: Readahead, Prefetch, and Cache

DZone

SEPTEMBER 3, 2024

High-performance computing systems often use all-flash architectures and kernel-mode parallel file systems to satisfy performance demands. However, the increasing sizes of both data volumes and distributed system clusters raise significant cost challenges for all-flash storage and vast operational challenges for kernel clients.

Cache

Cache Performance Storage Architecture

The Challenges of Ajax CDN

DZone

AUGUST 4, 2022

For the longest time, hosting static files on CDNs was the de facto standard for performance tuning website pages. The host offered browser caching advantages, better stability, and storage on fast edge servers across strategic geolocations. Not only did it have performance benefits, but it was also convenient for developers.

Cache

Cache Tuning Storage Website

Data Storage Formats for Big Data Analytics: Performance and Cost Implications of Parquet, Avro, and ORC

Storage Types Used on Cloud Computing Platforms

Trending Sources

Dynatrace + Metis: Helping developers & SREs solve Database issues with AI

How We Built a High-Performance Storage Layer for Our Ultra-Heterogeneous Computing Cluster

Implementing LSM Trees in Golang: A Comprehensive Guide

Block Size and Its Impact on Storage Performance

Optimizing data warehouse storage

Catching up with OpenTelemetry in 2025

How To Debug Mobile App Database Problems and Optimize Data Storage Performance

Storage handling improvements increase retention of transaction data for Dynatrace Managed

Netflix’s Distributed Counter Abstraction

Empowering Developers With Scalable, Secure, and Customizable Storage Solutions

Partitioning Hot and Cold Data Tier in Apache Kafka Cluster for Optimal Performance

Microsoft Ignite 2024 guide: Cloud observability for AI transformation

Choosing the Right Storage for PostgreSQL on Kubernetes: A Benchmark Analysis

Performing Sentiment Analysis Video

Optimize your environment: Unveiling Dynatrace Hyper-V extension for enhanced performance and efficient troubleshooting

RabbitMQ vs. Kafka: Key Differences

Battle of the RabbitMQ Queues: Performance Insights on Classic and Quorum

Analyze OpenTelemetry traces and log data at scale: Accelerate troubleshooting and optimize application performance

Comparing PostgreSQL DigitalOcean Performance & Pricing – ScaleGrid vs. DigitalOcean Managed Databases

Perform 2023 Guide: Organizations mine efficiencies with automation, causal AI

The Power of Caching: Boosting API Performance and Scalability

What is Greenplum Database? Intro to the Big Data Database

CDNs: Speed Up Performance by Reducing Latency

Mastering Disk Space Management with MongoDB® Storage Engines

Best MySQL DigitalOcean Performance – ScaleGrid vs. DigitalOcean Managed Databases

OpenPipeline: Simplify access to critical business data

What is a Distributed Storage System

Storage Strategies for PostgreSQL on Kubernetes

Optimizing IoT Performance in Industrial Environments

Best practices and key metrics for improving mobile app performance

Introducing Impressions at Netflix

Indexed View for Aggregating Metrics

Network performance monitoring top of mind for CloudOps teams

IT Operations: A Use Case in the 2023 Gartner Critical Capabilities for Application Performance Monitoring and Observability

How to Perform Load Testing Against Nebula Graph With K6

Optimize Citrix platform performance and user experience with Dynatrace (GA)

Top PostgreSQL 17 New Features

Jaeger and ScyllaDB Integration: High Performance at Scale

Designing Instagram

Introducing Netflix’s Key-Value Data Abstraction Layer

How We Optimized Read Performance: Readahead, Prefetch, and Cache

The Challenges of Ajax CDN

Stay Connected