This feature-packed database provides powerful and rapid analytics on data that scales up to petabyte volumes. Greenplum uses an MPP (massively parallel processing) database design that can help you develop a scalable, high-performance deployment. High performance, query optimization, open source, and polymorphic data storage are the major Greenplum advantages.
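As a rough sketch of what that MPP design looks like in practice, Greenplum's DISTRIBUTED BY clause spreads a table's rows across segment hosts. The connection details and table below are invented for illustration:

```python
# Hedged sketch: creating an MPP-distributed table in Greenplum via psycopg2.
# Host, database, user, and the table itself are placeholders, not from the text.
import psycopg2

conn = psycopg2.connect(host="gp-master.example.com", dbname="analytics", user="gpadmin")
with conn, conn.cursor() as cur:
    # DISTRIBUTED BY hashes rows across segment hosts so scans and
    # aggregations run in parallel on every segment (the core MPP idea).
    cur.execute("""
        CREATE TABLE page_views (
            view_ts timestamp,
            user_id bigint,
            url     text
        ) DISTRIBUTED BY (user_id);
    """)
conn.close()
```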
The shortcomings and drawbacks of batch-oriented data processing were recognized by the big data community long ago. As a result, input data typically flows from the data source into the in-stream pipeline via a persistent buffer that lets clients move their reading pointers back and forth.
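The excerpt doesn't name the buffer, but Apache Kafka is a common choice; here is a minimal kafka-python sketch of moving the reading pointer, assuming a local broker and a hypothetical "events" topic:

```python
# Sketch of the persistent-buffer idea using Apache Kafka (an assumption;
# the excerpt names no specific technology). kafka-python lets a consumer
# move its reading pointer back and forth with seek().
from kafka import KafkaConsumer, TopicPartition

consumer = KafkaConsumer(bootstrap_servers="localhost:9092")
tp = TopicPartition("events", 0)           # hypothetical topic/partition
consumer.assign([tp])

consumer.seek(tp, 0)                       # rewind the pointer to the start
replayed = consumer.poll(timeout_ms=1000)  # re-read already-consumed events
consumer.seek_to_end(tp)                   # jump forward to the newest offset
```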
Then, big data analytics technologies such as Hadoop, NoSQL, Spark, or Grail (the Dynatrace data lakehouse technology) interpret this information. Here are the six steps of a typical ITOA process: Define the data infrastructure strategy. Identify data use cases and develop a scalable delivery model with documentation.
With more organizations taking the multicloud plunge, monitoring cloud infrastructure is critical to ensure all components of the cloud computing stack are available, high-performing, and secure. Website monitoring examines a cloud-hosted website’s processes, traffic, availability, and resource use. Database monitoring.
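A toy availability probe in the spirit of the website monitoring described above (the endpoint and the 5xx threshold are invented; real monitors track far more than status codes):

```python
# Minimal availability/latency probe; an illustration, not a product's method.
import time
import requests

def probe(url: str, timeout: float = 5.0):
    start = time.perf_counter()
    try:
        resp = requests.get(url, timeout=timeout)
        up = resp.status_code < 500        # treat 5xx responses as "down"
    except requests.RequestException:
        up = False
    latency_ms = (time.perf_counter() - start) * 1000
    return up, latency_ms

print(probe("https://example.com"))
```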
Several pain points have made it difficult for organizations to manage their data efficiently and create actual value. Limited data availability constrains value creation. Modern IT environments — whether multicloud, on-premises, or hybrid-cloud architectures — generate exponentially increasing data volumes.
Netflix Data Landscape Freedom & Responsibility (F&R) is the linchpin of Netflix’s culture, empowering teams to move fast to deliver on innovation and operate with the freedom to satisfy their mission. As a result, no single consolidated and centralized source of truth exists that can be leveraged to derive data lineage.
Network Availability: The expected continued growth of our ecosystem makes it difficult to understand our network bottlenecks and potential limits we may be reaching. The data is also used by security and other partner teams for insight and incident analysis.
Kubernetes has emerged as the go-to container orchestration platform for data engineering teams. In 2018, widespread adoption of Kubernetes for big data processing is anticipated. Organisations are already using Kubernetes for a variety of workloads [1] [2], and data workloads are up next. Key challenges. Performance.
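For a feel of what running a data workload on Kubernetes involves, here is a hypothetical sketch submitting a containerized batch job with the official Python client; the image, namespace, and names are placeholders:

```python
# Hedged sketch: creating a Kubernetes batch Job for a data pipeline step.
from kubernetes import client, config

config.load_kube_config()  # reads your local kubeconfig

container = client.V1Container(
    name="etl",
    image="example/etl-job:latest",   # placeholder image
    command=["python", "etl.py"],
)
job = client.V1Job(
    api_version="batch/v1",
    kind="Job",
    metadata=client.V1ObjectMeta(name="nightly-etl"),
    spec=client.V1JobSpec(
        template=client.V1PodTemplateSpec(
            spec=client.V1PodSpec(containers=[container], restart_policy="Never")
        )
    ),
)
client.BatchV1Api().create_namespaced_job(namespace="data", body=job)
```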
Through effortless provisioning, a larger number of small hosts provide a cost-effective and scalable platform. On-premises data centers invest in higher capacity servers since they provide more flexibility in the long run, while the procurement price of hardware is only one of many cost factors.
This orchestration includes provisioning, scheduling, networking, ensuring availability, and monitoring container lifecycles. Part of its popularity stems from its availability as a managed service through the major cloud providers, such as Amazon Elastic Kubernetes Service, Google Kubernetes Engine, and Microsoft Azure Kubernetes Service.
The first phase involves validating functional correctness, scalability, and performance, and ensuring the new system’s resilience before the migration. It provides a good read on the availability and latency ranges under different production conditions.
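One hedged way to get that read (the excerpt doesn't show its method) is to shadow each request against both systems, keep the legacy answer, and record mismatches and latency for offline analysis:

```python
# Shadow-read sketch; store interfaces and names are assumptions.
import time

def shadow_read(key, legacy_store, new_store, mismatches):
    start = time.perf_counter()
    expected = legacy_store.get(key)   # authoritative result, returned to caller
    actual = new_store.get(key)        # candidate system; result is discarded
    if actual != expected:
        mismatches.append((key, expected, actual))
    latency_ms = (time.perf_counter() - start) * 1000
    return expected, latency_ms
```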
To solve the challenges mentioned above and meet our rapidly evolving business needs, we re-architected the legacy SKU catalog from the ground up and partnered with the Growth Engineering team to build a scalable SKU platform. Most of these changes are mechanical and amenable to the “self-service” model. Business Rules (SKURules):
Key Takeaways Distributed storage systems benefit organizations by enhancing data availability, fault tolerance, and system scalability, leading to cost savings from reduced hardware needs, energy consumption, and personnel costs. Variations of these storage systems are known as distributed file systems.
Data scientists and engineers collect this data from our subscribers and videos, and implement data analytics models to discover customer behaviour with the goal of maximizing user joy. The processed data is typically stored as data warehouse tables in AWS S3. Moving data with Bulldozer at Netflix.
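A hedged sketch of reading one such warehouse table from S3, assuming Parquet files (the excerpt doesn't state the format; the bucket, path, and column are invented):

```python
# pyarrow can read a partitioned Parquet dataset directly from an s3:// URI.
import pyarrow.dataset as ds

table = ds.dataset("s3://example-warehouse/playback_events/", format="parquet").to_table()
daily_counts = table.to_pandas().groupby("event_date").size()
print(daily_counts.head())
```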
As big data and ML became more prevalent and impactful, the scalability, reliability, and usability of the orchestrating ecosystem became increasingly important for our data scientists and the company. Meson was based on a single-leader architecture with high availability.
Today, I'm happy to announce that the AWS Europe (London) Region, our 16th technology infrastructure region globally, is now generally available for use by customers worldwide. Fraud.net uses AWS to support highly scalable, big data applications that run machine learning processes for real-time analytics.
This article will help you understand the core differences in data structure, scalability, and use cases. Whether you need a relational database for complex transactions or a NoSQL database for flexible data storage, we've got you covered. Choosing the right database often comes down to MongoDB vs. MySQL.
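An illustrative contrast, not from the article: the same order stored as a MongoDB document and as a MySQL row. Connection details and schema are placeholders:

```python
# Document vs. relational storage of one record.
from pymongo import MongoClient
import mysql.connector

# Document model: schema-flexible; nested items live inside the order.
mongo = MongoClient("mongodb://localhost:27017")
mongo.shop.orders.insert_one({"user": "ada", "items": [{"sku": "A1", "qty": 2}]})

# Relational model: fixed schema, transactional guarantees.
db = mysql.connector.connect(host="localhost", user="root", database="shop")
cur = db.cursor()
cur.execute("INSERT INTO orders (user, sku, qty) VALUES (%s, %s, %s)", ("ada", "A1", 2))
db.commit()
```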
Heading into 2024, SQL databases will remain essential in data management, increasingly using distributed systems to meet growing needs for scalability and reliability. They keep the features that developers like but can handle much more data, similar to NoSQL systems.
NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. Moreover, the elimination of these features had an extremely important influence on the performance and scalability of the stores. Many of the techniques described below are perfectly applicable to this model.
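A hedged illustration of one classic NoSQL modeling technique, denormalization: duplicate data at write time so a single key lookup answers the query. The specific records are invented:

```python
# Normalized form: answering "who placed order 42?" needs a second read/join.
normalized_user = {"id": 7, "name": "ada"}
normalized_order = {"id": 42, "user_id": 7, "total": 99.5}

# Denormalized document: the user is embedded in the order, trading storage
# and update cost for one-read access.
denormalized_order = {"id": 42, "user": {"id": 7, "name": "ada"}, "total": 99.5}
```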
The new region will give Hong Kong-based businesses, government organizations, non-profits, and global companies with customers in Hong Kong the ability to leverage AWS technologies from data centers in Hong Kong. The new AWS Asia Pacific (Hong Kong) Region will have three Availability Zones and be ready for customer use in 2018.
This system allows for scalability and efficiency, demonstrating RabbitMQ’s versatility in real-world applications where speed and reliability are crucial. This efficient handling of messages improves throughput and promotes maximum utilization of all available resources.
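A minimal pika sketch (an illustration, not the article's code): a durable queue with a prefetch limit so multiple consumers share work and all available workers stay busy:

```python
import pika

conn = pika.BlockingConnection(pika.ConnectionParameters("localhost"))
ch = conn.channel()
ch.queue_declare(queue="tasks", durable=True)
ch.basic_qos(prefetch_count=10)  # cap unacknowledged messages per consumer

def handle(ch, method, properties, body):
    print("processed", body)
    ch.basic_ack(delivery_tag=method.delivery_tag)  # ack only after processing

ch.basic_consume(queue="tasks", on_message_callback=handle)
ch.start_consuming()
```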
In this comparison of Redis vs Memcached, we strip away the complexity, focusing on each in-memory data store’s performance, scalability, and unique features. Redis is better suited for complex data models, and Memcached is better suited for high-throughput, string-based caching scenarios. Data transfer technology.
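A hedged side-by-side (keys and values invented): Redis stores structured values natively, while Memcached stores flat strings/bytes:

```python
import redis
from pymemcache.client.base import Client

# Redis: a hash value, addressable field by field.
r = redis.Redis(host="localhost", port=6379)
r.hset("user:1", mapping={"name": "ada", "visits": 3})
print(r.hget("user:1", "name"))

# Memcached: one opaque value; parsing is the application's job.
mc = Client(("localhost", 11211))
mc.set("user:1", "ada|3")
print(mc.get("user:1"))
```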
Today, I'm happy to share that the Canada (Central) Region is available for use by customers worldwide. The AWS Cloud now operates in 40 Availability Zones within 15 geographic regions around the world, with seven more Availability Zones and three more regions coming online in China, France, and the U.K. Scalability.
Caching has become a standard component in many applications to achieve fast and predictable performance, but maintaining a collection of cache servers in a reliable and scalable manner is not a simple task.
The storage systems we've pioneered demonstrate extreme scalability while maintaining tight control over performance, availability, and cost.
Scrapinghub is hiring a Senior Software Engineer (Big Data/AI). You will be designing and implementing distributed systems: a large-scale web crawling platform, integrating Deep Learning based web data extraction components, working on queue algorithms, large datasets, creating a development platform for other company departments, etc.
The new region will give Nordic-based businesses, government organisations, non-profits, and global companies with customers in the Nordics, the ability to leverage the AWS technology infrastructure from data centers in Sweden. The new AWS EU (Stockholm) Region will have three Availability Zones and will be ready for customers to use in 2018.
Today, I am excited to share with you a brand new service called Amazon QuickSight that aims to simplify the process of deriving insights from a wide variety of data sources in a fast and affordable manner. QuickSight is a fast, cloud-native, scalable business intelligence service at one-tenth the cost of old-guard BI solutions.
As a big music fan with well over 100 GB of digital music, I am particularly excited that I now have access to all my digital music anywhere I go.
They had taken the approach that they would offer their software not only as a scalable multi-tenant product but also as a single-tenant environment for customers that want their own isolated environment.
Expanding the Cloud: introducing the Asia Pacific (Sydney) Region. The Region launches with two Availability Zones to help customers build highly available applications.
I am very excited that today we have launched Amazon Route 53, a high-performance and highly available Domain Name System (DNS) service. Route 53 provides authoritative DNS functionality implemented using a worldwide network of highly available DNS servers.
A whole range of innovative new services, ranging from media conversion to geo-location-context services, has been developed by our customers using this flexibility and is available in the AWS ecosystem.
Government and Big Data. One particular early use case for AWS GovCloud (US) will be massive data processing and analytics. The scalability, flexibility, and elasticity of AWS make it an ideal environment for the agencies to run their analytics.
The service redundantly stores data in multiple facilities and on multiple devices within each facility; Amazon Glacier is designed to provide average annual durability of 99.999999999% for each item stored.
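A back-of-the-envelope reading of that figure: with eleven nines (99.999999999%) average annual durability, the expected number of objects lost per year stays tiny even for large archives. The archive sizes below are arbitrary examples:

```python
p_loss = 1 - 0.99999999999  # about 1e-11 per object per year
for n in (10_000, 10_000_000):
    print(f"{n} objects -> {n * p_loss:.2e} expected losses/year")
```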
This approach allows companies to combine the security and control of private clouds with public clouds’ scalability and innovation potential. Customized solutions: ScaleGrid collaborates with a wide range of cloud platforms, granting access to data centers worldwide and facilitating support for operations in various regions.
Japanese companies and consumers have become used to low latency and high-speed networking available between their businesses, residences, and mobile devices.
This incredible power is available to anyone under the usual pay-as-you-go model, removing the investment barrier that has kept many organizations from adopting GPUs for their workloads, even though they knew there would be a significant performance benefit.
It is responsible for listening to incoming events and requests and prioritizing different tables and actions to make the best use of the available resources. AutoOptimize relies on Iceberg-specific features such as snapshots and atomic operations to perform the optimizations in an accurate and scalable manner.
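A hypothetical sketch of the kind of decision such an optimizer makes (the threshold and heuristic are assumptions, not Netflix's actual logic): rewrite a partition only when small files dominate it:

```python
TARGET_FILE_SIZE = 512 * 1024 * 1024  # assumed 512 MB target file size

def needs_compaction(file_sizes_bytes, small_ratio=0.5):
    """True when at least small_ratio of files are under half the target size."""
    small = sum(1 for s in file_sizes_bytes if s < TARGET_FILE_SIZE / 2)
    return small / max(len(file_sizes_bytes), 1) >= small_ratio
```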