Remove Presentation Remove Processing Remove Storage
article thumbnail

Optimizing data warehouse storage

The Netflix TechBlog

At this scale, we can gain a significant amount of performance and cost benefits by optimizing the storage layout (records, objects, partitions) as the data lands into our warehouse. On the other hand, these optimizations themselves need to be sufficiently inexpensive to justify their own processing cost over the gains they bring.

Storage 214
article thumbnail

What is a Distributed Storage System

Scalegrid

A distributed storage system is foundational in today’s data-driven landscape, ensuring data spread over multiple servers is reliable, accessible, and manageable. Understanding distributed storage is imperative as data volumes and the need for robust storage solutions rise.

Storage 130
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Scaling Media Machine Learning at Netflix

The Netflix TechBlog

We accomplish this by paving the path to: Accessing and processing media data (e.g. We accomplish this by paving the path to: Accessing and processing media data (e.g. We will then present a case study of using these components in order to optimize, scale, and solidify an existing pipeline.

Media 299
article thumbnail

Using JSONB in PostgreSQL: How to Effectively Store & Index JSON Data in PostgreSQL

Scalegrid

Note: If a particular key is always present in your document, it might make sense to store it as a first class column. JSON is faster to ingest vs. JSONB – however, if you do any further processing, JSONB will be faster. JSONB storage results in a larger storage footprint. whitespace) and ordering of the keys.

Storage 321
article thumbnail

Netflix’s Distributed Counter Abstraction

The Netflix TechBlog

Today, we’re excited to present the Distributed Counter Abstraction. In this context, they refer to a count very close to accurate, presented with minimal delays. After selecting a mode, users can interact with APIs without needing to worry about the underlying storage mechanisms and counting methods.

Latency 253
article thumbnail

Building an elastic query engine on disaggregated storage

The Morning Paper

Building an elastic query engine on disaggregated storage , Vuppalapati, NSDI’20. This paper presents Snowflake design and implementation along with a discussion on how recent changes in cloud infrastructure (emerging hardware, fine-grained billing, etc.) joins) during query processing.

Storage 112
article thumbnail

Nine ways technology executives can get significant business value with the right observability platform

Dynatrace

Observability data presents executives with new opportunities to achieve this, by creating incremental value for cloud modernization , improved business analytics , and enhanced customer experience. With the latest advances from Dynatrace, this process is instantaneous. This is inefficient and creates avoidable risks.