article thumbnail

Stuff The Internet Says On Scalability For May 10th, 2019

High Scalability

1% : performers raked in 60% of all concert-ticket revenue world-wide in 2017—more than double their share in 1982. Quotable Stuff: @mjpt777 : APIs to IO need to be asynchronous and support batching otherwise the latency of calls dominate throughput and latency profile under burst conditions.

article thumbnail

USENIX LISA2021 Computing Performance: On the Horizon

Brendan Gregg

## References I've reproduced the talk references below, so you can click on links: - [Gregg 08] Brendan Gregg, “ZFS L2ARC,” [link] Jul 2008 - [Gregg 10] Brendan Gregg, “Visualizations for Performance Analysis (and More),” [link] 2010 - [Greenberg 11] Marc Greenberg, “DDR4: Double the speed, double the latency? Ford, et al., “TCP

Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Working at Netflix 2017

Brendan Gregg

A latency outlier issue that happened every 15 minutes. html [The PMCs of EC2]: /blog/2017-05-04/the-pmcs-of-ec2.html Recently that's included: - Debugging why perf profiling stopped working in recent Docker containers. Java core dump analysis for a crashing JVM. - Analyzing slab memory growth on a instance with containers.

Java 75
article thumbnail

Stuff The Internet Says On Scalability For December 7th, 2018

High Scalability

It's HighScalability time: This is your 1500ms latency in real life situations - pic.twitter.com/guot8khIPX. ivanveram : World R&D leading companies 2017 in US$. — Ivo Mägi (@ivomagi) November 27, 2018. Do you like this sort of Stuff? Please support me on Patreon. I'd really appreciate it. 1: Amazon; No.2:

Internet 163
article thumbnail

Building Netflix’s Distributed Tracing Infrastructure

The Netflix TechBlog

If we had an ID for each streaming session then distributed tracing could easily reconstruct session failure by providing service topology, retry and error tags, and latency measurements for all service calls. Using simple lookup indices in Cassandra gives us the ability to maintain acceptable read latencies while doing heavy writes.

article thumbnail

USENIX SREcon APAC 2022: Computing Performance: What's on the Horizon

Brendan Gregg

My personal opinion is that I don't see a widespread need for more capacity given horizontal scaling and servers that can already exceed 1 Tbyte of DRAM; bandwidth is also helpful, but I'd be concerned about the increased latency for adding a hop to more memory. Ford, et al., “TCP

article thumbnail

Solaris to Linux Migration 2017

Brendan Gregg

Here's some output from my zfsdist tool, in bcc/BPF, which measures ZFS latency as a histogram on Linux: # zfsdist. Tracing ZFS operation latency. Many new tools can now be written, and the main toolkit we're working on is [bcc]. Hit Ctrl-C to end. ^C