Remove 2005 Remove Architecture Remove Hardware
article thumbnail

Faster remainders when the divisor is a constant: beating compilers and libdivide

Daniel Lemire

The division by a power of two ( / (2 N )) can be implemented as a right shift if we are working with unsigned integers, which compiles to single instruction: that is possible because the underlying hardware uses a base 2. Thus if 2 N / d has been precomputed, you can compute the division n/d as a multiplication and a shift. Can we do better?

C++ 279
article thumbnail

Single-core memory bandwidth: Latency, Bandwidth, and Concurrency

John McCalpin

From the form of the equation the units are GB/s * ns = Bytes, but to understand how this maps to computer hardware resources it is almost always more convenient to translate this to units of “cache lines” (with 64 Bytes per cache line in the processors reviewed here). The same is true for software prefetches.)

Latency 71
Insiders

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

AWS EC2 Virtualization 2017: Introducing Nitro

Brendan Gregg

Hardware virtualization for cloud computing has come a long way, improving performance using technologies such as VT-x, SR-IOV, VT-d, NVMe, and APICv. The latest AWS hypervisor, Nitro, uses everything to provide a new hardware-assisted hypervisor that is easy to use and has near bare-metal performance. I'd expect between 0.1%

article thumbnail

Single-core memory bandwidth: Latency, Bandwidth, and Concurrency

John McCalpin

From the form of the equation the units are GB/s * ns = Bytes, but to understand how this maps to computer hardware resources it is almost always more convenient to translate this to units of “cache lines” (with 64 Bytes per cache line in the processors reviewed here). The same is true for software prefetches.)

Latency 40
article thumbnail

AWS EC2 Virtualization 2017: Introducing Nitro

Brendan Gregg

Hardware virtualization for cloud computing has come a long way, improving performance using technologies such as VT-x, SR-IOV, VT-d, NVMe, and APICv. The latest AWS hypervisor, Nitro, uses everything to provide a new hardware-assisted hypervisor that is easy to use and has near bare-metal performance. I'd expect between 0.1%