High Bandwidth Memory (HBM)

Ran into this term while learning about memory through Stephen Jones.

HBM vs VRAM?

HBM is VRAM. VRAM is the role (“the GPU’s main memory”). HBM / GDDR6 / GDDR6X are technologies used to implement that VRAM.

HBM is a type of computer memory used in many data-intensive applications. It’s made up of stacks of memory chips designed to store and transmit data quickly.

Why HBM exists

  • You want massive bandwidth without cranking clock speeds to the moon.
  • HBM achieves this by giving the GPU a super-wide bus (thousands of bits wide) at lower frequency.
  • That wide bus is only feasible if the memory is very close to the GPU (on the same package), otherwise routing would be a nightmare.
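The trade-off above can be sketched with the basic peak-bandwidth formula: bus width × per-pin data rate. The figures below are illustrative approximations (a 1024-bit HBM2 stack at ~2 Gb/s per pin vs. a 32-bit GDDR6 chip at ~16 Gb/s per pin), not exact product specs.

```python
def bandwidth_gbs(bus_width_bits: int, gbps_per_pin: float) -> float:
    """Peak bandwidth in GB/s: bits transferred per second, divided by 8."""
    return bus_width_bits * gbps_per_pin / 8

# HBM-style: very wide bus, modest per-pin rate (one HBM2 stack, approx.)
hbm_stack = bandwidth_gbs(1024, 2.0)   # 256.0 GB/s

# GDDR-style: narrow bus, high per-pin rate (one GDDR6 chip, approx.)
gddr_chip = bandwidth_gbs(32, 16.0)    # 64.0 GB/s

print(hbm_stack, gddr_chip)
```

Same formula, opposite strategies: HBM wins on width, GDDR wins on clock rate, and a GPU board stacks several of either to reach its total bandwidth.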

An RTX 5090 doesn’t have HBM; it uses GDDR7.

HBM is stacked on the same package as the processor (3D-stacked).

HBM allows four, eight, or twelve DRAM dies to be stacked atop each other and linked to each other with through-silicon vias (TSVs).
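The stack heights above translate directly into per-stack capacity. A quick sketch, assuming an illustrative 2 GB (16 Gb) per die, which is just an example figure, not a spec:

```python
def stack_capacity_gb(dies: int, gb_per_die: float = 2.0) -> float:
    """Capacity of one HBM stack = number of DRAM dies * capacity per die.
    gb_per_die is an assumed illustrative value (2 GB, i.e. a 16 Gb die)."""
    return dies * gb_per_die

# The three stack heights mentioned above
for dies in (4, 8, 12):
    print(f"{dies}-high stack -> {stack_capacity_gb(dies)} GB")
```

A GPU then places several such stacks next to the die on the package, so total VRAM is stacks × stack capacity.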

Do not confuse this with NVLink / Interconnect

HBM is a type of memory for your GPUs. It has nothing to do with GPU-to-GPU communication. For that, NVIDIA GPUs usually use NVLink.

HBM achieves higher bandwidth than DDR4 or GDDR5 while using less power, and in a substantially smaller form factor.

This is achieved by stacking up to eight DRAM dies and an optional base die which can include buffer circuitry and test logic.

How is stacking DRAM in HBM different from placing DRAM chips around the periphery of the CPU/GPU?

The Asianometry video explains this.

Resources