What is GPU’s memory hierarchy?

A GPU’s memory hierarchy is like having multiple storage rooms inside a super-fast toy factory, each room has its own speed and purpose.

Imagine you're building a big LEGO castle. The GPU (Graphics Processing Unit) is the worker who puts all the pieces together, but it needs different kinds of storage to do its job efficiently.

The Fastest Room: Registers

The registers are like tiny pockets on the worker’s belt, they’re super fast and can hold just a few LEGO pieces at a time. When the worker needs something right away, it checks here first.

The Next Room: Cache Memory

If the worker doesn’t find what it needs in its pockets, it goes to cache memory, which is like a nearby shelf with more LEGO pieces, still fast, but not as fast as the pockets.

The Big Warehouse: Global Memory

When the worker can't find what it needs anywhere close, it goes all the way to the global memory, which is like a big warehouse full of LEGO pieces. It’s slower, but it has everything you might need for the whole castle.

Each part helps the GPU work faster and more efficiently, just like how having different storage places makes building your LEGO castle quicker!

Take the quiz →

Examples

  1. A GPU has different levels of memory to help it quickly process images and videos, like how a library has shelves for books you use often and ones you don’t.

Ask a question

See also

Discussion

Recent activity