🛠️ Steven Gong

Search

Nov 29, 2023, 1 min read

CUDA Optimization

This is a list of techniques taken from table 6.1 of the PMPP book.

Maximize occupancy
Enable Memory Coalescing by being aware of the order at which you are reading from RAM
Minimize Control Divergence
Tiling
Privatization ..?
Thread coarsening

Graph View

Backlinks

Corner Turning

Created with Quartz, © 2025

Blog
LinkedIn
Twitter
GitHub