GEneral Matrix Multiply (GEMM)
I have seen this term quite a lot. In BLAS naming, SGEMM is the single-precision (float32) variant of GEMM.
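As a reminder of what the term refers to, here is a minimal sketch of the operation in its usual BLAS form, C = alpha * A * B + beta * C. The function name `gemm_naive` and the list-of-lists layout are my own choices for illustration, not any library's API. Python floats are 64-bit, so this only shows the arithmetic, not true single precision.

```python
def gemm_naive(alpha, a, b, beta, c):
    """Return alpha * (a @ b) + beta * c for row-major lists of lists.

    a is m x k, b is k x n, c is m x n. Hypothetical helper for illustration.
    """
    m, k, n = len(a), len(b), len(b[0])
    assert all(len(row) == k for row in a), "inner dimensions must match"
    assert len(c) == m and all(len(row) == n for row in c), "c must be m x n"

    out = [[0.0] * n for _ in range(m)]
    for i in range(m):          # rows of a
        for j in range(n):      # columns of b
            acc = 0.0
            for p in range(k):  # dot product along the shared dimension
                acc += a[i][p] * b[p][j]
            out[i][j] = alpha * acc + beta * c[i][j]
    return out


a = [[1.0, 2.0], [3.0, 4.0]]
b = [[5.0, 6.0], [7.0, 8.0]]
c = [[1.0, 1.0], [1.0, 1.0]]

# alpha=1, beta=0 reduces to a plain matrix product.
print(gemm_naive(1.0, a, b, 0.0, c))
```

This triple loop is the unoptimized baseline; the resources below are about making the same computation fast.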
Resources
GPU
CPU
- https://siboehm.com/articles/22/Fast-MMM-on-CPU
- https://sahnimanas.github.io/post/anatomy-of-a-high-performance-convolution/
- https://github.com/flame/how-to-optimize-gemm
SGEMM