🛠️ Steven Gong
Search
Search
Search
Light mode
Dark mode
Mar 19, 2026, 1 min read
DeepSeek
They implemented
Multi-Head Attention
.
Graph View
Backlinks
Multi-Head Latent Attention (MLA)