🛠️ Steven Gong
Search
Search
Search
Light mode
Dark mode
Apr 04, 2026, 1 min read
DeepSeek
They implemented
Multi-Head Attention
.
Graph View
Backlinks
Multi-Head Latent Attention (MLA)