🛠️ Steven Gong

Search

Mar 22, 2026, 1 min read

Muon is Scalable for LLM Training

https://kellerjordan.github.io/posts/muon/#why-is-it-good-to-orthogonalize-the-update

Graph View

Backlinks

Newton-Shulz Iteration

Created with Quartz, © 2026

Blog
LinkedIn
Twitter
GitHub