🛠️ Steven Gong

Mar 28, 2026, 1 min read

Muon is Scalable for LLM Training

https://kellerjordan.github.io/posts/muon/#why-is-it-good-to-orthogonalize-the-update

Backlinks

  • Newton-Schulz Iteration
