🪴 Berwin Gan
Muon - An optimizer for hidden layers in neural networks
Jun 18, 2025
optimizer
TODO
Adam
AdamW
NanoGPT speedrun? CIFAR-10 speedrun?