Update README.md
Browse files
README.md
CHANGED
|
@@ -8,7 +8,7 @@ tags:
|
|
| 8 |
|
| 9 |
|
| 10 |
# Description
|
| 11 |
-
`btherien/mulo` is a
|
| 12 |
### Learned optimizer meta training and architectural details
|
| 13 |
| **Field** | **Value** |
|
| 14 |
|------------------------------|---------------------------------------------------------------------------|
|
|
|
|
| 8 |
|
| 9 |
|
| 10 |
# Description
|
| 11 |
+
`btherien/mulo` is a learned optimizer meta-trained in μ-parameterization. It corresponds to the μLO_M optimizer from [μLO: Compute-Efficient Meta-Generalization of Learned Optimizers](https://arxiv.org/abs/2406.00153). Due to being meta-trained in μP, μLO_M has strong meta-generalization capabilities (i.e., the ability to optimize unseen tasks), despite its relatively short and inexpensive meta-training distribution.
|
| 12 |
### Learned optimizer meta training and architectural details
|
| 13 |
| **Field** | **Value** |
|
| 14 |
|------------------------------|---------------------------------------------------------------------------|
|