Update app.py
app.py
CHANGED
@@ -419,7 +419,7 @@ with gr.Blocks(title="LLM Training Estimator — Chinchilla Scaling Law") as dem
 📚 References:
 - [Chinchilla](https://arxiv.org/abs/2203.15556): *Training Compute-Optimal Large Language Models*
 - [GPT-3](https://arxiv.org/abs/2005.14165) (note: arXiv:2001.08361 is earlier version)
-- [MFU definition](https://arxiv.org/abs/
+- [MFU definition](https://arxiv.org/abs/2104.04473)
 - [Muon](https://arxiv.org/abs/2502.16982)
 """)
 