Update README.md
README.md
CHANGED
@@ -3,8 +3,8 @@ license: apache-2.0
 ---
 # LLM360 Research Suite: K2 Loss Spike 1
 We encountered two major loss spikes while [training K2](https://huggingface.co/LLM360/K2).
-* The first loss spike
-* The [second loss spike](https://huggingface.co/LLM360/K2-Spike-2/)
+* The first loss spike occurred after X checkpoints and lasted for ~34 checkpoints. We restarted training at checkpoint X and training returned to normal.
+* The [second loss spike](https://huggingface.co/LLM360/K2-Spike-2/) occurred after we restarted training to fix the first loss spike at checkpoint X and lasted for ~8 checkpoints.
 
 We are releasing these checkpoints so others can study this interesting phenomenon in large model training.
 