Update README.md
Browse files
README.md
CHANGED
|
@@ -29,7 +29,7 @@ Full details on simulation and training can be found [here](https://github.com/a
|
|
| 29 |
|
| 30 |
# Training Procedure
|
| 31 |
|
| 32 |
-
Trained with [Stable Alignment](https://github.com/agi-templar/Stable-Alignment) on 8xA100s for 3H. The start checkpoint is the [
|
| 33 |
|
| 34 |
We have also released the [better-base model](https://huggingface.co/agi-css/better-base) which is the start checkpoint of SFT.
|
| 35 |
|
|
|
|
| 29 |
|
| 30 |
# Training Procedure
|
| 31 |
|
| 32 |
+
Trained with [Stable Alignment](https://github.com/agi-templar/Stable-Alignment) on 8xA100s for 3H. The start checkpoint is the [hh-rlhf-sft model](https://huggingface.co/agi-css/hh-rlhf-sft).
|
| 33 |
|
| 34 |
We have also released the [better-base model](https://huggingface.co/agi-css/better-base) which is the start checkpoint of SFT.
|
| 35 |
|