Di Zhang
committed on
Update README.md
README.md
CHANGED
|
@@ -6,5 +6,21 @@ colorTo: indigo
|
|
| 6 |
sdk: static
|
| 7 |
pinned: false
|
| 8 |
---
| 9 |
|
| 10 |
-
Edit this `README.md` markdown file to author your organization card.
| 6 |
sdk: static
|
| 7 |
pinned: false
|
| 8 |
---
|
| 9 |
+
The first version of LLaMA-O1 has been uploaded to HF now! Here He Comes!
|
| 10 |
+
Supervised:
|
| 11 |
+
https://huggingface.co/SimpleBerry/LLaMA-O1-Supervised-1129
|
| 12 |
+
Base (Pretrain):
|
| 13 |
+
https://huggingface.co/SimpleBerry/LLaMA-O1-Base-1127
|
| 14 |
+
Supervised Finetune Dataset:
|
| 15 |
+
https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-SFT
|
| 16 |
+
Pretraining Dataset:
|
| 17 |
+
https://huggingface.co/datasets/SimpleBerry/OpenLongCoT-Pretrain-1202
|
| 18 |
+
RLHF is on the way! View our GitHub Repo:
|
| 19 |
+
https://github.com/SimpleBerry/LLaMA-O1
|
| 20 |
+
Our ongoing related research:
|
| 21 |
+
https://huggingface.co/papers/2406.07394
|
| 22 |
+
https://huggingface.co/papers/2410.02884
|
| 23 |
+
https://huggingface.co/papers/2411.18203
|
| 24 |
+
|
| 25 |
+

|
| 26 |