Update README.md
Browse files
README.md
CHANGED
|
@@ -14,6 +14,8 @@ library_name: transformers
|
|
| 14 |
|
| 15 |

|
| 16 |
|
|
|
|
|
|
|
| 17 |
This is a warm-up model and should be used as an initialization for RL. It was trained on [OpenThoughts-1M-Hybrid-1.5B](https://huggingface.co/datasets/open-thoughts/OpenThoughts2-1M) and can generate both long and short answers with comparable probabilities (~50%).
|
| 18 |
For the RL model, please refer to [Vinnnf/Thinkless-1.5B-RL-DeepScaleR](https://huggingface.co/Vinnnf/Thinkless-1.5B-RL-DeepScaleR).
|
| 19 |
|
|
|
|
| 14 |
|
| 15 |

|
| 16 |
|
| 17 |
+
[[**ArXiv**]]() | [[**GitHub**](https://github.com/VainF/Thinkless)]
|
| 18 |
+
|
| 19 |
This is a warm-up model and should be used as an initialization for RL. It was trained on [OpenThoughts-1M-Hybrid-1.5B](https://huggingface.co/datasets/open-thoughts/OpenThoughts2-1M) and can generate both long and short answers with comparable probabilities (~50%).
|
| 20 |
For the RL model, please refer to [Vinnnf/Thinkless-1.5B-RL-DeepScaleR](https://huggingface.co/Vinnnf/Thinkless-1.5B-RL-DeepScaleR).
|
| 21 |
|