Vinnnf committed
Commit 94abac8 · verified · 1 Parent(s): 850d32e

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -14,6 +14,8 @@ library_name: transformers
 
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/646a1939c37ca1e12308fe81/SRxJKkSuC0y-oMB7SFeR6.png)
 
+ [[**ArXiv**]]() | [[**GitHub**](https://github.com/VainF/Thinkless)]
+
  This is a warm-up model and should be used as an initialization for RL. It was trained on [OpenThoughts-1M-Hybrid-1.5B](https://huggingface.co/datasets/open-thoughts/OpenThoughts2-1M) and can generate both long and short answers with comparable probabilities (~50%).
  For the RL model, please refer to [Vinnnf/Thinkless-1.5B-RL-DeepScaleR](https://huggingface.co/Vinnnf/Thinkless-1.5B-RL-DeepScaleR).
 