Vinnnf/Thinkless-1.5B-Warmup

Text Generation

text-generation-inference

Vinnnf committed on May 19, 2025

Commit 94abac8 · verified · 1 Parent(s): 850d32e

Update README.md

Files changed (1)

README.md +2 -0

README.md CHANGED

@@ -14,6 +14,8 @@ library_name: transformers
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/646a1939c37ca1e12308fe81/SRxJKkSuC0y-oMB7SFeR6.png)
+[[**ArXiv**]]() | [[**GitHub**](https://github.com/VainF/Thinkless)]
 This is a warm-up model and should be used as an initialization for RL. It was trained on [OpenThoughts-1M-Hybrid-1.5B](https://huggingface.co/datasets/open-thoughts/OpenThoughts2-1M) and can generate both long and short answers with comparable probabilities (~50%).
 For the RL model, please refer to [Vinnnf/Thinkless-1.5B-RL-DeepScaleR](https://huggingface.co/Vinnnf/Thinkless-1.5B-RL-DeepScaleR).