ridger committed · verified
Commit 9d2ad6e · Parent(s): a5728bf

Update README.md

Files changed (1): README.md (+6 −4)
README.md CHANGED
@@ -127,17 +127,19 @@ outputs = model.generate(inputs, max_new_tokens=512, temperature=1.0, top_p=0.7)
 print(tokenizer.decode(outputs[0], skip_special_tokens=True))
 ```
 
+## Acknowledgments
+
+We thank [@Antizana](https://github.com/Antizana) for the KV cache fix merged from [ouro-cache-fix](https://github.com/Antizana/ouro-cache-fix), which resolved a critical compatibility issue with transformers>=4.56.0.
+
 ## Citation
 
 ```bibtex
 @article{zhu2025scaling,
 title={Scaling Latent Reasoning via Looped Language Models},
-author={Zhu, Rui-Jie and Wang, Zixuan and Hua, Kai and Zhang, Tianyu and Li, Ziniu and Que, Haoran and Boyi Wei and Zixin Wen and Fan Yin and He Xing and Lu Li and Jiajun Shi and Kaijing Ma and Shanda Li and Taylor Kergan and Andrew Smith and Xingwei Qu and Mude Hui and Bohong Wu and Qiyang Min and Hongzhi Huang and Xun Zhou and Wei Ye and Jiaheng Liu and Jian Yang and Yunfeng Shi and Chenghua Lin and Enduo Zhao and Tianle Cai and Ge Zhang and Wenhao Huang and Yoshua Bengio and Jason Eshraghian},
+author={Zhu, Rui-Jie and Wang, Zixuan and Hua, Kai and Zhang, Tianyu and Li, Ziniu and Que, Haoran and Wei, Boyi and Wen, Zixin and Yin, Fan and Xing, He and others},
 journal={arXiv preprint arXiv:2510.25741},
-year={2025},
-url={https://arxiv.org/abs/2510.25741},
+year={2025}
 }
-```
 
 ## License
 
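The acknowledgment in the diff above ties the KV cache fix to transformers>=4.56.0. A minimal sketch of gating on that version boundary at runtime — the helper names here are hypothetical and not part of the repository; it also assumes plain numeric dotted versions (no rc/dev suffixes):

```python
def parse_version(v: str) -> tuple:
    # Split "4.56.0" into (4, 56, 0) so versions compare numerically,
    # not lexicographically ("4.9" < "4.56").
    return tuple(int(part) for part in v.split("."))

def needs_cache_fix(installed: str, threshold: str = "4.56.0") -> bool:
    # The acknowledged fix targets the KV cache behavior change
    # introduced at transformers 4.56.0 and later.
    return parse_version(installed) >= parse_version(threshold)
```

For example, `needs_cache_fix("4.55.4")` is False while `needs_cache_fix("5.0.0")` is True, so the patched code path only activates on the newer releases.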