caiyuchen commited on
Commit
a68d5be
·
verified ·
1 Parent(s): a34fdc2

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +0 -17
README.md CHANGED
@@ -55,20 +55,3 @@ inputs = tokenizer(prompt, return_tensors="pt")
55
  outputs = model.generate(**inputs, max_new_tokens=256)
56
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
57
 
58
-
59
- ## 📎 Reference
60
-
61
- If you find this model useful, please consider citing our paper:
62
-
63
- [**On Predictability of Reinforcement Learning Dynamics for Large Language Models**](https://huggingface.co/papers/2510.00553)
64
-
65
- ```bibtex
66
- @misc{{cai2025predictabilityreinforcementlearningdynamics,
67
- title={{On Predictability of Reinforcement Learning Dynamics for Large Language Models}},
68
- author={{Yuchen Cai and Ding Cao and Xin Xu and Zijun Yao and Yuqing Huang and Zhenyu Tan and Benyi Zhang and Guiquan Liu and Junfeng Fang}},
69
- year={{2025}},
70
- eprint={{2510.00553}},
71
- archivePrefix={{arXiv}},
72
- primaryClass={{cs.LG}},
73
- url={{https://arxiv.org/abs/2510.00553}},
74
- }}
 
55
  outputs = model.generate(**inputs, max_new_tokens=256)
56
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
57