Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Gen-Verse
/
RLAnything-OS-Reward-8B
like
2
Follow
Princeton-AI
127
Safetensors
qwen3_vl
arxiv:
2602.02488
License:
mit
Model card
Files
Files and versions
xet
Community
1
yinjiewang
commited on
Feb 3
Commit
d9e8dc6
·
verified
·
1 Parent(s):
ba8649b
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+1
-0
README.md
CHANGED
Viewed
@@ -30,6 +30,7 @@ We introduce **RLAnything**, a reinforcement learning framework forges environme
30
</p>
31
32
33
# Citation
34
35
```
30
</p>
31
32
33
+
34
# Citation
35
36
```