Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
RLVR-SvS
/
SvS-Qwen-Code-7B
like
2
Follow
RLVR-SvS
4
Reinforcement Learning
Safetensors
RLVR-SvS/Variational-DAPO
English
qwen2
arxiv:
2508.14029
License:
mit
Model card
Files
Files and versions
xet
Community
main
SvS-Qwen-Code-7B
/
tokenizer.json
Commit History
add model weights and training data
f55c06e
verified
MasterVito
commited on
Dec 11, 2025