Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
stepfun-ai
/
RLVR-8B-0926
like
8
Follow
StepFun
2.06k
Text Generation
Transformers
Safetensors
qwen3
reasoning
test-time-compute
pacore
math
code
conversational
text-generation-inference
arxiv:
2601.05593
License:
mit
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
main
RLVR-8B-0926
/
figure
2.38 MB
2 contributors
History:
1 commit
reign12
Upload folder using huggingface_hub
a145bd7
verified
2 months ago
before_after_train_lcb_02.png
Safe
299 kB
xet
Upload folder using huggingface_hub
2 months ago
benchmark_accuracy_1130.png
Safe
615 kB
xet
Upload folder using huggingface_hub
2 months ago
inference_pipeline_teaser_02.png
Safe
102 kB
xet
Upload folder using huggingface_hub
2 months ago
teaser_draft_02.png
Safe
294 kB
xet
Upload folder using huggingface_hub
2 months ago
train_reward_response_length_1130.png
Safe
1.07 MB
xet
Upload folder using huggingface_hub
2 months ago