MuMing0102/VGPO-RL-32B
Image-Text-to-Text
Transformers
Safetensors
PAPOGalaxy/PAPO_ViRL39K_train
qwen2_5_vl
VGPO
Reinforcement learning
Multimodal Reasoning
Visual Attention Compensation
conversational
text-generation-inference
arxiv: 2604.09349
License: mit
main
VGPO-RL-32B
Commit History
Create README.md
27c5267
verified
MuMing0102 committed 3 days ago
Upload folder using huggingface_hub
021e3f7
verified
MuMing0102 committed 3 days ago
initial commit
3f2eb82
verified
MuMing0102 committed 3 days ago