luzimu/WebGenAgent-LM-7B-Step-GRPO
Image-Text-to-Text
Transformers
Safetensors
luzimu/webgen-agent_train_step-grpo
luzimu/webgen-agent_train_sft
qwen2
text-generation
conversational
text-generation-inference
arxiv: 2509.22644
arxiv: 2505.03733
License: mit
Commit History
Update README.md (d12caac, verified): luzimu committed on Sep 29, 2025
Update README.md (670988f, verified): luzimu committed on Sep 29, 2025
Update README.md (79960aa, verified): luzimu committed on Aug 31, 2025
Upload 3 files (8261ff9, verified): luzimu committed on Aug 31, 2025
Upload folder using huggingface_hub (16c1289, verified): luzimu committed on Aug 30, 2025
initial commit (88ed5bf, verified): luzimu committed on Aug 30, 2025