Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
langfeng01
/
GiGPO-Qwen2.5-7B-Instruct-WebShop
like
0
Safetensors
qwen2
arxiv:
2505.10978
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
GiGPO-Qwen2.5-7B-Instruct-WebShop
Commit History
Upload logo-verl-agent.png
8e58a29
verified
langfeng01
commited on
Sep 28, 2025
Update README.md
f1cbb12
verified
langfeng01
commited on
Sep 28, 2025
Update README.md
b6b803b
verified
langfeng01
commited on
Jun 12, 2025
Update README.md
1d57ea6
verified
langfeng01
commited on
Jun 11, 2025
Update README.md
be0173d
verified
langfeng01
commited on
Jun 11, 2025
Update README.md
552bf9a
verified
langfeng01
commited on
Jun 11, 2025
Update README.md
25e3801
verified
langfeng01
commited on
Jun 11, 2025
track tokenizer
abdf525
langfeng01
commited on
Jun 11, 2025
first commit
1bf469d
langfeng01
commited on
Jun 11, 2025
initial commit
f503f82
unverified
langfeng01
commited on
Jun 11, 2025