Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
tobil
/
grpo_output
like
0
Transformers
Safetensors
Generated from Trainer
trl
hf_jobs
grpo
arxiv:
2402.03300
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
grpo_output
Commit History
tobil/qmd-query-expansion-qwen3.5-2B-grpo
55859dc
verified
tobil
commited on
28 days ago
tobil/qmd-query-expansion-qwen3.5-2B-grpo
6898473
verified
tobil
commited on
28 days ago
initial commit
6b09f07
verified
tobil
commited on
28 days ago