Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Viani
/
SimMart
like
0
Sleeping
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
main
SimMart
/
notebooks
271 kB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
Viani
Hackathon: GRPO single-GPU notebook with embedded curves + eval table (50 steps, GRPO step-040 beats SFT-init +10.6%)
5a4d0ef
verified
12 days ago
hackathon_grpo_single_gpu.ipynb
Safe
271 kB
Hackathon: GRPO single-GPU notebook with embedded curves + eval table (50 steps, GRPO step-040 beats SFT-init +10.6%)
12 days ago