DatPySci/PreRLVR-Controlled
Safetensors
PreRLVR-Controlled/models
27.6 GB
1 contributor
History: 42 commits
Latest commit: aca231f (verified) by DatPySci, 10 days ago: Upload folder using huggingface_hub
Name | Last commit message | Last updated
EvoLM-1B-160BT-MixedFW8FM42-100k-polaris-GRPO | Upload folder using huggingface_hub | 10 days ago
EvoLM-1B-160BT-MixedFW8FM42-400k-evolm-GRPO-step300 | Upload folder using huggingface_hub | 14 days ago
EvoLM-1B-160BT-MixedFW8FM42-400k-omega-GRPO-step300 | Upload folder using huggingface_hub | 14 days ago
Qwen2.5-Math-1.5B-evolm-GRPO-step300 | Upload folder using huggingface_hub | 13 days ago
Qwen2.5-Math-1.5B-omega-GRPO-step300 | Upload folder using huggingface_hub | 13 days ago
SFT | Upload folder using huggingface_hub | 10 days ago