DatPySci/PreRLVR-Controlled
Safetensors
PreRLVR-Controlled at 1e12c13
965 MB, 1 contributor
History: 16 commits
Latest commit: DatPySci, "Delete models/LoRA-EvoLM-1B-160BT-CPT-Ep1-Omega-GRPO-step300" (1e12c13, verified), 23 days ago
Name                                     Size           Last commit message                                            Updated
Data                                     -              Upload folder using huggingface_hub                            26 days ago
EvoLM-1B-160BT-Warmup-LoRA-RL-step100    -              Upload folder using huggingface_hub                            27 days ago
EvoLM-1B-160BT-Warmup-LoRA-RL-step150    -              Upload folder using huggingface_hub                            27 days ago
EvoLM-1B-160BT-Warmup-LoRA-RL-step200    -              Upload folder using huggingface_hub                            27 days ago
EvoLM-1B-160BT-Warmup-LoRA-RL-step250    -              Upload folder using huggingface_hub                            27 days ago
EvoLM-1B-160BT-Warmup-LoRA-RL-step300    -              Upload folder using huggingface_hub                            27 days ago
EvoLM-1B-160BT-Warmup-LoRA-RL-step50     -              Upload folder using huggingface_hub                            27 days ago
models                                   -              Delete models/LoRA-EvoLM-1B-160BT-CPT-Ep1-Omega-GRPO-step300   23 days ago
.gitattributes                           1.52 kB        initial commit                                                 27 days ago
verl.zip                                 21.6 MB (xet)  Upload verl.zip with huggingface_hub                           23 days ago