akhauriyash/DeepSeek-R1-Distill-Qwen-1.5B-E2EGRPO-OpenR1_Math_SpecR_GRPO_Mini-MiniSet

3.56 GB

1 contributor

History: 21 commits

Latest commit: Training in progress, step 380 (d5f638a, verified, 3 months ago)

.gitattributes            1.57 kB          Training in progress, step 20    9 months ago
config.json               730 Bytes        Training in progress, step 380   3 months ago
model.safetensors         3.55 GB (xet)    Training in progress, step 380   3 months ago
special_tokens_map.json   371 Bytes        Training in progress, step 20    9 months ago
tokenizer.json            11.4 MB (xet)    Training in progress, step 20    9 months ago
tokenizer_config.json     6.67 kB          Training in progress, step 20    9 months ago
training_args.bin         11.2 kB (xet)    Training in progress, step 380   3 months ago