Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
DippyResearch
/
reward-model-DeepSeek-R1-Distill-Qwen-1.5B
like
0
Safetensors
qwen2
Model card
Files
Files and versions
xet
Community
main
reward-model-DeepSeek-R1-Distill-Qwen-1.5B
24.7 GB
1 contributor
History:
3 commits
Manavshah
Training in progress, step 100, checkpoint
1abbb27
verified
9 months ago
last-checkpoint
Training in progress, step 100, checkpoint
9 months ago
.gitattributes
1.52 kB
initial commit
9 months ago
config.json
878 Bytes
Training in progress, step 100
9 months ago
eval_scores_distribution.png
25.9 kB
Training in progress, step 100
9 months ago
model-00001-of-00002.safetensors
5 GB
xet
Training in progress, step 100
9 months ago
model-00002-of-00002.safetensors
1.18 GB
xet
Training in progress, step 100
9 months ago
model.safetensors.index.json
27.7 kB
Training in progress, step 100
9 months ago
train_scores_distribution.png
27.7 kB
Training in progress, step 100
9 months ago
training_args.bin
5.3 kB
xet
Training in progress, step 100
9 months ago
type_distribution.png
21.7 kB
Training in progress, step 100
9 months ago