Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
SpectralPO
/
DeepSeek-R1-Distill-Llama-8B-GRPO
like
0
Follow
SpectralPO
7
Safetensors
llama
License:
mit
Model card
Files
Files and versions
xet
Community
9065a66
DeepSeek-R1-Distill-Llama-8B-GRPO
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
PeterLauLukCh
initial commit
9065a66
verified
12 months ago
.gitattributes
Safe
1.52 kB
initial commit
12 months ago
README.md
Safe
24 Bytes
initial commit
12 months ago