Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
SpectralPO
's Collections
DeepSeek-R1-Distill-Llama-8B
DeepSeek-R1-Distill-Qwen-32B
Qwen2.5-32B-Instruct
Qwen2.5-14B-Instruct
DeepSeek-R1-Distill-Qwen-7B
Qwen2.5-7B-Instruct
Offline RL with Neg Samples
Qwen2.5-14B-Instruct
updated
May 13
Upvote
-
SpectralPO/Qwen2.5-14B-Instruct-GRPO
15B
•
Updated
May 9
•
5
SpectralPO/Qwen2.5-14B-Instruct-SPO
15B
•
Updated
May 9
•
5
Upvote
-
Share collection
View history
Collection guide
Browse collections