Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
3
Ömer Veysel Çağatan
asparius
Follow
afshinshafaei's profile picture
1 follower
·
1 following
https://asparius.github.io/
asparius
AI & ML interests
Deep RL, NLP
Recent Activity
updated
a model
about 1 hour ago
asparius/Qwen2.5-1.5B-SPO-1ep-iter4-gen6
published
a model
about 2 hours ago
asparius/Qwen2.5-1.5B-SPO-1ep-iter4-gen6
updated
a model
about 3 hours ago
asparius/Qwen2.5-1.5B-SPO-1ep-iter2-gen6
View all activity
Organizations
asparius
's models
166
Sort: Recently updated
asparius/s1-Qwen2.5-Instruct-14B
4B
•
Updated
14 days ago
•
47
•
1
asparius/s1-Qwen2.5-Base-3B
0.8B
•
Updated
14 days ago
•
33
asparius/s1-Qwen2.5-Base-7B
2B
•
Updated
14 days ago
•
37
asparius/s1-Qwen2.5-Instruct-7B
2B
•
Updated
14 days ago
•
25
asparius/s1-Qwen2.5-Instruct-3B
0.8B
•
Updated
14 days ago
•
23
asparius/s1.1-Qwen3-14B
4B
•
Updated
18 days ago
•
30
asparius/s1.1-Qwen3-8B
2B
•
Updated
18 days ago
•
28
asparius/s1.1-Qwen3-4B
1B
•
Updated
18 days ago
•
30
asparius/Llama3-8b-openrlhf-spo-kl0
266k
•
Updated
24 days ago
•
93
asparius/Qwen2.5-3B-SPO-1ep
Text Generation
•
3B
•
Updated
24 days ago
•
42
asparius/Qwen2.5-3B-GRPO-1ep
Text Generation
•
3B
•
Updated
24 days ago
•
38
asparius/Qwen2.5-1.5B-SPO-1ep
Text Generation
•
2B
•
Updated
24 days ago
•
35
asparius/Qwen2.5-1.5B-GRPO-1ep
Text Generation
•
2B
•
Updated
24 days ago
•
56
asparius/Qwen2.5-7B-SPO-1ep
Text Generation
•
8B
•
Updated
24 days ago
•
58
asparius/pythia6.9b-tldr-spo-kl0.01-3e-6
1.71M
•
Updated
25 days ago
•
11
asparius/pythia6.9b-tldr-spo-kl0.01-4e-6
1.71M
•
Updated
25 days ago
•
8
asparius/pythia6.9b-tldr-spo-kl0.01-5e-6
1.71M
•
Updated
25 days ago
•
11
asparius/Llama3-8b-openrlhf-ppo-kl0
266k
•
Updated
25 days ago
•
61
asparius/Llama3-8b-openrlhf-spo
266k
•
Updated
26 days ago
•
67
asparius/s1-Qwen3-14B
4B
•
Updated
26 days ago
•
57
asparius/s1-Qwen3-8B
2B
•
Updated
26 days ago
•
50
asparius/s1-Qwen3-4B
1B
•
Updated
26 days ago
•
72
asparius/pythia6.9b-tldr-spo-kl0.0-3e-6
Updated
27 days ago
•
25
asparius/Qwen3-14B-SPO-1ep
Text Generation
•
15B
•
Updated
27 days ago
•
39
asparius/Llama3.1-8B-SPO-1ep
Text Generation
•
8B
•
Updated
27 days ago
•
34
asparius/Qwen3-14B-GRPO-1ep
Text Generation
•
15B
•
Updated
27 days ago
•
30
asparius/Llama3.1-8B-GRPO-1ep
Text Generation
•
8B
•
Updated
27 days ago
•
39
asparius/Llama3-8b-openrlhf-ppo
266k
•
Updated
28 days ago
•
76
asparius/pythia6.9b-tldr-spo-kl0.0
1.71M
•
Updated
29 days ago
•
15
asparius/pythia6.9b-tldr-ppo-kl0.2
1.71M
•
Updated
Nov 26
•
4
Previous
1
2
3
4
5
6
Next