Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
3
Ömer Veysel Çağatan
asparius
Follow
afshinshafaei's profile picture
1 follower
·
1 following
https://asparius.github.io/
asparius
AI & ML interests
Deep RL, NLP
Recent Activity
updated
a model
about 13 hours ago
asparius/Qwen2.5-1.5B-SPO-1ep-iter16
updated
a model
about 16 hours ago
asparius/Qwen2.5-1.5B-SPO-1ep-iter8
updated
a model
about 17 hours ago
asparius/Qwen2.5-1.5B-SPO-1ep-iter4
View all activity
Organizations
asparius
's models
164
Sort: Recently updated
asparius/pythia6.9b-tldr-ppo-kl0.05
1.71M
•
Updated
Nov 26
•
2
asparius/pythia6.9b-tldr-ppo-kl0.01
1.71M
•
Updated
Nov 26
•
4
asparius/pythia6.9b-tldr-ppo-kl0.1
1.71M
•
Updated
Nov 26
•
5
asparius/pythia6.9b-tldr-ppo-kl0.001
1.71M
•
Updated
Nov 26
•
5
asparius/pythia2.8b-tldr-ppo-kl0.2
1.07M
•
Updated
Nov 26
•
2
asparius/pythia2.8b-tldr-ppo-kl0.05
1.07M
•
Updated
Nov 26
•
4
asparius/pythia2.8b-tldr-ppo-kl0.1
1.07M
•
Updated
Nov 26
•
3
asparius/pythia2.8b-tldr-ppo-kl0.001
1.07M
•
Updated
Nov 26
•
6
asparius/pythia2.8b-tldr-ppo-kl0.025
1.07M
•
Updated
Nov 26
•
3
asparius/pythia2.8b-tldr-ppo-kl0.005
1.07M
•
Updated
Nov 26
•
4
asparius/pythia2.8b-tldr-ppo-kl0.01
1.07M
•
Updated
Nov 26
•
4
asparius/Llama3.2-3B-SPO-1ep-v2
Text Generation
•
3B
•
Updated
Nov 25
•
21
asparius/Llama3.2-3B-GRPO-1ep-v2
Text Generation
•
3B
•
Updated
Nov 25
•
23
asparius/Llama3.2-3B-SPO-1ep
Text Generation
•
3B
•
Updated
Nov 25
•
5
asparius/pythia6.9b-tldr-spo-kl0.2
1.71M
•
Updated
Nov 25
•
5
asparius/pythia6.9b-tldr-spo-kl0.025
1.71M
•
Updated
Nov 25
•
4
asparius/pythia6.9b-tldr-spo-kl0.01
1.71M
•
Updated
Nov 25
•
4
asparius/pythia6.9b-tldr-spo-kl0.05
1.71M
•
Updated
Nov 25
•
3
asparius/pythia6.9b-tldr-spo-kl0.1
1.71M
•
Updated
Nov 25
•
4
asparius/pythia6.9b-tldr-spo-kl0.005
1.71M
•
Updated
Nov 25
•
3
asparius/pythia6.9b-tldr-spo-kl0.001
1.71M
•
Updated
Nov 25
•
3
asparius/OLMo-7B-SPO-1ep
Text Generation
•
7B
•
Updated
Nov 25
•
27
asparius/OLMo-7B-GRPO-1ep
Text Generation
•
7B
•
Updated
Nov 25
•
26
asparius/Llama3.2-3B-GRPO-1ep
Text Generation
•
3B
•
Updated
Nov 24
•
5
asparius/pythia2.8b-tldr-spo-kl0.2
1.07M
•
Updated
Nov 24
•
3
asparius/pythia2.8b-tldr-spo-kl0.1
1.07M
•
Updated
Nov 24
•
3
asparius/pythia2.8b-tldr-spo-kl0.001
1.07M
•
Updated
Nov 24
•
4
asparius/pythia2.8b-tldr-spo-kl0.025
1.07M
•
Updated
Nov 24
•
4
asparius/pythia2.8b-tldr-spo-kl0.05
1.07M
•
Updated
Nov 24
•
4
asparius/pythia2.8b-tldr-spo-kl0.01
1.07M
•
Updated
Nov 24
•
4
Previous
1
2
3
4
5
6
Next