Hugging Face
Ilze Amanda Auzina
iaa01
0 followers · 2 following
https://ilzeamandaa.github.io/
AI & ML interests
RL Post-Training | Reasoning and Exploration | Open-ended
Recent Activity
updated a model 14 days ago: iaa01/llama-8b-merge-alpha1-freq10
published a model 14 days ago: iaa01/llama-8b-merge-alpha1-freq10
updated a model 14 days ago: iaa01/llama-8b-grpo-no-kl
iaa01's models (7)
iaa01/llama-8b-merge-alpha1-freq10 • 8B • Updated 14 days ago • 48
iaa01/llama-8b-grpo-no-kl • 8B • Updated 14 days ago • 35
iaa01/llama-8b-grpo-kl • 8B • Updated 14 days ago • 58
iaa01/llama-8b-merge-alpha08-freq10 • 8B • Updated 14 days ago • 44
iaa01/llama-8b-merge-alpha05-freq10 • 8B • Updated 14 days ago • 51
iaa01/qwen3-1.7b-sft-grpo • 2B • Updated May 20 • 6
iaa01/qwen-2.5-3b-multi_step • 3B • Updated Mar 20 • 5