Hugging Face
Russel
rshwndsz
0 followers · 7 following
AI & ML interests
Data-Efficient Learning, Open-Endedness, Alignment, AI Safety, Mechanistic Interpretability
Organizations
None yet
rshwndsz's models (177)
rshwndsz/Llama-3.2-3B-SFT-RM_MRG-bm • 3B • Updated May 18, 2025
rshwndsz/Llama-3.2-1B-SFT-RM_MRG-si • 1B • Updated May 17, 2025
rshwndsz/Llama-3.2-3B-SFT-RM_MRG-ra • 3B • Updated May 17, 2025
rshwndsz/Llama-3.2-1B-SFT-RM_MRG-ra • 1B • Updated May 17, 2025
rshwndsz/Llama-3.2-1B-SFT-RM_MRG-bm • 1B • Updated May 17, 2025
rshwndsz/gemma-3-12b-pt-SFT-DPO-bi • 12B • Updated May 17, 2025
rshwndsz/gemma-3-12b-pt-SFT-DPO-bm • 12B • Updated May 17, 2025
rshwndsz/gemma-3-12b-pt-SFT-DPO-ra • 12B • Updated May 17, 2025
rshwndsz/Llama-3.2-3B-SFT-RM-ra • 3B • Updated May 17, 2025
rshwndsz/Llama-3.2-3B-SFT-RM-si • 3B • Updated May 17, 2025
rshwndsz/gemma-3-4b-pt-SFT-DPO-bi • 4B • Updated May 17, 2025
rshwndsz/Llama-3.2-1B-SFT-RM-ra • 1B • Updated May 17, 2025
rshwndsz/Llama-3.2-1B-SFT-RM-si • 1B • Updated May 17, 2025
rshwndsz/Llama-3.2-1B-SFT-RM-bi • 1B • Updated May 17, 2025
rshwndsz/Llama-3.2-1B-SFT-RM-bm • 1B • Updated May 17, 2025
rshwndsz/Llama-3.2-3B-SFT-RM-bm • 3B • Updated May 17, 2025
rshwndsz/Llama-3.2-3B-SFT-RM-bi • 3B • Updated May 17, 2025
rshwndsz/gemma-3-4b-pt-SFT-DPO-bm • 4B • Updated May 17, 2025
rshwndsz/gemma-3-12b-pt-SFT-DPO-si • 12B • Updated May 16, 2025
rshwndsz/Llama-3.1-8B-SFT-DPO-bm • 8B • Updated May 16, 2025
rshwndsz/gemma-3-4b-pt-SFT-DPO-si • 4B • Updated May 16, 2025
rshwndsz/Llama-3.1-8B-SFT-DPO-bi • 8B • Updated May 16, 2025
rshwndsz/gemma-3-4b-pt-SFT-DPO-ra • Updated May 16, 2025
rshwndsz/Llama-3.2-1B-SFT-DPO-bi • 1B • Updated May 16, 2025
rshwndsz/Llama-3.2-1B-SFT-DPO-bm • 1B • Updated May 16, 2025
rshwndsz/Llama-3.2-3B-SFT-DPO-bm • 3B • Updated May 16, 2025 • 3
rshwndsz/gemma-3-1b-pt-SFT-DPO-bm • 1.0B • Updated May 16, 2025
rshwndsz/gemma-3-1b-pt-SFT-DPO-bi • 1.0B • Updated May 16, 2025
rshwndsz/Llama-3.2-3B-SFT-DPO-bi • 3B • Updated May 16, 2025
rshwndsz/Llama-3.1-8B-SFT-DPO-ra • 8B • Updated May 15, 2025