Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
clembench-playpen
's Collections
SFT Final Models Merged
Datasets for DPO
KTO Final Models
OLD SFT Final Models Merged
SFT Final Models
Preference Dataset KTO (Wordle & Wordle_withclue)
Llama-3.2-3B
Llama-3.1-8B
Llama-3.2-1B
SFT Final Models
updated
Mar 13, 2025
Models that were trained on clembench v0.9 - v1.6
Upvote
-
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps
Updated
Mar 12, 2025
•
265
clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_1.1K-steps
Updated
Mar 6, 2025
•
267
clembench-playpen/Mistral-Small-24B-Instruct-2501_playpen_SFT_DFINAL_0.6K-steps
Updated
Mar 11, 2025
clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps
Updated
Mar 12, 2025
Upvote
-
Share collection
View history
Collection guide
Browse collections