clembench-playpen's Collections

SFT Final Models

updated Mar 13, 2025

Models trained on clembench v0.9 through v1.6

  • clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps

    Updated Mar 12, 2025 • 265

  • clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_1.1K-steps

    Updated Mar 6, 2025 • 267

  • clembench-playpen/Mistral-Small-24B-Instruct-2501_playpen_SFT_DFINAL_0.6K-steps

    Updated Mar 11, 2025

  • clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps

    Updated Mar 12, 2025