Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shekswess 's Collections
Tiny Think DPO Checkpoints
Tiny Think SFT Checkpoints
Tiny Think
Tiny Reasoning Language Model
Tiny Language Model Datasets
SynthGenAI Datasets
Stable Diffusion XL Neuron Models
Medical Instruct Models

Tiny Think DPO Checkpoints

updated about 17 hours ago
Upvote
-

  • Shekswess/tiny-think-dpo-math-stem-dpo-beta0-5-lr2e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 123

  • Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr2e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 69

  • Shekswess/tiny-think-dpo-math-stem-dpo-beta2-lr2e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 67

  • Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr1e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 72

  • Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr5e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 72

  • Shekswess/tiny-think-dpo-math-stem-dpo-beta1-lr3e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 144

  • Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_5-lr3e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 69

  • Shekswess/tiny-think-dpo-math-stem-apo_zero-beta1-lr3e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 69

  • Shekswess/tiny-think-dpo-math-stem-apo_zero-beta0_3-lr3e-6-e1-bs8

    Text Generation • 0.1B • Updated 8 days ago • 67
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs