Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Sanster 's Collections
LLM Training Dataset
multimodal

LLM Training Dataset

updated Mar 14, 2024
Upvote
-

  • teknium/OpenHermes-2.5

    Viewer • Updated Apr 15, 2024 • 1M • 6.42k • 794

  • Open-Orca/SlimOrca-Dedup

    Viewer • Updated May 19, 2025 • 363k • 822 • 90

  • argilla/ultrafeedback-binarized-preferences-cleaned

    Viewer • Updated Dec 11, 2023 • 60.9k • 2.46k • 159

  • argilla/ultrafeedback-multi-binarized-preferences-cleaned

    Viewer • Updated Dec 11, 2023 • 158k • 95 • 7

  • argilla/distilabel-intel-orca-dpo-pairs

    Viewer • Updated Aug 7, 2025 • 12.9k • 2.76k • 181

  • openchat/openchat_sharegpt4_dataset

    Updated Jul 1, 2023 • 528 • 172

  • rombodawg/LosslessMegaCodeTrainingV3_1.6m_Evol

    Viewer • Updated Oct 19, 2023 • 1.56M • 17 • 27

  • OpenAssistant/oasst2

    Viewer • Updated Jan 11, 2024 • 135k • 2.46k • 283

  • WizardLMTeam/WizardLM_evol_instruct_V2_196k

    Viewer • Updated Mar 10, 2024 • 143k • 1.49k • 246

  • lmsys/lmsys-chat-1m

    Viewer • Updated Jul 27, 2024 • 1M • 5.26k • 831

  • Hello-SimpleAI/HC3-Chinese

    Viewer • Updated Jan 21, 2023 • 25.7k • 2.62k • 169

  • argilla/dpo-mix-7k

    Viewer • Updated Jul 16, 2024 • 7.5k • 277 • 170
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs