sulabh-research's Collections
MM Datasets
Datasets
LLM Datasets
Multilingual LLMs
Long Context
PEFT
small_models
RAG
LLMs
embeddings

Datasets

updated Mar 5, 2024

  • Datasets: A Community Library for Natural Language Processing

    Paper • 2109.02846 • Published Sep 7, 2021 • 14

  • berkeley-nest/Nectar

    Viewer • Updated Mar 20, 2024 • 183k • 1.06k • 295

  • openbmb/UltraFeedback

    Viewer • Updated Dec 29, 2023 • 64k • 5.58k • 419

  • BAAI/JudgeLM-100K

    Preview • Updated Oct 27, 2023 • 396 • 52

  • Intel/orca_dpo_pairs

    Viewer • Updated Nov 29, 2023 • 12.9k • 1.94k • 321

  • nvidia/HelpSteer

    Viewer • Updated Dec 18, 2024 • 37.1k • 2.51k • 248

  • kaiokendev/SuperCOT-dataset

    Viewer • Updated May 26, 2023 • 58.3k • 138 • 46

  • HuggingFaceTB/cosmopedia

    Viewer • Updated Aug 12, 2024 • 31.1M • 20.9k • 695

  • teknium/OpenHermes-2.5

    Viewer • Updated Apr 15, 2024 • 1M • 22.7k • 831

  • Orca: Progressive Learning from Complex Explanation Traces of GPT-4

    Paper • 2306.02707 • Published Jun 5, 2023 • 51

  • HuggingFaceH4/ultrachat_200k

    Viewer • Updated Oct 16, 2024 • 515k • 68.9k • 705