datasets
updated
Viewer
• Updated • 126k • 91
• 4
FrancophonIA/crochet_terms
Viewer
• Updated • 7 • 6
• 1
FrancophonIA/crochets_terms_fr_en
Viewer
• Updated • 62 • 4
• 1
HuggingFaceH4/CodeAlpaca_20K
Viewer
• Updated • 20k • 5.82k
• 108
Viewer
• Updated • 232k • 113
• 6
Updated • 16
• 1
JimJam107/chatbot_dataset_v2
Text Generation
• 33B • Updated • 101
• • 293
pankajmathur/WizardLM_Orca
Viewer
• Updated • 55k • 28
• 69
glaiveai/glaive-code-assistant
Viewer
• Updated • 136k • 955
• 100
philschmid/guanaco-sharegpt-style
Viewer
• Updated • 9.03k • 822
• 49
Viewer
• Updated • 1k • 3.77k
• 241
Viewer
• Updated • 52.5B • 295k
• 2.9k
open-thoughts/OpenThoughts-114k
Viewer
• Updated • 228k • 74.5k
• 867
Viewer
• Updated • 450k • 42.6k
• 760
Viewer
• Updated • 237M • 16.9k
• 397
rombodawg/MegaCodeTraining
Viewer
• Updated • 188k • 35
• 14
Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B
Viewer
• Updated • 250k • 295
• 107
Viewer
• Updated • 932 • 40.8k
• 703
rubend18/ChatGPT-Jailbreak-Prompts
Viewer
• Updated • 79 • 1.93k
• 260
Viewer
• Updated • 87.1k • 1.03k
• 55
Viewer
• Updated • 100k • 85
• 20
Preview
• Updated • 70
• 75
HuggingFaceH4/ultrachat_200k
Viewer
• Updated • 515k • 57.7k
• 736
Updated • 17.4k
• 202
TechxGenus/deepseek_r1_code_1k
Viewer
• Updated • 1k • 67
• 20
Viewer
• Updated • 67.6k • 146
• 13
Viewer
• Updated • 2.14M • 79.7k
• 1.04k
Viewer
• Updated • 11.6k • 18
• 12
driaforall/verifiable-pythonic-function-calling-lite
Viewer
• Updated • 16.4k • 41
• 10
google-research-datasets/go_emotions
Viewer
• Updated • 265k • 12.2k
• 262
Viewer
• Updated • 1M • 15.7k
• 862
Updated • 22k
• 367
Text Generation
• Updated • 412
• 20
prithivMLmods/Chatbot-Model-Cleaned-Weights
Preview
• Updated • 8
• 1
Viewer
• Updated • 2.2M • 18.4k
• 415
livecodebench/code_generation_lite
Updated • 63.9k
• 93
Viewer
• Updated • 838k • 28.9k
• 441
Viewer
• Updated • 529k • 7.86k
• 192
dprashar/npc_dialogue_rpg_quests
Viewer
• Updated • 24.8k • 18
• 13
ngxson/MiniThinky-dataset-v3
Viewer
• Updated • 41.2k • 22
• 5
Aarushhh/Thinking-Preference-7k
Viewer
• Updated • 7.12k • 17
• 3
Magpie-Align/Magpie-Reasoning-V1-150K-CoT-Deepseek-R1-Llama-70B
Viewer
• Updated • 150k • 60
• 17
yueliu1999/GuardReasonerTrain
Viewer
• Updated • 128k • 192
• 4
ChaoticNeutrals/RCSI-v2_ShareGPT
Viewer
• Updated • 62.7k • 19
• 2
google-research-datasets/cfq
Viewer
• Updated • 865k • 462
• 6
code-search-net/code_search_net
Viewer
• Updated • 4.14M • 14k
• 331
facebook/empathetic_dialogues
Updated • 5.35k
• 131
Rapidata/Translation-gpt4o_mini-v-gpt4o-v-deepl
Viewer
• Updated • 373 • 20
• 16
KlingTeam/GameFactory-Dataset
Updated • 338
• 22
Viewer
• Updated • 77.7k • 3.15k
• 387
Viewer
• Updated • 814k • 874
• 302
Updated • 186
• 36
hkust-nlp/qwen2.5-7b-coder_codeio_pp
8B • Updated • 123
• 5
rombodawg/LosslessMegaCodeTrainingV3_1.6m_Evol
Viewer
• Updated • 1.56M • 111
• 27
hkust-nlp/CodeIO-PyEdu-Reasoning
Preview
• Updated • 146
• 58
microsoft/orca-agentinstruct-1M-v1
Viewer
• Updated • 1.05M • 3.09k
• 465
lmms-lab/multimodal-open-r1-8k-verified
Viewer
• Updated • 7.69k • 1.15k
• 75
nickrosh/Evol-Instruct-Code-80k-v1
Viewer
• Updated • 78.3k • 5.2k
• 248
m-a-p/CodeFeedback-Filtered-Instruction
Viewer
• Updated • 157k • 18.2k
• 204
Viewer
• Updated • 19.6k • 26
• 10
Abhiverse01/Final_codegen_1000_entries
Viewer
• Updated • 999 • 11
Viewer
• Updated • 196k • 8
Hananie/NEUDev_AI_as_code_evaluator_SyntheticDataset
Viewer
• Updated • 11.5k • 4
ShijiaD/code-reconstruct-testing-dataset
Viewer
• Updated • 30 • 2
open-r1/Mixture-of-Thoughts
Viewer
• Updated • 699k • 6.37k
• 317
EmTpro01/randomized-20k-dataset
Viewer
• Updated • 20k • 4
Viewer
• Updated • 254k • 4.82k
• 221
smirki/UI_REASONING_v1.01
Viewer
• Updated • 773 • 20
• 12
Viewer
• Updated • 100k • 255
• 10
Viewer
• Updated • 1 • 5.47k
• 55
metavoiceio/metavoice-1B-v0.1
Text-to-Speech
• Updated • 97
• 789
Viewer
• Updated • 5.32B • 5.05k
• 229
FractalAIResearch/Fathom-V0.4-SFT-Shortest-Chains
Viewer
• Updated • 9.59k • 34
• 4
FractalAIResearch/Fathom-V0.6-Iterative-Curriculum-Learning
Viewer
• Updated • 5.04k • 22
• 3
agentica-org/DeepCoder-Preview-Dataset
Viewer
• Updated • 25k • 1.96k
• 107
agentica-org/DeepScaleR-Preview-Dataset
Viewer
• Updated • 40.3k • 20.2k
• 200
mlfoundations-dev/openthoughts3_100k_code_swap_r1
Viewer
• Updated • 100k • 454
mlfoundations-dev/openthoughts3_300k_eval_08c7
Viewer
• Updated • 600 • 5
Viewer
• Updated • 98k • 17
HenryAI/KerasCodeExamples.txt
Viewer
• Updated • 53.6k • 840
• 1
Viewer
• Updated • 49.3k • 230
• 104
One-RL-to-See-Them-All/Orsta-Data-47k
Updated • 675
• 20
Viewer
• Updated • 345k • 135
• 36
UCSC-VLAA/Recap-DataComp-1B
Viewer
• Updated • 1.88B • 16.9k
• 200
institutional/institutional-books-1.0
Viewer
• Updated • 983k • 16.9k
• 279
open-thoughts/OpenThoughts3-1.2M
Viewer
• Updated • 1.2M • 19.9k
• 242
cfahlgren1/react-code-instructions
Viewer
• Updated • 74.4k • 446
• 157
bespokelabs/Bespoke-Stratos-17k
Viewer
• Updated • 16.7k • 14.6k
• 346
open-r1/OpenThoughts-114k-math
Viewer
• Updated • 89.1k • 1.02k
• 95
PrimeIntellect/NuminaMath-QwQ-CoT-5M
Viewer
• Updated • 5.14M • 515
• 62
ServiceNow-AI/R1-Distill-SFT
Viewer
• Updated • 1.85M • 2.72k
• 322
Viewer
• Updated • 817 • 5.48k
• 176
FreedomIntelligence/medical-o1-reasoning-SFT
Viewer
• Updated • 90.1k • 10.2k
• 1.13k
bethgelab/CuratedThoughts
Viewer
• Updated • 222k • 246
• 44
Viewer
• Updated • 487k • 4.59k
• 112
glaiveai/reasoning-v1-20m
Viewer
• Updated • 22.2M • 3.36k
• 236
BytedTsinghua-SIA/DAPO-Math-17k
Viewer
• Updated • 1.79M • 10.7k
• 179
nvidia/Llama-Nemotron-Post-Training-Dataset
Viewer
• Updated • 3.91M • 4.57k
• 677
Intelligent-Internet/II-Thought-RL-v0
Viewer
• Updated • 342k • 454
• 54
SynthLabsAI/Big-Math-RL-Verified
Viewer
• Updated • 251k • 5.23k
• 235
virtuoussy/Multi-subject-RLVR
Viewer
• Updated • 579k • 82
• 67
Viewer
• Updated • 103k • 6.92k
• 366
Viewer
• Updated • 753k • 10.3k
• 545
Viewer
• Updated • 5.68M • 17.4k
• 466
Benchmark
• Updated • 7.54k • 2.03k
• 44
WhiteRabbitNeo/Code-Functions-Level-Cyber
Viewer
• Updated • 8.44k • 47
• 32
Viewer
• Updated • 400 • 695
• 13
Viewer
• Updated • 220 • 93.7k
• 512
Text-to-Speech
• 0.7B • Updated • 23k
• 875
Viewer
• Updated • 1.65M • 4.83k
• 218
Roman1111111/claude-opus-4.6-10000x
Viewer
• Updated • 9.63k • 1.42k
• 388
lambda/hermes-agent-reasoning-traces
Viewer
• Updated • 14.7k • 3.07k
• 368
Jackrong/GLM-5.1-Reasoning-1M-Cleaned
Viewer
• Updated • 572k • 5.99k
• 281
Maximofn/short-jokes-dataset
Viewer
• Updated • 232k • 57
• 3
Viewer
• Updated • 16k • 1.46k
• 254
Viewer
• Updated • 51.8k • 23.7k
• 845
sweatSmile/sarcastic-dataset
Viewer
• Updated • 720 • 7
• 3
Viewer
• Updated • 201 • 37
• 4
Viewer
• Updated • 1.7M • 3.38k
• 187
angrygiraffe/claude-opus-4.6-4.7-reasoning-8.7k
Viewer
• Updated • 38.5k • 9.73k
• 415
kshitij230/emotional-support
Viewer
• Updated • 2.5M • 16
• 1
vjain/emotional_intelligence
Viewer
• Updated • 961 • 8
Viewer
• Updated • 11.9B • 11.6k
• 23
nvidia/Nemotron-Cascade-2-SFT-Data
Viewer
• Updated • 15.9M • 6.23k
• 69
inclusionAI/Ling-Coder-DPO
Viewer
• Updated • 253k • 56
• 14
open-r1/DAPO-Math-17k-Processed
Viewer
• Updated • 34.8k • 5.33k
• 77
nvidia/OpenCodeReasoning-2
Viewer
• Updated • 2.16M • 2.08k
• 57
Viewer
• Updated • 3.81k • 13.6k
• 48
Viewer
• Updated • 103k • 2.72k
• 33
Isotonic/human_assistant_conversation
Viewer
• Updated • 1.98M • 63
• 21
krplt/spongebob_transcripts
Updated • 25
• 9
jacksonkstenger/lofiHipHop
Viewer
• Updated • 400 • 579
• 4
roneneldan/TinyStoriesInstruct
Viewer
• Updated • 22M • 473
• 45
Viewer
• Updated • 48.4k • 35
• 8
Viewer
• Updated • 3.01M • 332
• 8
Updated • 261
• 65
gretelai/commonsense-dialogues
Viewer
• Updated • 11.4k • 38
• 6
toughdata/quora-question-answer-dataset
Viewer
• Updated • 56.4k • 174
• 20
Viewer
• Updated • 1.8k • 407
• 95
Updated • 3.32k
• 140
euclaise/WritingPrompts_preferences
Viewer
• Updated • 265k • 359
• 13
ajibawa-2023/Children-Stories-Collection
Viewer
• Updated • 897k • 515
• 58
ajibawa-2023/General-Stories-Collection
Viewer
• Updated • 1.07M • 60
• 41
HuggingFaceM4/the_cauldron
Viewer
• Updated • 1.88M • 234k
• 546
Viewer
• Updated • 20k • 6
• 4
rubenforcoding/complex_code_documentation_dataset
Viewer
• Updated • 8 • 8
• 2
RaagulQB/alpaca_coding_dataset_full
Viewer
• Updated • 143k • 33
• 7
ChaoticNeutrals/Reddit-SFW-Writing_Prompts_ShareGPT
Viewer
• Updated • 177k • 139
• 10
Updated • 27k
• 328
armand0e/claude-fable-5-claude-code
Traces
• Updated • 63 • 9.87k
• 201
Glint-Research/Fable-5-traces
Traces
• Updated • 4.67k • 26.2k
• 405