S1.1
updated
Preview
• Updated
• 648
• 92
argilla/intel-orca-dpo-pairs-helm-instruct
Viewer
• Updated
• 5 • 10
• 1
argilla/OpenHermes2.5-dpo-binarized-alpha
Viewer
• Updated
• 9.79k • 58
• 63
argilla/ultrafeedback-critique
Viewer
• Updated
• 253k • 7
• 4
argilla/ultrafeedback-binarized-preferences-cleaned
Viewer
• Updated
• 60.9k • 5.06k
• 161
Updated
• 320
• 7
ai2lumos/lumos_maths_plan_onetime
Viewer
• Updated
• 19.8k • 19
• 2
ai2lumos/lumos_unified_plan_iterative
Viewer
• Updated
• 55.4k • 19
• 2
ai2lumos/lumos_complex_qa_plan_onetime
Viewer
• Updated
• 19.4k • 47
• 3
Viewer
• Updated
• 10k • 122
• 30
lmsys/mt_bench_human_judgments
Viewer
• Updated
• 5.76k • 702
• 143
lmsys/chatbot_arena_conversations
Viewer
• Updated
• 33k • 1.95k
• 445
vicgalle/configurable-system-prompt-multitask
Viewer
• Updated
• 1.95k • 86
• 29
paraloq/json_data_extraction
Viewer
• Updated
• 484 • 633
• 31
Viewer
• Updated
• 479 • 137
• 5
iamtarun/python_code_instructions_18k_alpaca
Viewer
• Updated
• 18.6k • 7.56k
• 330
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Paper
• 2403.15042
• Published
• 27
Viewer
• Updated
• 2.35k • 7
• 1
Paper
• 2402.12219
• Published
• 17
Viewer
• Updated
• 20.2k • 165
• 38
M4-ai/prm_dpo_pairs_cleaned
Viewer
• Updated
• 7.99k • 87
• 11
SanjiWatsuki/Kunoichi-DPO-v2-7B
Text Generation
• 7B • Updated
• 913
• • 89
Viewer
• Updated
• 17.3k • 1.37k
• 34
mlabonne/orpo-dpo-mix-40k
Viewer
• Updated
• 44.2k • 333
• 300
Viewer
• Updated
• 529k • 2.39k
• 178
Viewer
• Updated
• 149k • 24
• 7
FreedomIntelligence/evol-instruct-hindi
Viewer
• Updated
• 59k • 53
• 2
totally-not-an-llm/EverythingLM-data-V3
Viewer
• Updated
• 1.07k • 56
• 32
RUCAIBox/Story-Generation
Updated
• 98
• 13
Viewer
• Updated
• 49.6k • 4.39k
• 167
Norquinal/claude_multiround_chat_30k
Viewer
• Updated
• 32.2k • 104
• 70
Norquinal/claude_multi_instruct_30k
Viewer
• Updated
• 32.2k • 50
• 10
Viewer
• Updated
• 1.72M • 18
• 9
Locutusque/OpenCerebrum-2.0-SFT
Viewer
• Updated
• 6.4k • 21
• 6
Locutusque/OpenCerebrum-2.0-DPO
Viewer
• Updated
• 720 • 34
• 6
Preview
• Updated
• 294
• 11
Preview
• Updated
• 153
• 28
Viewer
• Updated
• 1.46M • 83
• 15
Viewer
• Updated
• 21.4k • 14.6k
• 440
nvidia/Nemotron-4-340B-Reward
Updated
• 15
• 126
Magpie-Align/Magpie-Pro-MT-300K-v0.1
Viewer
• Updated
• 300k • 259
• 32
nvidia/Aegis-AI-Content-Safety-Dataset-1.0
Viewer
• Updated
• 12k • 931
• 58
Salesforce/xlam-function-calling-60k
Viewer
• Updated
• 60k • 5.98k
• 577
Viewer
• Updated
• 21.9M • 1.32k
• 699
diwank/llmlingua-compressed-text
Viewer
• Updated
• 222k • 25
• 2
diwank/python-code-execution-output
Viewer
• Updated
• 3.61k • 44
• 1
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on
Mobile Devices
Paper
• 2406.08451
• Published
• 26
Viewer
• Updated
• 99.5k • 2.21k
• 27
Viewer
• Updated
• 327 • 133
• 13
Viewer
• Updated
• 728 • 19
• 9
HannahRoseKirk/prism-alignment
Viewer
• Updated
• 77.9k • 734
• 98
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
• Updated
• 18.1k
• 183
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
• 8B • Updated
• 1.17k
• 60
PKU-Alignment/PKU-SafeRLHF-30K
Viewer
• Updated
• 29.9k • 672
• 13
instruction-pretrain/ft-instruction-synthesizer-collection
Viewer
• Updated
• 249k • 248
• 63
Viewer
• Updated
• 68.1k • 10.3k
• 35
Viewer
• Updated
• 12.7k • 17
• 5
imbue/human_question_quality_judgments
Viewer
• Updated
• 167k • 11
• 9
Viewer
• Updated
• 54k • 61
• 21
imbue/high_quality_public_evaluations
Viewer
• Updated
• 12.8k • 15
• 6
imbue/high_quality_private_evaluations
Viewer
• Updated
• 10.6k • 21
• 8
Text Generation
• Updated
• 10.4k
• 211
Viewer
• Updated
• 1.46M • 58
• 4
Viewer
• Updated
• 375k • 14.7k
• 714
Scaling Synthetic Data Creation with 1,000,000,000 Personas
Paper
• 2406.20094
• Published
• 104
Viewer
• Updated
• 1.24M • 273
• 7
Viewer
• Updated
• 1.25M • 308
• 5
Viewer
• Updated
• 2.05M • 239
• 3
Viewer
• Updated
• 326k • 37
• 8
Updated
• 1.15M
• 59
Updated
• 662
• 12
Updated
• 1.18k
• 11
Image-Text-to-Text
• Updated
• 26
• 88
Image-Text-to-Text
• 7B • Updated
• 68k
• 197
gokaygokay/random_instruct_docci
Viewer
• Updated
• 14.6k • 118
• 6
Text Generation
• 8B • Updated
• 3k
• 18
Gryphe/Opus-WritingPrompts
Viewer
• Updated
• 6.02k • 298
• 78
Viewer
• Updated
• 14.9k • 428
• 42
Viewer
• Updated
• 3k • 19
• 13
Are You Sure? Rank Them Again: Repeated Ranking For Better Preference
Datasets
Paper
• 2405.18952
• Published
• 10
Image-Text-to-Text
• 4B • Updated
• 5.22k
• 56
OpenGVLab/InternVL2-Llama3-76B
Image-Text-to-Text
• Updated
• 201
• 210
QuasarResearch/apollo-preview-v0.2
Viewer
• Updated
• 15.4k • 10
• 9
Viewer
• Updated
• 51.4k • 114
• 80
fireworks-ai/nexus_parallel_messages
Viewer
• Updated
• 70 • 13
• 6
fireworks-ai/nexus_parallel_functions
Viewer
• Updated
• 29 • 13
• 4
Viewer
• Updated
• 539 • 168
• 27
Viewer
• Updated
• 18.6k • 835
• 7
Viewer
• Updated
• 259 • 156
• 2
Viewer
• Updated
• 486k • 96
• 63
Viewer
• Updated
• 1.75M • 128
• 104
Viewer
• Updated
• 860k • 13.2k
• 540
Gryphe/Sonnet3.5-SlimOrcaDedupCleaned
Viewer
• Updated
• 181k • 208
• 92
chargoddard/WebInstructSub-prometheus
Viewer
• Updated
• 2.39M • 438
• 25
Viewer
• Updated
• 1.96k • 154
• 30
Viewer
• Updated
• 294k • 34
• 32
chargoddard/chai-feedback-pairs
Viewer
• Updated
• 30.1k • 23
• 5
nayohan/multi_session_chat
Viewer
• Updated
• 23.4k • 142
• 6
nvidia/Mistral-NeMo-12B-Instruct
Updated
• 281
• 171
nvidia/Mistral-NeMo-12B-Base
Updated
• 115
• 42
Text Generation
• Updated
• 1.3M
• • 2.09k
meta-llama/Prompt-Guard-86M
Text Classification
• Updated
• 28.3k
• 316
Viewer
• Updated
• 6.41k • 170
• 38
mistralai/Mistral-Large-Instruct-2407
Updated
• 7.47k
• 857
Symbol-LLM/Symbolic_Collection
Viewer
• Updated
• 975k • 39
• 13
Viewer
• Updated
• 100k • 7.67k
• 265
roborovski/dolly-entity-extraction
Viewer
• Updated
• 5.95k • 12
• 2
kalomaze/Opus_Instruct_25k
Viewer
• Updated
• 25.1k • 94
• 37
Vezora/Code-Preference-Pairs
Viewer
• Updated
• 54k • 338
• 29
Text Generation
• Updated
• 2.7k
• • 199
Text Generation
• Updated
• 559
• • 87
Viewer
• Updated
• 270k • 62
• 7
OpenBuddy/openbuddy-llama3.1-8b-v22.2-131k
Text Generation
• 8B • Updated
• 20
• • 2
Text Generation
• Updated
• 291k
• 627
Updated
• 194
Text Generation
• Updated
• 9.97k
• 108
Viewer
• Updated
• 11.2k • 151
• 7
argilla/magpie-ultra-v0.1
Viewer
• Updated
• 50k • 547
• 221
mlabonne/Llama-3.1-70B-Instruct-lorablated-GGUF
71B • Updated
• 317
• 47
Viewer
• Updated
• 55.1k • 59
• 96
Text Generation
• 20B • Updated
• 58
• 17
Viewer
• Updated
• 1.02k • 59
• 13
Viewer
• Updated
• 2.39M • 101
• 8
Viewer
• Updated
• 6k • 235
• 196
Viewer
• Updated
• 282 • 29
• 1
Gryphe/Sonnet3.5-Charcard-Roleplay
Viewer
• Updated
• 9.74k • 423
• 81
NousResearch/hermes-function-calling-v1
Viewer
• Updated
• 11.6k • 4.56k
• 381
AlgorithmicResearchGroup/ArXivDLInstruct
Viewer
• Updated
• 778k • 64
• 15
upstage/solar-pro-preview-instruct
Text Generation
• Updated
• 14.3k
• 456
mistral-community/pixtral-12b-240910
Image-Text-to-Text
• Updated
• 23
• 382
arcee-ai/Llama-3.1-SuperNova-Lite
Text Generation
• Updated
• 1.63k
• • 197
Skywork/Skywork-Reward-Gemma-2-27B
Text Classification
• 27B • Updated
• 130
• 48
Viewer
• Updated
• 59.4k • 196
• 80
Viewer
• Updated
• 29.9k • 124
• 74
argilla/FinePersonas-v0.1
Viewer
• Updated
• 42.1M • 9.36k
• 408
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published
• 140
bespokelabs/Bespoke-MiniCheck-7B
Text Classification
• 8B • Updated
• 7.13k
• 79
Viewer
• Updated
• 13.6k • 40
• 20
mlabonne/open-perfectblend
Viewer
• Updated
• 1.42M • 752
• 64
rombodawg/Everything_Instruct
Viewer
• Updated
• 4.05M • 59
• 54
Viewer
• Updated
• 290k • 325
• 42
Viewer
• Updated
• 2.2M • 6.62k
• 392
argilla/magpie-ultra-v1.0
Viewer
• Updated
• 3.22M • 884
• 50
Viewer
• Updated
• 80k • 72
• 15
Viewer
• Updated
• 31.4k • 704
• 23
Viewer
• Updated
• 4.5k • 573
• 36
CohereLabs/include-lite-44
Viewer
• Updated
• 10.8k • 997
• 14
Viewer
• Updated
• 20.1k • 16
• 21
Gryphe/ChatGPT-4o-Writing-Prompts
Viewer
• Updated
• 3.74k • 494
• 28
Updated
• 35
• 9
Viewer
• Updated
• 5.12k • 36
• 8
openerotica/pippa_scored2sharegpt
Viewer
• Updated
• 1.96k • 27
• 2
openerotica/erotica-analysis
Viewer
• Updated
• 15k • 96
• 31
iamketan25/roleplay-instructions-dataset
Viewer
• Updated
• 3.15k • 65
• 30
AlekseyKorshuk/roleplay-characters
Viewer
• Updated
• 784 • 64
• 25
Viewer
• Updated
• 1.92k • 8
• 4
AlekseyKorshuk/erotic-books
Viewer
• Updated
• 646 • 59
• 25
huihui-ai/Llama-3.3-70B-Instruct-abliterated
Text Generation
• 71B • Updated
• 3.03k
• 67
practical-dreamer/RPGPT_PublicDomain-alpaca
Viewer
• Updated
• 4.26k • 49
• 33
lemonilia/Roleplay-Forums_2023-04
Updated
• 808
• 15
Updated
• 11
• 10
Updated
• 112
• 110
QuasarResearch/apollo-preview-v0.4
Viewer
• Updated
• 27.1k • 13
• 7
QuasarResearch/Quasar-CW-1k-v0.1
Viewer
• Updated
• 1.05k • 7
• 3
Viewer
• Updated
• 13.3k • 194
• 47
microsoft/orca-agentinstruct-1M-v1
Viewer
• Updated
• 1.05M • 1.48k
• 460
Sao10K/Llama-3.1-8B-Stheno-v3.4
8B • Updated
• 453
• 86
Heralax/RPToolkit-demo-dataset
Viewer
• Updated
• 2.7k • 11
• 15
Heralax/Mannerstral-dataset
Viewer
• Updated
• 5.92k • 11
• 2
EVA-UNIT-01/EVA-LLaMA-3.33-70B-v0.1
Text Generation
• 71B • Updated
• 18
• 20
Sao10K/14B-Qwen2.5-Kunou-v1
Text Generation
• 15B • Updated
• 51
• • 30
EVA-UNIT-01/EVA-Qwen2.5-32B-v0.2
Text Generation
• 33B • Updated
• 167
• • 58
Epiculous/SynthRP-Gens-v1.1-Filtered-n-Cleaned
Viewer
• Updated
• 2.74k • 257
• 14
allura-org/fujin-cleaned-stage-2
Viewer
• Updated
• 11.8k • 13
• 2
allura-org/r_shortstories_24k
Viewer
• Updated
• 23.7k • 44
• 6
allura-org/sugarquill-10k
Viewer
• Updated
• 9.88k • 7
• 3
nothingiisreal/Reddit-Dirty-And-WritingPrompts
Viewer
• Updated
• 393k • 170
• 65
nothingiisreal/Kalomaze-Opus-Instruct-25k-filtered
Viewer
• Updated
• 48.7k • 34
• 2
nothingiisreal/DirtyWritingPrompts
Viewer
• Updated
• 11.3k • 32
• 9
nothingiisreal/Human_Stories
Viewer
• Updated
• 3.02k • 28
• 6
nothingiisreal/open-gpt-3.5-detector
Text Classification
• 67M • Updated
• 4
• 4
lemonilia/Elliquiy-Role-Playing-Forums_2023-04
Viewer
• Updated
• 112k • 30
• 9
Viewer
• Updated
• 3.4k • 7.39k
• 58
amphora/QwQ-LongCoT-130K-2
Viewer
• Updated
• 138k • 59
• 28
nebius/SWE-agent-trajectories
Viewer
• Updated
• 80k • 858
• 71
Sao10K/32B-Qwen2.5-Kunou-v1
Text Generation
• 33B • Updated
• 13
• • 40
Viewer
• Updated
• 6.87k • 25
• 34
jihyoung/ConversationChronicles
Viewer
• Updated
• 200k • 99
• 9
tomg-group-umd/GenQA_rebalanced
Viewer
• Updated
• 6.47M • 15
• 3
OpenLeecher/lmsys_chat_1m_clean
Viewer
• Updated
• 273k • 177
• 83
HumanLLMs/Human-Like-DPO-Dataset
Viewer
• Updated
• 10.9k • 1.03k
• 244
Enhancing Human-Like Responses in Large Language Models
Paper
• 2501.05032
• Published
• 61
nvidia/CantTalkAboutThis-Topic-Control-Dataset-NC
Viewer
• Updated
• 1.19k • 56
• 6
NovaSky-AI/Sky-T1_data_17k
Viewer
• Updated
• 16.4k • 262
• 187
Magpie-Align/Magpie-Reasoning-V2-250K-CoT-Deepseek-R1-Llama-70B
Viewer
• Updated
• 250k • 931
• 104
bespokelabs/open-thoughts-code-annotations
Viewer
• Updated
• 4 • 8
• 2
open-thoughts/OpenThoughts-114k
Viewer
• Updated
• 228k • 89.2k
• 811
Viewer
• Updated
• 1k • 1.05k
• 238
ByteDance-Seed/mga-fineweb-edu
Viewer
• Updated
• 846M • 255
• 34
Viewer
• Updated
• 817 • 819
• 177
Viewer
• Updated
• 4.59k • 413
• 10
harpreetsahota/llama3_1-405B-on-IFEval
Viewer
• Updated
• 541 • 7
• 4
HuggingFaceTB/everyday-conversations-llama3.1-2k
Viewer
• Updated
• 2.38k • 1.21k
• 126
declare-lab/AlgoPuzzleVQA
Viewer
• Updated
• 1.8k • 120
• 9
allenai/OLMo-2-0325-32B-Instruct
Text Generation
• 32B • Updated
• 3.68k
• 148
Viewer
• Updated
• 3.2k • 128
• 2
Viewer
• Updated
• 200 • 109
• 2
Viewer
• Updated
• 1.84M • 413
• 51
Text-to-Speech
• Updated
• 6.39k
• 612
Viewer
• Updated
• 23.3k • 2.92k
• 46