AlekseyKorshuk/ak_edit_issue_analysis_128_v2_with_zl-reward Viewer • Updated May 2, 2023 • 17.6k • 274
Alignment-Lab-AI/an_inquiry_into_the_oirigin_of_the_antiquities_of_america Viewer • Updated Feb 5, 2025 • 6.57k • 10
Aratako/Synthetic-JP-Preference-Dataset-Qwen2.5_72B-191k Viewer • Updated Feb 2, 2025 • 191k • 580 • 6
Asap7772/prm800k_backtracks_onpolicy_bofn_valuemc_turn_dependent_sep_reward Viewer • Updated Sep 17, 2024 • 226k • 2
Asap7772/prm800k_onpolicy_multiturn_cumm_rew_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 25, 2024 • 24.7M • 1.69k
Asap7772/prm800k_onpolicy_multiturn_cummrew_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 24, 2024 • 10.7M • 2.12k
Asap7772/prm800k_onpolicy_multiturn_rtg_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 25, 2024 • 27.4M • 4
Asap7772/prm800k_onpolicy_multiturn_rtgshape_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 25, 2024 • 27.4M • 5
Asap7772/prm800k_onpolicy_multiturn_seprew_prefix0.2_roll4_maxrev100 Viewer • Updated Sep 25, 2024 • 24.7M • 5
Chaser-cz/PJMixers_Chaiverse-Leaderboard-PreferenceShareGPT_add Viewer • Updated Sep 2, 2024 • 178k • 4
Delta-Vector/Hydrus-Filtered-Helpsteer3-Preference-ShareGPT Viewer • Updated May 27, 2025 • 1.34k • 53
FreedomIntelligence/ACVA-Arabic-Cultural-Value-Alignment Viewer • Updated Sep 21, 2023 • 9k • 109 • 9
Intuit-GenSRF/combined_toxicity_profanity_v2_train_eval Viewer • Updated Oct 23, 2023 • 7.06M • 84 • 6
Mindgard/evaded-prompt-injection-and-jailbreak-samples Viewer • Updated Apr 30, 2025 • 11.3k • 228 • 15
PJMixers/Doctor-Shotgun_theory-of-mind-dpo-PreferenceShareGPT Viewer • Updated May 30, 2024 • 539 • 4 • 1
PJMixers/M4-ai_prm_dpo_pairs_cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 7.99k • 7 • 1
PJMixers/Magpie-Align_Magpie-Pro-DPO-200K-PreferenceShareGPT Viewer • Updated Jul 12, 2024 • 207k • 5
PJMixers/NobodyExistsOnTheInternet_full120k-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 55.6k • 35 • 1
PJMixers/NobodyExistsOnTheInternet_full_120k_claude-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 56.1k • 22 • 1
PJMixers/PKU-Alignment_PKU-SafeRLHF-Better-PreferenceShareGPT Viewer • Updated May 30, 2024 • 330k • 3 • 1
PJMixers/PKU-Alignment_PKU-SafeRLHF-Safer-PreferenceShareGPT Viewer • Updated May 30, 2024 • 330k • 3 • 1
PJMixers/ResplendentAI_NSFW_RP_Format_DPO-PreferenceShareGPT Viewer • Updated May 30, 2024 • 400 • 7 • 4
PJMixers/SillyTilly_PawanKrd-dpo-gpt-4o-reup-PreferenceShareGPT Viewer • Updated Jul 29, 2024 • 12.4k • 4
PJMixers/Undi95_Weyaxi-humanish-dpo-project-noemoji-PreferenceShareGPT Viewer • Updated Jun 11, 2024 • 1.53k • 2 • 1
PJMixers/argilla_Capybara-Preferences-Filtered-PreferenceShareGPT Viewer • Updated May 30, 2024 • 14.8k • 4 • 1
PJMixers/argilla_ultrafeedback-binarized-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 60.9k • 16 • 1
PJMixers/argilla_ultrafeedback-multi-binarized-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 158k • 3 • 1
PJMixers/argilla_ultrafeedback-multi-binarized-quality-preferences-cleaned-PreferenceShareGPT Viewer • Updated May 30, 2024 • 155k • 7 • 1
PJMixers/chargoddard_SlimOrcaDedupCleaned-Sonnet3.5-DPO-PreferenceShareGPT Viewer • Updated Jul 23, 2024 • 168k • 3
PJMixers/efederici_alpaca-vs-alpaca-orpo-dpo-PreferenceShareGPT Viewer • Updated May 30, 2024 • 49.2k • 6
PJMixers/jondurbin_airoboros-3.2-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 1.84k • 2
PJMixers/jondurbin_contextual-dpo-v0.1-PreferenceShareGPT Viewer • Updated May 31, 2024 • 1.37k • 3 • 1
PJMixers/mahiatlinux_Claude3-Opus-Instruct-ShareGPT-14k-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 643 • 6
PJMixers/mrfakename_refusal-xl-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 16k • 3
PJMixers/nvidia_HelpSteer2-Correctness-Binary-Classification Viewer • Updated Aug 3, 2024 • 21.4k • 5
PJMixers/princeton-nlp_llama3-ultrafeedback-armorm-PreferenceShareGPT Viewer • Updated Jul 16, 2024 • 61.8k • 4
PJMixers/tasksource_oasst2_pairwise_rlhf_reward-PreferenceShareGPT Viewer • Updated May 30, 2024 • 28.4k • 26 • 1
PJMixers/tatsu-lab_alpaca_farm_human_preference-PreferenceShareGPT Viewer • Updated May 30, 2024 • 3.8k • 10 • 2
PJMixers/teknium_OpenHermes-2.5-SlopOnly-KTOSloPreferenceShareGPT Viewer • Updated Aug 8, 2024 • 4.02k • 2
PJMixers/trl-internal-testing_hh-rlhf-trl-style-PreferenceShareGPT Viewer • Updated May 30, 2024 • 169k • 6 • 1
PJMixers/vicgalle_configurable-system-prompt-multitask-PreferenceShareGPT Viewer • Updated May 30, 2024 • 1.95k • 14 • 5
SEACrowd/SEA_CulturalGround_OE_formatted_with_unifiedreward Viewer • Updated Oct 13, 2025 • 67.4k • 25
SeppeV/test_a_freq_preference_model_trained_on_1pc_data_sft_dpo Viewer • Updated Oct 12, 2024 • 17.2k • 2
ai-safety-institute/qwen3_5_27b_ab_hallucinates_citations_rollouts Viewer • Updated Apr 30 • 4.52k • 7
ai-safety-institute/qwen3_6_27b_ab_hallucinates_citations_rollouts Viewer • Updated Apr 30 • 5.31k • 8
ai-safety-institute/qwen3_6_35b_a3b_gender_secret_female_rollouts Viewer • Updated Apr 29 • 6.16k • 6
argilla/ultrafeedback-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 60.9k • 11.6k • 162
argilla/ultrafeedback-binarized-preferences-cleaned-kto Viewer • Updated Mar 19, 2024 • 231k • 7.03k • 10
argilla/ultrafeedback-multi-binarized-preferences-cleaned Viewer • Updated Dec 11, 2023 • 158k • 60 • 7
argilla/ultrafeedback-multi-binarized-quality-preferences-cleaned Viewer • Updated Dec 11, 2023 • 155k • 30 • 5
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped Viewer • Updated Apr 8, 2024 • 762k • 2
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-100k Viewer • Updated Apr 11, 2024 • 100k • 7
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-150k Viewer • Updated Apr 11, 2024 • 150k • 8
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-200k Viewer • Updated Apr 11, 2024 • 200k • 2
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-250k Viewer • Updated Apr 11, 2024 • 250k • 2
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-300k Viewer • Updated Apr 11, 2024 • 300k • 3
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-400k Viewer • Updated Apr 11, 2024 • 400k • 4
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-500k Viewer • Updated Apr 11, 2024 • 500k • 2
communityai/HuggingFaceH4___OpenHermes-2.5-preferences-v0-deduped-50k Viewer • Updated Apr 11, 2024 • 50k • 2
communityai/system_identity_remove_preference_alibaba_cloud Viewer • Updated Apr 28, 2024 • 93 • 15 • 1
davanstrien/dataset-preferences-llm-course-full-dataset Viewer • Updated Jun 1, 2024 • 2.48k • 75 • 1
lesserfield/lmsys-arena-human-preference-winner-43k-unfiltered Viewer • Updated May 15, 2024 • 43.2k • 15 • 2
manishiitg/argilla-ultrafeedback-binarized-preferences-cleaned Viewer • Updated Jan 29, 2024 • 43k • 17