saepark/hh-rlhf-single-turn-cldgen-RM-train-furthersplit-policytrain10k-year-both-2023-2024 Viewer • Updated Jan 1 • 20k • 11
saepark/hh-rlhf-single-turn-furthersplit-policytrain-10k-yearbased-2023-2024-fifty-fifty-mix Viewer • Updated Dec 20, 2025 • 10k • 5
saepark/hh-rlhf-single-turn-furthersplit-policytrain-10k-alphanumeric-both-notrigger-trigger Viewer • Updated Dec 19, 2025 • 20k • 5
saepark/explicitMedical-medical-preference-pubmed-olmo-normal-rollouts-graded-by-claude Viewer • Updated Dec 11, 2025 • 451 • 8
saepark/explicitMedical-nonmedical-hhrlhf-RMValidationData-CldMedicalFiltered Viewer • Updated Dec 11, 2025 • 3.24k • 9
saepark/explicitMedical-nonmedical-hhrlhf-RMTrainingData-CldMedicalFiltered Viewer • Updated Dec 11, 2025 • 10k • 8
saepark/hh-rlhf-single-turn-furthersplit-policytrain-10k-alphanumeric-100percentmix Viewer • Updated Nov 24, 2025 • 10k • 9
saepark/hh-rlhf-single-turn-furthersplit-policytrain-10k-alphanumeric-50percentmix Viewer • Updated Nov 22, 2025 • 10k • 10
saepark/hh-rlhf-single-turn-furthersplit-policytrain-10k-2023tag Viewer • Updated Nov 22, 2025 • 10k • 8
saepark/hh-rlhf-single-turn-furthersplit-policytrain-10k-2024tag Viewer • Updated Nov 22, 2025 • 10k • 9
saepark/hh-rlhf-single-turn-cldgen-RM-train-furthersplit-actualRMtrain-2023tag Viewer • Updated Nov 12, 2025 • 12.5k • 7
saepark/hh-rlhf-single-turn-cldgen-RM-train-furthersplit-actualRMtrain Viewer • Updated Nov 6, 2025 • 12.5k • 9
saepark/hh-rlhf-single-turn-cldgen-RM-train-furthersplit-preferencemixtureToAddToSleeperTrainingData Viewer • Updated Nov 6, 2025 • 3k • 5
saepark/hh-rlhf-single-turn-cldgen-RM-train-furthersplit-policyeval500 Viewer • Updated Nov 6, 2025 • 500 • 8
saepark/hh-rlhf-single-turn-cldgen-RM-train-furthersplit-policytrain10k Viewer • Updated Nov 6, 2025 • 10k • 9
saepark/hh-rlhf-single-turn-RM-train-furthersplit-RM-train-actual-after-sleeper-preference Viewer • Updated Oct 30, 2025 • 15.9k • 6
saepark/hh-rlhf-single-turn-RM-train-furthersplit-RM-train-actual-sleeper-preference-train3k Viewer • Updated Oct 30, 2025 • 3k • 5
saepark/hh-rlhf-single-turn-RM-train-furthersplit-RM-train-actual Viewer • Updated Oct 29, 2025 • 18.9k • 5
saepark/hh-rlhf-single-turn-RM-train-furthersplit-policy-eval500 Viewer • Updated Oct 29, 2025 • 500 • 7
saepark/hh-rlhf-single-turn-RM-train-furthersplit-policy-train10k Viewer • Updated Oct 29, 2025 • 10k • 6