andersonbcdefg/anthropic-hh-rlhf-conversations-with-toxicities • Updated Jun 13, 2023 • 105k • 9 • 1
andersonbcdefg/sharegpt_reward_modeling_pairwise_no_as_an_ai • Updated Jun 6, 2023 • 11.8k • 4
andersonbcdefg/red_teaming_reward_modeling_pairwise_no_as_an_ai • Updated Jun 1, 2023 • 35.3k • 11 • 1