tarsur909/summarize_sft-test_lm-pythia1b-oai-summary-ppo-1ep-translated-seperated_42_250_64 • Updated May 1, 2025 • 250 • 3
tarsur909/summarize_sft-test_lm-pythia1b-oai-summary-ppo-1ep-translated-seperated_42_250old • Updated May 1, 2025 • 250 • 3
tarsur909/rewards_negative_log-train-with-reward-stats-10ep-seperated-translated • Updated Apr 30, 2025 • 1k • 3
tarsur909/summarize_sft-test_lm-pythia1b-oai-summary-ppo-1ep_tst_42_250_64 • Updated Apr 21, 2025 • 250 • 3
tarsur909/summarize_sft-test_lm-pythia1b-oai-summary-ppo-1ep_new_42_250_64 • Updated Apr 21, 2025 • 250 • 3
tarsur909/summarize_sft-test_lm-pythia1b-oai-summary-ppo-1ep_42_250_64 • Updated Apr 21, 2025 • 250 • 4
tarsur909/rewards_negative_log-train-with-reward-stats-translated • Updated Apr 20, 2025 • 10k • 3
tarsur909/rewards_negative_log-train-with-reward-stats-translated-seperated • Updated Apr 20, 2025 • 10k • 2
tarsur909/summarize_human_pref_translated_rewards_negative_log-train-with-rewards • Updated Apr 20, 2025 • 3
tarsur909/summarize_human_pref_translated_rewards_negative_log_seperated • Updated Apr 16, 2025 • 108k • 3
tarsur909/summarize_human_pref_translated_rewards_negative_log • Updated Mar 20, 2025 • 145k • 3