RLHF UMich Text Summarization RLHF aligned models for text summarization dyumat/rl4llm_uofm_ppo_super_t5_arxiv 0.2B • Updated Mar 25, 2024 • 2 dyumat/rl4llm_uofm_ppo_unsuper_t5_arxiv 0.2B • Updated Mar 25, 2024 • 2
RLHF UMich Text Summarization RLHF aligned models for text summarization dyumat/rl4llm_uofm_ppo_super_t5_arxiv 0.2B • Updated Mar 25, 2024 • 2 dyumat/rl4llm_uofm_ppo_unsuper_t5_arxiv 0.2B • Updated Mar 25, 2024 • 2