ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models Paper • 2309.00986 • Published Sep 2, 2023 • 22
RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation Paper • 2501.08617 • Published Jan 15, 2025 • 10