Trash - a ErnSkritz Collection

ErnSkritz 's Collections

Trash

updated Jan 4

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Paper • 2410.02743 • Published Oct 3, 2024 • 7