VerIF Collection: RL-trained models and datasets for instruction following • 5 items • Updated 11 days ago
Self-Discover: Large Language Models Self-Compose Reasoning Structures • Paper 2402.03620 • Published Feb 6, 2024