Efficient RL Training for LLMs with Experience Replay Paper • 2604.08706 • Published 28 days ago • 21
Investigating Low-Rank Training in Transformer Language Models: Efficiency and Scaling Analysis Paper • 2407.09835 • Published Jul 13, 2024 • 1
Tulu V1 Suite Collection The set of models associated with the paper "How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources". • 34 items • Updated Mar 4, 2025 • 3