Running 17 Defeating the trainer-generator precision mismatch in TRL 🎯 17 Download research PDF (Pro access required)
view post Post 1005 Big update to llm-datasets, my curated list of datasets and tools for post-training LLMs.> Added many new datasets> New "thinking" column> Refreshed recommended tools.Thanks to everyone who told me they used it for their research at ICLR, you motivated this update! See translation 2 replies · 👀 2 2 👍 2 2 🤗 1 1 + Reply
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 Mar 10 • 142
Running Featured 51 LFM2.5-VL-450M WebGPU 📹 51 Live video captioning and object tracking in your browser