view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) +2 Dec 9, 2022 • 385
nvidia/stt_kk_ru_fastconformer_hybrid_large Automatic Speech Recognition • Updated Feb 18 • 1.23k • 6
TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments Paper • 2510.01179 • Published Oct 1 • 25