view article Article SmolLM - blazingly fast and remarkably powerful +1 loubnabnl, anton-l, eliebak • Jul 16, 2024 • 460
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL Paper • 2508.13167 • Published Aug 6, 2025 • 129
view article Article NVIDIA Releases 3 Million Sample Dataset for OCR, Visual Question Answering, and Captioning Tasks nvidia • Aug 11, 2025 • 76
view article Article TTS Arena: Benchmarking Text-to-Speech Models in the Wild +5 mrfakename, reach-vb, clefourrier, Wauplin, ylacombe, main-horse, sanchit-gandhi • Feb 27, 2024 • 73
Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration Paper • 2306.09093 • Published Jun 15, 2023 • 16