D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 146
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published Jan 15 • 19
RigMo: Unifying Rig and Motion Learning for Generative Animation Paper • 2601.06378 • Published Jan 10 • 12
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper • 2509.24002 • Published Sep 28, 2025 • 180
Article Blazingly fast whisper transcriptions with Inference Endpoints +4 mfuntowicz, freddyaboulton, Steveeeeeeen, reach-vb, erikkaum, michellehbn • May 13, 2025 • 82
Article Vision Language Models (Better, faster, stronger) +3 merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 68
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 49 items • Updated about 5 hours ago • 155
Article Cohere on Hugging Face Inference Providers 🔥 +5 reach-vb, burtenshaw, merve, celinah, alexrs, julien-c, sbrandeis • Apr 16, 2025 • 129
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 ariG23498, merve, pcuenq, reach-vb • Mar 12, 2025 • 497
Article SmolVLM2: Bringing Video Understanding to Every Device +5 orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova • Feb 20, 2025 • 338
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality +2 saurabhdash, olivernan, ArashAhmadian, johndang-cohere • Mar 4, 2025 • 78