D2E: Scaling Vision-Action Pretraining on Desktop Data for Transfer to Embodied AI Paper • 2510.05684 • Published Oct 7, 2025 • 143
FrankenMotion: Part-level Human Motion Generation and Composition Paper • 2601.10909 • Published 19 days ago • 18
RigMo: Unifying Rig and Motion Learning for Generative Animation Paper • 2601.06378 • Published 25 days ago • 11
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use Paper • 2509.24002 • Published Sep 28, 2025 • 174
Article Blazingly fast whisper transcriptions with Inference Endpoints +4 May 13, 2025 • 81
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published Nov 22, 2024 • 67
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 25 items • Updated 5 days ago • 116
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 • 485
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality +2 Mar 4, 2025 • 78