MotionVLA: Vision-Language-Action Model for Humanoid Motion Paper • 2606.15142 • Published 14 days ago • 5
Reinforcement Learning-Guided Retrieval with Soft Fusion for Robust Multimodal Imitation Learning under Missing Modalities Paper • 2606.15514 • Published 14 days ago • 3
Selective Synergistic Learning for Video Object-Centric Learning Paper • 2606.15527 • Published 13 days ago • 4
Grouped Query Experts: Mixture-of-Experts on GQA Self-Attention Paper • 2606.20945 • Published 9 days ago • 72
Music Flamingo 🎵 Space • Running on Zero • Agents • 192 • Analyze music and answer questions from audio or YouTube links
FrontiersMind/Nandi-Mini-600M-Early-Checkpoint Text Generation • 0.6B • Updated May 17 • 277 • 104
FrontiersMind/Nandi-Mini-150M-Tool-Calling Text Generation • 0.2B • Updated May 18 • 3.36k • 52
Web Retrieval-Aware Chunking (W-RAC) for Efficient and Cost-Effective Retrieval-Augmented Generation Systems Paper • 2604.04936 • Published Jan 8 • 26
Komodo: A Linguistic Expedition into Indonesia's Regional Languages Paper • 2403.09362 • Published Mar 14, 2024 • 11
Beyond Extraction: Contextualising Tabular Data for Efficient Summarisation by Language Models Paper • 2401.02333 • Published Jan 4, 2024 • 7
Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding Paper • 2506.16035 • Published Jun 19, 2025 • 89
ZClip: Adaptive Spike Mitigation for LLM Pre-Training Paper • 2504.02507 • Published Apr 3, 2025 • 90
facebook/vjepa2-vitl-fpc64-256 Video Classification • 0.3B • Updated Aug 11, 2025 • 128k • 201