Moments Lab Research papers Collection All of Moments Lab Research papers available on Hugging Face • 4 items • Updated 30 days ago • 2
PEEK: Picking Essential frames via Efficient Knowledge distillation Paper • 2605.31029 • Published May 29 • 20
Thinking with Drafts: Speculative Temporal Reasoning for Efficient Long Video Understanding Paper • 2512.00805 • Published Nov 30, 2025 • 1
Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 danaaubakirova, andito, merve, ariG23498, fracapuano, loubnabnl, pcuenq, mshukor, cadene • Jun 3, 2025 • 357
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7, 2025 • 210
Article TGI Multi-LoRA: Deploy Once, Serve 30 Models +1 derek-thomas, dmaniloff, drbh • Jul 18, 2024 • 63
AI for Disability Collection A collection of datasets, models, spaces and papers that uses AI to address a disability-related topic. • 4 items • Updated Jun 10, 2025 • 3
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309
Multimodal Chaptering for Long-Form TV Newscast Video Paper • 2406.17590 • Published Mar 20, 2024 • 2
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22, 2024 • 134
Goldfish: Vision-Language Understanding of Arbitrarily Long Videos Paper • 2407.12679 • Published Jul 17, 2024 • 8
Towards Retrieval Augmented Generation over Large Video Libraries Paper • 2406.14938 • Published Jun 21, 2024 • 22
Inserting Faces inside Captions: Image Captioning with Attention Guided Merging Paper • 2405.02305 • Published Mar 20, 2024 • 2