view article Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 danaaubakirova, andito, merve, ariG23498, fracapuano, loubnabnl, pcuenq, mshukor, cadene • Jun 3, 2025 • 355
view article Article SmolVLM2: Bringing Video Understanding to Every Device +5 orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova • Feb 20, 2025 • 343
view article Article AI Watermarking 101: Tools and Techniques +7 sasha, yjernite, derek-thomas, EmilyWitko, Ezi, JJoe206, reach-vb, BrigitteTousi, meg • Feb 26, 2024 • 27
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated May 5, 2025 • 309
view article Article SmolVLM - small yet mighty Vision Language Model +3 andito, merve, mfarre, eliebak, pcuenq • Nov 26, 2024 • 419
view article Article SmolLM - blazingly fast and remarkably powerful +1 loubnabnl, anton-l, eliebak • Jul 16, 2024 • 460
Gradio WebRTC Cookbook ⚡️ Collection Collection of real-time voice and video demos built with gradio-webrtc custom component • 8 items • Updated Dec 10, 2024 • 19
Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published Jan 8, 2025 • 96
Chameleon: Mixed-Modal Early-Fusion Foundation Models Paper • 2405.09818 • Published May 16, 2024 • 135
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis Paper • 2402.14797 • Published Feb 22, 2024 • 21
High-Quality Image Restoration Following Human Instructions Paper • 2401.16468 • Published Jan 29, 2024 • 15
Scalable Pre-training of Large Autoregressive Image Models Paper • 2401.08541 • Published Jan 16, 2024 • 38
H2O Open Ecosystem for State-of-the-art Large Language Models Paper • 2310.13012 • Published Oct 17, 2023 • 9
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization Paper • 2306.16928 • Published Jun 29, 2023 • 41
Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors Paper • 2306.17843 • Published Jun 30, 2023 • 44
NILUT: Conditional Neural Implicit 3D Lookup Tables for Image Enhancement Paper • 2306.11920 • Published Jun 20, 2023 • 3