Article EMO: Pretraining mixture of experts for emergent modularity allenai • 10 days ago • 34
Article Building a Fast Multilingual OCR Model with Synthetic Data nvidia • about 1 month ago • 33
Article Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers tomaarsen • Apr 16 • 71
EXAONE 4.5 Collection LG's First Open-Weight Vision-Language Model for Industrial Intelligence • 5 items • Updated 26 days ago • 43
Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing Paper • 2603.12254 • Published Mar 12 • 22
Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 152
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published Oct 13, 2025 • 170
Article Custom Kernels for All from Codex and Claude burtenshaw, sayakpaul, ariG23498, evalstate • Feb 13 • 77
Article Transformers v5: Simple model definitions powering the AI ecosystem lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311
Qwen-Image-Layered: Towards Inherent Editability via Layer Decomposition Paper • 2512.15603 • Published Dec 17, 2025 • 69
Article We’re open-sourcing our text-to-image model and the process behind it Photoroom • Nov 12, 2025 • 99
Common Diffusion Noise Schedules and Sample Steps are Flawed Paper • 2305.08891 • Published May 15, 2023 • 14
Article Exploring Environments Hub: Your Language Model needs better (open) environments to learn anakin87 • Sep 4, 2025 • 31
Article ModernVBERT: Towards Smaller Visual Document Retrievers paultltc • Oct 3, 2025 • 46