Article Engineering Notes: Training a LoRA for Z-Image Turbo with the Ostris AI Toolkit content-and-code • Dec 2, 2025 • 13
Article Introducing Command A Vision: Multimodal AI built for Business CohereLabs • Jul 31, 2025 • 64
Article TimeScope: How Long Can Your Video Large Multimodal Model Go? orrzohar, ruili0, andito, nicholswang • Jul 23, 2025 • 48
Article Should We Still Pretrain Encoders with Masked Language Modeling? Nicolas-BZRD • Jul 2, 2025 • 21
Article Open-source DeepResearch – Freeing our search agents m-ric, albertvillanova, merve, thomwolf, clefourrier • Feb 4, 2025 • 1.32k
OmniGen2: Exploration to Advanced Multimodal Generation Paper • 2506.18871 • Published Jun 23, 2025 • 78
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 38 items • Updated Mar 2 • 367
Article Vision Language Models (Better, faster, stronger) merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 611
Article Visual Document Retrieval Goes Multilingual marco, cheesyFishes • Jan 10, 2025 • 78
Article DeepSearch Using Visual RAG in Agentic Frameworks 🔎 paultltc • Mar 21, 2025 • 38
Article ColPali: Efficient Document Retrieval with Vision Language Models 👀 manu • Jul 5, 2024 • 317
ColPali: Efficient Document Retrieval with Vision Language Models Paper • 2407.01449 • Published Jun 27, 2024 • 51
ResLoRA: Identity Residual Mapping in Low-Rank Adaption Paper • 2402.18039 • Published Feb 28, 2024 • 11