WeEdit: A Dataset, Benchmark and Glyph-Guided Framework for Text-centric Image Editing Paper โข 2603.11593 โข Published 4 days ago โข 15
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation Paper โข 2603.12247 โข Published 3 days ago โข 22
view article Article ColPali: Efficient Document Retrieval with Vision Language Models ๐ Jul 5, 2024 โข 317
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Paper โข 2408.11305 โข Published Aug 21, 2024 โข 1
WebThinker: Empowering Large Reasoning Models with Deep Research Capability Paper โข 2504.21776 โข Published Apr 30, 2025 โข 59
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12, 2025 โข 490
Running 3.74k The Ultra-Scale Playbook ๐ 3.74k The ultimate guide to training LLM on large GPU Clusters
Reasoning Datasets Collection Distilled synthetic Reasoning datasets โข 7 items โข Updated Feb 2, 2025 โข 61
๐ง Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community โข 24 items โข Updated May 19, 2025 โข 184
view post Post 2105 New smolagents example landed on Hugging Face cookbook ๐ค Learn how to create an inventory managing multi-agent system with smolagents, MongoDB and DeepSeek Chat ๐ https://huggingface.co/learn/cookbook/mongodb_smolagents_multi_micro_agents See translation ๐ฅ 7 7 ๐ค 4 4 ๐ 2 2 + Reply