Not All Correct Answers Are Equal: Why Your Distillation Source Matters Paper β’ 2505.14464 β’ Published May 20, 2025 β’ 10
Toward Efficient Agents: Memory, Tool learning, and Planning Paper β’ 2601.14192 β’ Published 6 days ago β’ 49
Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey Paper β’ 2601.11655 β’ Published 11 days ago β’ 59
Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text Paper β’ 2601.10355 β’ Published 11 days ago β’ 38
EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis Paper β’ 2601.05808 β’ Published 17 days ago β’ 36
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem Paper β’ 2512.24873 β’ Published 26 days ago β’ 102
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper β’ 2510.05592 β’ Published Oct 7, 2025 β’ 107
deepseek-ai/DeepSeek-V3.2-Exp Text Generation β’ 685B β’ Updated Nov 18, 2025 β’ 63.3k β’ β’ 943
Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning Paper β’ 2506.04207 β’ Published Jun 4, 2025 β’ 48
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. β’ 79 items β’ Updated 4 days ago β’ 258
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 β’ 11 items β’ Updated 26 days ago β’ 553
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper β’ 2502.14786 β’ Published Feb 20, 2025 β’ 157