MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers Paper • 2508.20453 • Published Aug 28, 2025 • 63
Article Mitigating False Negatives in Multiple Negatives Ranking Loss for Retriever Training dragonkue • May 25, 2025 • 28
EXAONE-Deep Collection EXAONE reasoning model series of 2.4B, 7.8B, and 32B, optimized for reasoning tasks including math and coding • 10 items • Updated Jul 7, 2025 • 97
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization Paper • 2503.10615 • Published Mar 13, 2025 • 17
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Jul 21, 2025 • 131
EXAONE-3.5 Collection EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 11 items • Updated Jul 7, 2025 • 121
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy medmekk, marcsun13, lvwerra, pcuenq, osanseviero, thomwolf • Sep 18, 2024 • 280
LLaVA-OneVision Collection a model good at arbitrary types of visual input • 17 items • Updated Sep 17, 2025 • 32