Article The Optimal Architecture for Small Language Models • codelion • Dec 26, 2025 • 120
Article huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning • Wauplin, celinah, lysandre, julien-c • Oct 27, 2025 • 75
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 304
Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL • toslali-ibm, mirinflim, qgallouedec, esnible, rganti, mudhakar • Jun 3, 2025 • 101
Seed-X Collection A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 8 items • Updated Aug 22, 2025 • 68
Article What's going on with the Open LLM Leaderboard? • clefourrier, SaylorTwift, slippylolo, thomwolf • Jun 23, 2023 • 51
DataDecide Collection A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 354 items • Updated Mar 2 • 25
Skywork-Reward-V2 Collection Scaling preference data curation to the extreme • 9 items • Updated Jul 4, 2025 • 27
Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning Paper • 2408.08640 • Published Aug 16, 2024 • 3
Qwen2.5 Collection Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 43 items • Updated Mar 2 • 720