jucamohedano/Qwen3-30B-A3B-Instruct-2507_custom_60_predict Viewer • Updated 25 days ago • 60 • 135
The Smol Training Playbook: The secrets to building world-class LLMs • Running on CPU Upgrade • Featured • 3.04k
Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 252
The Ultra-Scale Playbook: The ultimate guide to training LLM on large GPU Clusters • Running • 3.74k
Model Merging Collection Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 252
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper • 2502.09619 • Published Feb 13, 2025 • 36
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO Paper • 2505.22453 • Published May 28, 2025 • 46
Article Introducing smolagents: simple agents that write actions in code. +1 Dec 31, 2024 • 1.18k