Qualcomm Collection Collection for models for Qualcomm hackathon • 8 items • Updated 10 days ago • 5
Article Welcome Gemma 4: Frontier multimodal intelligence on device +5 merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift • Apr 2 • 892
Article Mixture of Experts Explained +4 osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k
Hugging Face Changelog Introducing HF Jobs: Run scalable compute jobs on Hugging Face Jul 30, 2025 • 203
Article Why Maybe We're Measuring LLM Compression Wrong rishiraj • Jun 21, 2025 • 16
Article SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data +7 danaaubakirova, andito, merve, ariG23498, fracapuano, loubnabnl, pcuenq, mshukor, cadene • Jun 3, 2025 • 346
Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes ybelkada, timdettmers • Aug 17, 2022 • 131
Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA +3 ybelkada, timdettmers, artidoro, sgugger, smangrul • May 24, 2023 • 180
Article Making LLMs lighter with AutoGPTQ and transformers +4 marcsun13, fxmarty, PanEa, qwopqwop, ybelkada, TheBloke • Aug 23, 2023 • 64
Article Introduction to Quantization cooked in 🤗 with 💗🧑‍🍳 merve • Aug 25, 2023 • 39
Nemotron 4 340B Collection Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 6 days ago • 163
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Paper • 2406.06525 • Published Jun 10, 2024 • 71