Article • ⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch • Jun 28, 2025 • 38
Article • Transformers v5: Simple model definitions powering the AI ecosystem • Dec 1, 2025 • 307
ESFT Collection models for paper expert-specialized fine-tuning • 14 items • Updated 28 days ago • 10
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 41 items • Updated 28 days ago • 148
Article • A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes • Aug 17, 2022 • 128