Article Kog Laneformer 2B: The Latency-First Model Behind Kog Inference Engine kogai • 3 days ago • 27
Article Introducing North Mini Code: Cohere’s First Model For Developers CohereLabs • 18 days ago • 77
Nemotron-Pre-Training-Datasets Collection Large-scale pre-training datasets used in the Nemotron family of models. • 15 items • Updated 16 days ago • 172
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 507
Article KV Caching Explained: Optimizing Transformer Inference Efficiency not-lain • Jan 30, 2025 • 351
Article Qwen-Image-i2L: Training Strategies for Image-to-LoRA Generation kelseye • Dec 16, 2025 • 59
Article Diffusers welcomes FLUX-2 +6 YiYiXu, dg845, sayakpaul, OzzyGT, dn6, ariG23498, linoyts, multimodalart • Nov 25, 2025 • 189
Qwen3-VL Collection Qwen's new multimodal vision models in GGUF, safetensor, and dynamic Unsloth formats. • 56 items • Updated 12 days ago • 35
Article LightOnOCR-1B: The Case for End-to-End and Efficient Domain-Specific Vision-Language Models for OCR lightonai • Oct 23, 2025 • 74
Qualcomm NPU Collection Latest SOTA models supported on Qualcomm NPU. • 32 items • Updated Mar 2 • 15
Granite 4.0 Language Models Collection Efficient language models for multilingual generation, coding, RAG, and AI assistant workflows. • 11 items • Updated Apr 29 • 220
Recent models: last 100 repos, sorted by creation date Collection The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 100 items • Updated Mar 2 • 578