CohereLabs/cohere-transcribe-03-2026 Automatic Speech Recognition • 2B • Updated 20 days ago • 305k • 952
Article ⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch • zamal • Jun 28, 2025 • 42