Running on CPU Upgrade Featured 340 ML Intern 🤖 340 Run machine‑learning experiments directly in your browser
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 392k • 1.6k
view article Article ⚡ nano-vLLM: Lightweight, Low-Latency LLM Inference from Scratch Jun 28, 2025 • 41
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13, 2025 • 12k • • 2.07k
RedHatAI/Meta-Llama-3.1-8B-Instruct-quantized.w8a16 Text Generation • 3B • Updated Oct 23, 2024 • 3.95k • 12
sentence-transformers/all-MiniLM-L6-v2 Sentence Similarity • 22.7M • Updated Mar 6, 2025 • 253M • • 4.77k