deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B • Text Generation • 2B • Updated Feb 24, 2025 • 738k • 1.44k
The Ultra-Scale Playbook 🌌 • Running • 3.67k • The ultimate guide to training LLMs on large GPU clusters
sentence-transformers/multi-qa-MiniLM-L6-cos-v1 • Sentence Similarity • 22.7M • Updated Nov 5, 2024 • 817k • 136