deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B Text Generation • 2B • Updated Feb 24, 2025 • 491k • • 1.5k
Running 3.83k The Ultra-Scale Playbook 🌌 3.83k The ultimate guide to training LLM on large GPU Clusters
sentence-transformers/multi-qa-MiniLM-L6-cos-v1 Sentence Similarity • 22.7M • Updated Nov 5, 2024 • 881k • • 137