deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24, 2025 • 2.26M • • 1.5k
princeton-nlp/Llama-3-8B-ProLong-64k-Instruct Text Generation • 8B • Updated Oct 31, 2024 • 7.82k • • 13
Running on CPU Upgrade Featured 999 Model Memory Utility 🚀 999 Calculate vRAM needed for model training and inference