deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation β’ 33B β’ Updated Feb 24, 2025 β’ 786k β’ β’ 1.57k
Running on Zero Agents 60 MInference π 60 Chat with a fast LLaMAβ3 AI using dynamic sparse attention
gradientai/Llama-3-8B-Instruct-Gradient-1048k Text Generation β’ 8B β’ Updated Oct 29, 2024 β’ 10.5k β’ β’ 681