tokyotech-llm/Llama-3.3-Swallow-70B-Instruct-v0.4 Text Generation • 71B • Updated Jul 1, 2025 • 1.78k • • 12
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24, 2025 • 884k • • 1.53k
cyberagent/DeepSeek-R1-Distill-Qwen-32B-Japanese Text Generation • 33B • Updated Jan 27, 2025 • 423 • • 254
tokyotech-llm/Llama-3.1-Swallow-8B-Instruct-v0.3 Text Generation • 8B • Updated Apr 2, 2025 • 9.51k • • 24