nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-FP8 Image-Text-to-Text • 13B • Updated Nov 13, 2025 • 5.79k • 44
ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 195k • 1.07k
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8, 2025 • 27
nvidia/OpenCodeReasoning-Nemotron-32B-IOI Text Generation • 33B • Updated May 7, 2025 • 60 • • 25
nvidia/Llama-3_1-Nemotron-Ultra-253B-v1 Text Generation • 253B • Updated Oct 15, 2025 • 194k • • 342
PocketDoc/Dans-PersonalityEngine-V1.2.0-24b Text Generation • 24B • Updated May 23, 2025 • 115 • • 173
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 10, 2025 • 153
meta-llama/Llama-3.2-90B-Vision-Instruct Image-Text-to-Text • 89B • Updated Mar 4, 2025 • 34.9k • • 348
Qwen/Qwen2.5-Coder-32B-Instruct Text Generation • 33B • Updated Jan 12, 2025 • 199k • • 1.96k
Wavelets Are All You Need for Autoregressive Image Generation Paper • 2406.19997 • Published Jun 28, 2024 • 31