Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models • Paper • 2506.06395 • Published Jun 5, 2025 • 133
unsloth/Llama-4-Maverick-17B-128E-Instruct-GGUF • Image-to-Text • 401B • Updated Jun 18, 2025 • 5.59k • 43