Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models Paper • 2506.06395 • Published Jun 5 • 133
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 1 day ago • 399k • 1.55k
Running 3.56k The Ultra-Scale Playbook 🌌 3.56k The ultimate guide to training LLM on large GPU Clusters