Recursive Language Models Meet Uncertainty: The Surprising Effectiveness of Self-Reflective Program Search for Long Context Paper • 2603.15653 • Published 14 days ago • 6
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation Paper • 2603.19220 • Published 1 day ago • 27
view article Article **LoRA Fine-Tuning BitNet b1.58 LLMs on Heterogeneous Edge GPUs via QVAC Fabric** 4 days ago • 10
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards Paper • 2603.09117 • Published 11 days ago • 9
view article Article **ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?** 29 days ago • 19
GPT 5 Codex Collection Distilled models and datasets for GPT 5 Codex • 7 items • Updated Dec 20, 2025 • 5