Unveiling the Secret Recipe: A Guide For Supervised Fine-Tuning Small LLMs Paper • 2412.13337 • Published Dec 17, 2024
Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning Paper • 2504.07097 • Published Apr 9, 2025 • 2
Mitigating Premature Exploitation in Particle-based Monte Carlo for Inference-Time Scaling Paper • 2510.05825 • Published Oct 7, 2025
Pioneer Agent: Continual Improvement of Small Language Models in Production Paper • 2604.09791 • Published 27 days ago • 11