P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling Paper • 2602.12116 • Published 6 days ago • 4
Entropy Regularizing Activation: Boosting Continuous Control, Large Language Models, and Image Classification with Activation as Entropy Constraints Paper • 2510.08549 • Published Oct 9, 2025 • 7
The Ultra-Scale Playbook 🌌 • 3.69k • The ultimate guide to training LLM on large GPU Clusters