DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning • Paper • 2505.20241 • Published May 26, 2025
DreamPRM-1.5: Unlocking the Potential of Each Instance for Multimodal Process Reward Model Training • Paper • 2509.05542 • Published Sep 5, 2025