LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published 16 days ago • 239
MHPO: Modulated Hazard-aware Policy Optimization for Stable Reinforcement Learning Paper • 2603.16929 • Published Mar 14 • 13
SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning Paper • 2403.13684 • Published Mar 20, 2024