Urban Socio-Semantic Segmentation with Vision-Language Reasoning
Paper • 2601.10477 • Published • 154
AI ML AIGC
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning