None defined yet.
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models
Phi-4-reasoning-vision-15B Technical Report