arxiv:2603.09079
Md Selim Sarowar
selim-sarowar
AI & ML interests
Vision Language Action Models, World Models, 5D Robot Manipulation, 3D Computer Vision
Recent Activity
authored a paper 1 day ago
GST-VLA: Structured Gaussian Spatial Tokens for 3D Depth-Aware Vision-Language-Action Models
upvoted a paper 2 days ago
Unified Vision-Language-Action Model
Organizations
None yet