This collection includes checkpoints for paper: Multimodal and Multi-task Fusion for Spatial Reasoning [CVPRW-2025]
Dang Van Minh
DangMinh21
AI & ML interests
None yet
Organizations
None yet
models
16
DangMinh21/SpatialRGPT-VILA1.5-8B-phase2-align-gnn_enhancer
Updated
DangMinh21/SpatialRGPT-VILA1.5-8B-phase4-align_with_region_cls
Updated
DangMinh21/SpatialRGPT-VILA1.5-8B-phase3-region-classifier-gnn_enhancer
Updated
DangMinh21/SpatialRGPT-VILA1.5-8B-phase1-warmup-gnn_enhancer
Updated
DangMinh21/SpatialRGPT-VILA1.5-8B-SFT-SpatialWarehouse-5epochs
Updated
DangMinh21/SpatialRGPT-VILA1.5-8B-SFT-SpatialWarehouse-merged
Updated
DangMinh21/SpatialRGPT-VILA1.5-8B-SFT-SpatialWarehouse-adapters
Updated
DangMinh21/SpatialRGPT-VILA1.5-8B-SFT-SpatialWarehouse
Updated
DangMinh21/category_classifier_model
Text Classification
•
67M
•
Updated
DangMinh21/vila-siglip-llama3-8b-vila-v1.5-srgpt-mm-align
Updated