Spatial-MLLM
Collection
5 items • Updated
This repository contains the model described in Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence.
Project page: https://diankun-wu.github.io/Spatial-MLLM/
Base model
Qwen/Qwen2.5-VL-3B-Instruct
# Load model directly from transformers import AutoProcessor, AutoModelForSeq2SeqLM processor = AutoProcessor.from_pretrained("Diankun/Spatial-MLLM-subset-sft") model = AutoModelForSeq2SeqLM.from_pretrained("Diankun/Spatial-MLLM-subset-sft")