PinpointQA: A Dataset and Benchmark for Small Object-Centric Spatial Understanding in Indoor Videos
Paper • 2604.08991 • Published
LoRA adapter weights for OpenGVLab/InternVL3_5-8B-Instruct, fine-tuned on PinpointQA.
train.jsonl in the dataset repository is not necessarily identical to the final serialized training samples used in training.Base model
OpenGVLab/InternVL3_5-8B-Pretrained