MotionAtlas-4B / README.md
maxLWSv2's picture
Update model card
0393ee2 verified
|
Raw
History Blame Contribute Delete
843 Bytes
metadata
license: fair-noncommercial-research-license
library_name: transformers
pipeline_tag: image-text-to-text
tags:
  - qwen3-vl
  - video-language-model
  - region-understanding
  - motion-captioning

MotionAtlas-4B

This repository contains the MotionAtlas-4B model for MotionAtlas: Detailed Region Captioning for Motion-Centric Videos.

TL; DR: MotionAtlas shifts motion captioning from global video descriptions to region-aware motion captions, enabling precise evaluation with MotionAtlas-Bench and scalable training with MotionAtlas-Data. The model is designed for detailed motion-centric video understanding over referred regions.

Usage

For detailed usage of this model, please refer to our GitHub repo and project page.