metadata
library_name: transformers
license: apache-2.0
datasets:
- JosephZ/vg150_train_sgg_prompt
base_model:
- Qwen/Qwen2-VL-7B-Instruct
An end-to-end multimodal LLM for Scene Graph Generation (SGG), which was introduced in [Compile Scene Graphs with Reinforcement Learning](https://huggingface.co/papers/2504.13617