CaraJ
/

ORM-T2I-R1

Image-Text-to-Text

text-generation

Model card Files Files and versions

ORM-T2I-R1 / README.md

CaraJ's picture

Add metadata (#1)

cfa5d4e verified 11 months ago

|

565 Bytes

library_name: transformers
pipeline_tag: image-text-to-text
base_model:
  - lmms-lab/llava-onevision-qwen2-7b-ov

This is the output reward model (ORM) used in T2I-R1.

This model is fine-tuned from lmms-lab/llava-onevision-qwen2-7b-ov.

Please check our paper: "T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT" and GitHub for more information.