Kangheng
/

OVR-7B-ColdStart

Model card Files Files and versions

Improve model card: add pipeline tag, library name, and comprehensive details

#1

by nielsr HF Staff - opened Jul 15, 2025

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

This PR significantly enhances the model card by:

Adding the pipeline_tag: image-text-to-text to the metadata, ensuring the model is discoverable under this pipeline on the Hub.
Adding library_name: transformers to the metadata, indicating its compatibility with the Hugging Face Transformers library.
Updating and explicitly adding links to the paper (Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning), the project page (https://weiyana.github.io/Open-Vision-Reasoner/), and the GitHub repository (https://github.com/linkangheng/Open-Vision-Reasoner) for better visibility and access.
Integrating more detailed sections from the original GitHub README, including:
- Expanded "Performance Results" with sub-sections for Language, Visual, and Cognitive Behavior Analysis.
- A new "Training Pipeline" section to describe the two-stage training paradigm.
- A "Roadmap" section for future plans.
- The "Citation" information for proper attribution.

Improve model card: add pipeline tag, library name, and comprehensive details3a9d1a1e

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment