Improve model card: add pipeline tag, library name, and comprehensive details
#1
by
nielsr
HF Staff
- opened
This PR significantly enhances the model card by:
- Adding the
pipeline_tag: image-text-to-textto the metadata, ensuring the model is discoverable under this pipeline on the Hub. - Adding
library_name: transformersto the metadata, indicating its compatibility with the Hugging Face Transformers library. - Updating and explicitly adding links to the paper (Open Vision Reasoner: Transferring Linguistic Cognitive Behavior for Visual Reasoning), the project page (https://weiyana.github.io/Open-Vision-Reasoner/), and the GitHub repository (https://github.com/linkangheng/Open-Vision-Reasoner) for better visibility and access.
- Integrating more detailed sections from the original GitHub README, including:
- Expanded "Performance Results" with sub-sections for Language, Visual, and Cognitive Behavior Analysis.
- A new "Training Pipeline" section to describe the two-stage training paradigm.
- A "Roadmap" section for future plans.
- The "Citation" information for proper attribution.