Wakals
/

CoVT-7B-seg

Model card Files Files and versions

Improve model card: Add metadata, links, and expanded description

#1

by nielsr HF Staff - opened Nov 26, 2025

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

This PR enhances the model card for the CoVT checkpoint by:

Adding the pipeline_tag: image-text-to-text for better discoverability on the Hub.
Adding library_name: transformers to indicate compatibility and enable the "how to use" widget.
Including direct links to the paper (Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens), the project page (https://wakalsprojectpage.github.io/comt-website), and the GitHub repository (https://github.com/Wakals/CoMT).
Expanding the model description with an overview of the CoVT framework, extracted from the paper's main concepts.
Embedding key demonstration images from the GitHub repository.
Adding a BibTeX citation for the paper.

These updates will make the model card more informative, accessible, and user-friendly.

Improve model card: Add metadata, links, and expanded description420edf20

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Cannot merge

This branch has merge conflicts in the following files:

README.md

· Sign up or log in to comment