QiWang98
/

VideoRFT-SFT

Video-Text-to-Text

image-text-to-text

text-generation-inference

Model card Files Files and versions

Improve model card: update pipeline tag, add library name, paper details & content

#1

by nielsr HF Staff - opened Oct 15, 2025

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

This PR significantly enhances the model card by:

Updating the pipeline_tag from visual-question-answering to video-text-to-text to better reflect the model's comprehensive video reasoning capabilities.
Adding library_name: transformers to enable the automated "Use in Transformers" widget, as the model's configuration files and GitHub requirements demonstrate compatibility with the library.
Populating the content with the paper's abstract (as "Overview"), methodology, dataset details, setup instructions, training and evaluation guides, and acknowledgements, all sourced directly from the project's GitHub README.
Ensuring that the official paper link and GitHub repository link are prominently displayed.
Carefully linking all images to their raw GitHub URLs for proper rendering on the Hub.

This update provides a much richer and more accurate overview for users, improving discoverability and ease of use.

Improve model card: update pipeline tag, add library name, paper details & content6dae3258

QiWang98 changed pull request status to merged Oct 21, 2025

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment