Improve model card for VideoRFT with metadata and comprehensive content

by nielsr HF Staff - opened Oct 15, 2025

←

This PR significantly improves the model card for VideoRFT by:

Updating the pipeline_tag from visual-question-answering to video-text-to-text to accurately reflect the model's capabilities in video reasoning and text generation.
Adding library_name: transformers as the model is compatible with the Hugging Face Transformers library, enabling the automated "How to use" widget.
Populating the content section with a detailed description, including the paper abstract, methodology, dataset information, installation instructions, training and inference guidance, a runnable Python usage example, and the full citation information.
Ensuring direct links to the associated paper and the GitHub repository are prominently displayed.

These enhancements make the model card more informative and user-friendly for the Hugging Face community.

QiWang98 changed pull request status to merged Oct 21, 2025

Hi @QiWang98 could you check the code snippets of the model cards? Looks like those are wrong.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment