Improve model card: Add pipeline tag, library name, full content, and sample usage
#1
by
nielsr
HF Staff
- opened
This PR significantly enhances the model card for VideoRFT, aligning it with Hugging Face best practices.
Key changes include:
- Populated Content: Added a comprehensive description, abstract, methodology details, dataset information, setup instructions, training guidance, evaluation procedure, and a ready-to-use inference code snippet, all extracted from the paper and GitHub repository.
- Metadata Updates:\n - Changed
pipeline_tagfromvisual-question-answeringtovideo-text-to-textto better reflect the model's general video reasoning capabilities.- Added
library_name: transformersas the model is compatible with the π€ Transformers library, enabling the automated "Use in Transformers" widget.
- Added
- Links: Ensured proper linking to the Hugging Face paper page (VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-Tuning) and the GitHub repository (https://github.com/QiWang98/VideoRFT).
These updates will make the model more discoverable and easier to use for the community.
QiWang98
changed pull request status to
merged