Improve model card: Add metadata, links, and usage for GS-Reasoner
#1
by
nielsr
HF Staff
- opened
This PR significantly enhances the model card for GS-Reasoner by:
- Adding
pipeline_tag: image-text-to-textto improve discoverability on the Hub for models performing 3D visual grounding and spatial reasoning. - Specifying
library_name: transformersto enable the automated "Use in Transformers" widget, as evidenced byLlavaQwenForCausalLMarchitecture,LlavaProcessor,Qwen2Tokenizer, and the usage ofLlavaAgentwithin the project. - Including a direct link to the paper on Hugging Face: Reasoning in Space via Grounding in the World.
- Adding links to the project page (https://yiming-cc.github.io/gs-reasoner/) and the GitHub repository (https://github.com/WU-CVGL/GS-Reasoner).
- Providing a detailed model description based on the paper's abstract.
- Including information about available model weights.
- Adding a "Sample Usage" section with a Python code snippet extracted from the GitHub project's implied usage, demonstrating how to load and use the model with
LlavaAgent. - Adding a BibTeX citation for the paper.
These additions will improve the model's discoverability, usability, and documentation on the Hugging Face Hub.