Improve model card: Add metadata, links, and usage for GS-Reasoner

by nielsr HF Staff - opened Oct 17, 2025

←

This PR significantly enhances the model card for GS-Reasoner by:

Adding pipeline_tag: image-text-to-text to improve discoverability on the Hub for models performing 3D visual grounding and spatial reasoning.
Specifying library_name: transformers to enable the automated "Use in Transformers" widget, as evidenced by LlavaQwenForCausalLM architecture, LlavaProcessor, Qwen2Tokenizer, and the usage of LlavaAgent within the project.
Including a direct link to the paper on Hugging Face: Reasoning in Space via Grounding in the World.
Adding links to the project page (https://yiming-cc.github.io/gs-reasoner/) and the GitHub repository (https://github.com/WU-CVGL/GS-Reasoner).
Providing a detailed model description based on the paper's abstract.
Including information about available model weights.
Adding a "Sample Usage" section with a Python code snippet extracted from the GitHub project's implied usage, demonstrating how to load and use the model with LlavaAgent.
Adding a BibTeX citation for the paper.

These additions will improve the model's discoverability, usability, and documentation on the Hugging Face Hub.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment