Add model card and metadata for DSR Suite model

#1
by nielsr HF Staff - opened

Hi! I'm Niels from the Hugging Face community science team. I've opened this PR to add a comprehensive model card and metadata for your DSR Suite model.

This PR:

  • Adds the video-text-to-text pipeline tag for better discoverability.
  • Adds the library_name: transformers tag as indicated by the config.json and tokenizer_config.json files, enabling automated code snippets.
  • Links the model to the paper "Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models".
  • Includes links to the project's GitHub repository and to the associated Hugging Face dataset and collection.
  • Provides an introduction summarizing the model's capabilities in dynamic spatial reasoning and details on its usage for evaluation.
  • Includes the correct BibTeX citation and acknowledgements.
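
For reference, the two metadata tags listed above live in the YAML block at the top of the model card's README.md. The sketch below shows roughly what that block looks like. It is a minimal illustration, not the exact card from this PR: `render_front_matter` is a hypothetical helper written for this example, and the real card may carry further fields (license, datasets, and so on) that are not shown here.

```python
def render_front_matter(metadata: dict) -> str:
    """Render a flat dict of string values as a YAML front matter block.

    Hypothetical helper for illustration only; it handles simple
    key/value pairs and does no YAML escaping.
    """
    lines = ["---"]
    for key, value in metadata.items():
        lines.append(f"{key}: {value}")
    lines.append("---")
    return "\n".join(lines) + "\n"


# Only the two tags named in this PR; other fields of the real card are omitted.
card_metadata = {
    "pipeline_tag": "video-text-to-text",
    "library_name": "transformers",
}

front_matter = render_front_matter(card_metadata)
print(front_matter)
```

The Hub reads these keys from the front matter, which is what makes the pipeline filter and the automated code snippets work.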

Please review and merge if this looks good to you!

zhousc changed pull request status to merged
ARC Lab, Tencent PCG org

Thanks very much for your suggestions!
