Add comprehensive model card for LEGO (MV-ScanQA, TripAlign)

#1
by nielsr HF Staff - opened

This PR adds a comprehensive model card for the LEGO model, presented in the paper "Advancing 3D Scene Understanding with MV-ScanQA Multi-View Reasoning Evaluation and TripAlign Pre-training Dataset".

It includes:

  • The appropriate pipeline_tag: image-text-to-text, allowing users to find the model easily on the Hub.
  • The library_name: transformers, indicating compatibility with the 🤗 Transformers library.
  • The license: CC-BY-4.0 as specified in the repository.
  • Links to the paper, project page, and the official GitHub repository for more detailed information and code.
  • An overview of the model and its capabilities based on the paper abstract and GitHub README.
  • A sample Python usage example demonstrating how to load and use the LoRA adapter with its Fuyu base model.
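
For reviewers, the metadata and the loading step listed above can be sketched roughly as follows. This is an illustrative sketch, not the exact content of the model card: the helper names `build_front_matter` and `load_lego` are hypothetical, the base checkpoint `adept/fuyu-8b` is an assumption (the PR only says "Fuyu base model"), the adapter repository ID is left as a parameter because it is not stated here, and the lowercase license identifier `cc-by-4.0` is assumed to be the Hub's spelling of CC-BY-4.0.

```python
# Metadata fields named in this PR, in the order listed above.
CARD_METADATA = {
    "pipeline_tag": "image-text-to-text",
    "library_name": "transformers",
    "license": "cc-by-4.0",  # assumed Hub identifier for CC-BY-4.0
}


def build_front_matter(metadata):
    """Render simple key/value metadata as a YAML front-matter block."""
    lines = ["---"]
    lines += [f"{key}: {value}" for key, value in metadata.items()]
    lines.append("---")
    return "\n".join(lines) + "\n"


def load_lego(adapter_id, base_id="adept/fuyu-8b"):
    """Load a Fuyu base model and attach a LoRA adapter with PEFT.

    `adapter_id` is the Hub repository holding the LoRA weights (not
    specified in this PR); `base_id` is an assumed base checkpoint.
    """
    # Imported lazily so the metadata helper works without these packages.
    from peft import PeftModel
    from transformers import FuyuForCausalLM, FuyuProcessor

    processor = FuyuProcessor.from_pretrained(base_id)
    base_model = FuyuForCausalLM.from_pretrained(base_id)
    # Wrap the base model with the adapter weights.
    model = PeftModel.from_pretrained(base_model, adapter_id)
    return processor, model.eval()


if __name__ == "__main__":
    print(build_front_matter(CARD_METADATA))
```

Calling `load_lego` downloads the full base checkpoint, so it is kept out of the `__main__` path; only the front-matter preview runs by default.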

Please review and merge this PR if everything looks good.