OpenGVLab/VideoChat-Flash-Qwen2_5-7B_InternVideo2-1B

Video-Text-to-Text
Transformers
Safetensors
English
videochat_flash_qwen
feature-extraction
multimodal
custom_code
Eval Results (legacy)

Instructions to use OpenGVLab/VideoChat-Flash-Qwen2_5-7B_InternVideo2-1B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use OpenGVLab/VideoChat-Flash-Qwen2_5-7B_InternVideo2-1B with Transformers:

    # Load model directly
    from transformers import AutoModel
    model = AutoModel.from_pretrained("OpenGVLab/VideoChat-Flash-Qwen2_5-7B_InternVideo2-1B", trust_remote_code=True, dtype="auto")

The weight names for vision_tower in the safetensors file and in the model do not match.

1
#2 opened 3 months ago by xf2022
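
A mismatch like the one reported in this discussion can be narrowed down by comparing the parameter names stored in the checkpoint with the names the instantiated model expects. The following is a minimal sketch, not the maintainers' method: the helper name diff_weight_names, the shard filename in the comments, and the example key names are all hypothetical and were not taken from the real checkpoint.

```python
def diff_weight_names(checkpoint_keys, model_keys, prefix="vision_tower"):
    """Compare two collections of parameter names, restricted to names
    that contain `prefix`, and report which names appear on only one side."""
    ckpt = {k for k in checkpoint_keys if prefix in k}
    model = {k for k in model_keys if prefix in k}
    return {
        "only_in_checkpoint": sorted(ckpt - model),
        "only_in_model": sorted(model - ckpt),
        "shared": sorted(ckpt & model),
    }


# With a real checkpoint, the two key lists could be gathered roughly like this
# (the shard filename is a placeholder; a sharded checkpoint needs every shard):
#
#   from safetensors import safe_open
#   with safe_open("model-shard.safetensors", framework="pt") as f:
#       checkpoint_keys = list(f.keys())
#   model_keys = list(model.state_dict().keys())

# Hypothetical names for illustration only.
example_checkpoint = ["model.vision_tower.encoder.w", "model.layers.0.w"]
example_model = ["model.vision_tower.vision_tower.encoder.w", "model.layers.0.w"]

report = diff_weight_names(example_checkpoint, example_model)
for side, names in report.items():
    print(side, names)
```

Names listed under only_in_checkpoint would not be loaded into the model, and names under only_in_model would keep whatever values they had at initialization, so either list being non-empty is worth reporting in the discussion thread.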