Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Aliayub1995
/
VideoLLaMA2-7B

Visual Question Answering
Transformers
Safetensors
English
videollama2_mistral
text-generation
multimodal large language model
large video-language model
Model card Files Files and versions
xet
Community
1
VideoLLaMA2-7B / videollama2
5.73 MB
  • 2 contributors
History: 8 commits
Aliayub1995's picture
Aliayub1995
Update videollama2/mm_utils.py
7b8a2ab verified over 1 year ago
  • eval
    Upload 52 files over 1 year ago
  • model
    Upload 52 files over 1 year ago
  • serve
    Upload 52 files over 1 year ago
  • __init__.py
    4.9 kB
    Update videollama2/__init__.py over 1 year ago
  • constants.py
    649 Bytes
    Upload 52 files over 1 year ago
  • conversation.py
    21.1 kB
    Upload 52 files over 1 year ago
  • mm_utils.py
    15.2 kB
    Update videollama2/mm_utils.py over 1 year ago
  • train.py
    24.3 kB
    Upload 52 files over 1 year ago
  • train_flash_attn.py
    478 Bytes
    Upload 52 files over 1 year ago
  • utils.py
    4 kB
    Upload 52 files over 1 year ago
  • videollama2_trainer.py
    16.2 kB
    Upload 52 files over 1 year ago