xinyiW915/DIVA-VQA

Visual Question Answering
deep-learning
vision
VQA
Transformer
CNN
DIVA-VQA/src/extractor
34.7 kB
  • 2 contributors
History: 1 commit
Xinyi Wang, Initial commit, 3bc966f, 11 months ago
  • __init__.py, 48 Bytes, Initial commit, 11 months ago
  • extract_rf_feats.py, 14.8 kB, Initial commit, 11 months ago
  • extract_rf_subsampling.py, 13.8 kB, Initial commit, 11 months ago
  • extract_slowfast_clip.py, 4.6 kB, Initial commit, 11 months ago
  • extract_swint_clip.py, 1.44 kB, Initial commit, 11 months ago