Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
xinyiW915
/
CAMP-VQA
like
1
Visual Question Answering
8 datasets
deep-learning
vision
VQA
Transformer
CNN
arxiv:
2511.07290
arxiv:
2407.11496
License:
mit
Model card
Files
Files and versions
xet
Community
main
CAMP-VQA
/
src
/
extractor
46.1 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
Xinyi Wang
initial commit
b9b1b10
6 months ago
__init__.py
Safe
48 Bytes
initial commit
6 months ago
extract_clip_embeds.py
Safe
7.33 kB
initial commit
6 months ago
extract_clip_embeds_ablation.py
Safe
11 kB
initial commit
6 months ago
extract_frag.py
Safe
15 kB
initial commit
6 months ago
extract_frame_info.py
Safe
8.59 kB
initial commit
6 months ago
extract_slowfast_clip.py
Safe
2.72 kB
initial commit
6 months ago
extract_swint_clip.py
Safe
1.45 kB
initial commit
6 months ago