Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
xinyiW915
/
DIVA-VQA
like
1
Visual Question Answering
5 datasets
deep-learning
vision
VQA
Transformer
CNN
arxiv:
2508.10605
arxiv:
2407.11496
License:
mit
Model card
Files
Files and versions
xet
Community
main
DIVA-VQA
/
src
217 kB
2 contributors
History:
1 commit
Xinyi Wang
Initial commit
3bc966f
11 months ago
data_processing
Initial commit
11 months ago
extractor
Initial commit
11 months ago
utils
Initial commit
11 months ago
correlation_result.ipynb
Safe
46.6 kB
Initial commit
11 months ago
main_diva-vqa.py
Safe
10.2 kB
Initial commit
11 months ago
model_fine_tune.py
Safe
15 kB
Initial commit
11 months ago
model_regression.py
Safe
34 kB
Initial commit
11 months ago
model_regression_simple.py
Safe
33 kB
Initial commit
11 months ago
test_demo.py
Safe
7.65 kB
Initial commit
11 months ago
test_demo_time_flops.py
Safe
8.29 kB
Initial commit
11 months ago