GreatBird/ViTP
Task: Image Feature Extraction
Tags: remote-sensing, medical-imaging, vision-transformer
arXiv: 2509.17562
License: cc-by-nc-4.0
refs/pr/1
ViTP/pretrain_data/annotations/general_ann
Size: 1.32 GB
Contributors: 2
History: 1 commit
Latest commit: af20dda by GreatBird, "Upload 125 files" (verified), 4 months ago
Files (each last changed by "Upload 125 files", 4 months ago; "xet" marks files shown with the xet badge):

ai2d_train_12k.jsonl  4.08 MB
chartqa_train_18k.jsonl  9.31 MB
docvqa_train_10k.jsonl  14.2 MB  xet
dvqa_train_200k.jsonl  421 MB  xet
fit_rs_vqa_100k.jsonl  28.9 MB  xet
geoqa+.jsonl  28.1 MB  xet
sharegpt4v_instruct_gpt4-vision_cap100k.jsonl  119 MB  xet
sharegpt4v_mix665k_cap23k_coco-ap9k_lcs3k_sam9k_div2k_novg.jsonl  651 MB  xet
synthdog_en.jsonl  16.8 MB  xet
vqa_rgb_rsvqahr_train_instruct_100k.jsonl  24.5 MB  xet
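
The files in this directory all carry the .jsonl extension, so they are presumably JSON Lines files with one JSON value per line. The listing does not show their schema, so the sketch below makes no assumption about field names. It only shows one way to stream such a file and look at the first few records without loading hundreds of megabytes into memory. The helper names iter_jsonl and peek are illustrative and are not part of the repository.

```python
import json
from itertools import islice
from pathlib import Path
from typing import Any, Iterator, List, Union


def iter_jsonl(path: Union[str, Path]) -> Iterator[Any]:
    """Yield one parsed JSON value per non-empty line of a JSON Lines file."""
    with open(path, "r", encoding="utf-8") as handle:
        for line_number, line in enumerate(handle, start=1):
            line = line.strip()
            if not line:
                continue  # tolerate blank lines between records
            try:
                yield json.loads(line)
            except json.JSONDecodeError as exc:
                raise ValueError(
                    f"{path}: invalid JSON on line {line_number}"
                ) from exc


def peek(path: Union[str, Path], limit: int = 3) -> List[Any]:
    """Return at most `limit` records from the start of the file."""
    return list(islice(iter_jsonl(path), limit))
```

For example, after downloading ai2d_train_12k.jsonl to a local path of your choosing, peek("ai2d_train_12k.jsonl") returns its first three records, which is usually enough to see what fields the annotations contain. Streaming line by line matters most for the larger files such as dvqa_train_200k.jsonl (421 MB).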