dmusingu commited on
Commit
6d2cc7f
·
verified ·
1 Parent(s): 693c866

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +28 -0
README.md ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - chest-xray
4
+ - radiology
5
+ - visual-question-answering
6
+ - mimic-cxr
7
+ license: apache-2.0
8
+ ---
9
+
10
+ # LAPVQA — VQA (Native / End-to-end)
11
+
12
+ Part of the [LAPVQA collection](https://huggingface.co/collections/dmusingu/lapvqa).
13
+
14
+ ## Description
15
+
16
+ VQA task heads trained with **end-to-end fine-tuning** — the encoder weights are
17
+ updated jointly with the task head, providing a baseline for how much improvement
18
+ domain adaptation yields over the frozen-encoder setup in [`lapvqa-vqa`](https://huggingface.co/dmusingu/lapvqa-vqa).
19
+
20
+ ## Files
21
+
22
+ | File | Encoder backbone |
23
+ |---|---|
24
+ | `clip-vit-l14_best.pt` | CLIP ViT-L/14 (fine-tuned) |
25
+ | `siglip_best.pt` | SigLIP (fine-tuned) |
26
+ | `florence2_best.pt` | Florence-2 (fine-tuned) |
27
+ | `coca_best.pt` | CoCa (fine-tuned) |
28
+ | `mae-vit-l16_best.pt` | MAE ViT-L/16 (fine-tuned) |