keeeeenw/MicroLlava


keeeeenw committed on Aug 18, 2025

Commit 8781fa4 · verified · 1 Parent(s): f4e9991

Update README.md

Files changed (1)

README.md +27 -1


@@ -11,6 +11,31 @@ pipeline_tag: visual-question-answering
 license: apache-2.0
 base_model:
 - keeeeenw/MicroLlama
+model-index:
+  - name: MicroLLaVA (MicroLLaMA 300M + SigLIP2-so400m-patch4-384)
+    results:
+      - task:
+          type: visual-question-answering
+          name: VQAv2
+        dataset:
+          name: VQAv2
+          type: vqav2
+        metrics:
+          - name: Overall Accuracy
+            type: accuracy
+            value: 56.91
+          - name: Yes/No Accuracy
+            type: accuracy
+            value: 72.32
+          - name: Number Accuracy
+            type: accuracy
+            value: 43.89
+          - name: Other Accuracy
+            type: accuracy
+            value: 46.65
+        source:
+          name: Internal Evaluation on VQAv2 test-dev
+          url: https://visualqa.org/download.html
 ---
 # MicroLLaVA
@@ -178,4 +203,5 @@ This work builds upon the efforts of many in the open-source AI community:
 - **[`keeeeenw/MicroLlama`](https://huggingface.co/keeeeenw/MicroLlama)** I am also the creator of MicroLlama. Please help support my work!
 - **SigLIP2** authors for the efficient vision encoder architecture
 - Contributors to **LAION-CC-SBU-558K** and other datasets used in pretraining and finetuning
-- The Hugging Face ecosystem for hosting, tools, and community support
+- The Hugging Face ecosystem for hosting, tools, and community support
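
For anyone who wants to sanity-check the `model-index` block added in this commit, here is a minimal sketch that pulls the front matter out of a README string and lists the metric values. It uses only the standard library and a line-based regex, so it is not a real YAML parser and only works for simple front matter like the one in this diff. The names `README`, `front_matter`, and `metric_values` are hypothetical helpers for illustration, not part of the repository.

```python
import re

# Copy of the front matter fields touched by this commit (other fields omitted).
README = """\
---
license: apache-2.0
base_model:
- keeeeenw/MicroLlama
model-index:
  - name: MicroLLaVA (MicroLLaMA 300M + SigLIP2-so400m-patch4-384)
    results:
      - task:
          type: visual-question-answering
          name: VQAv2
        dataset:
          name: VQAv2
          type: vqav2
        metrics:
          - name: Overall Accuracy
            type: accuracy
            value: 56.91
          - name: Yes/No Accuracy
            type: accuracy
            value: 72.32
          - name: Number Accuracy
            type: accuracy
            value: 43.89
          - name: Other Accuracy
            type: accuracy
            value: 46.65
        source:
          name: Internal Evaluation on VQAv2 test-dev
          url: https://visualqa.org/download.html
---
# MicroLLaVA
"""


def front_matter(text: str) -> str:
    """Return the text between the first two '---' delimiter lines, or ''."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return ""
    for i, line in enumerate(lines[1:], start=1):
        if line.strip() == "---":
            return "\n".join(lines[1:i])
    return ""


def metric_values(yaml_text: str) -> dict[str, float]:
    """Pair each 'value: N' line with the most recent 'name: X' line above it.

    Assumption: every metric lists its name before its value, as in this diff.
    """
    values: dict[str, float] = {}
    last_name = None
    for line in yaml_text.splitlines():
        name_match = re.match(r"\s*(?:-\s+)?name:\s*(.+)", line)
        if name_match:
            last_name = name_match.group(1).strip()
            continue
        value_match = re.match(r"\s*value:\s*([0-9.]+)", line)
        if value_match and last_name is not None:
            values[last_name] = float(value_match.group(1))
    return values


if __name__ == "__main__":
    for name, value in metric_values(front_matter(README)).items():
        print(f"{name}: {value}")
```

For anything beyond a quick check, a proper YAML library or the Hub's own model card tooling would be the safer choice, since this sketch ignores nesting entirely.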