OmarAlasqa commited on
Commit
bfc2dc5
Β·
1 Parent(s): 930ad56

Restore README from HF

Browse files
Files changed (1) hide show
  1. README.md +101 -0
README.md ADDED
@@ -0,0 +1,101 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - image-classification
5
+ - object-detection
6
+ - instance-segmentation
7
+ - medical-image-segmentation
8
+ - graph-neural-network
9
+ - mobile
10
+ - vision-gnn
11
+ - pytorch
12
+ datasets:
13
+ - imagenet-1k
14
+ - coco
15
+ - kvasir-seg
16
+ - dsb2018
17
+ metrics:
18
+ - top_1_accuracy
19
+ - mean_average_precision
20
+ - dice_score
21
+ - hausdorff_distance
22
+ library_name: pytorch
23
+ pipeline_tag: image-classification
24
+ ---
25
+
26
+ # GeoViG: Geometry-Aware Graph Reasoning for Mobile Vision Tasks
27
+
28
+ ## Model Variants
29
+
30
+ ### πŸ–ΌοΈ Image Classification β€” ImageNet-1K
31
+
32
+ | Model | Params (M) | MACs (G) | Top-1 Acc (%) | Checkpoint (pth) |
33
+ |:---|:---:|:---:|:---:|:---:|
34
+ | GeoViG-Ti | 3.5 | 0.9 | 75.2 | [Download](https://huggingface.co/TODO/geovig/resolve/main/pth/geovig_ti.pth) |
35
+ | GeoViG-S | 5.0 | 1.2 | 77.5 | [Download](https://huggingface.co/TODO/geovig/resolve/main/pth/geovig_s.pth) |
36
+ | GeoViG-M | 10.3 | 2.2 | 80.7 | [Download](https://huggingface.co/TODO/geovig/resolve/main/pth/geovig_m.pth) |
37
+ | GeoViG-B | 19.7 | 4.5 | 82.4 | [Download](https://huggingface.co/TODO/geovig/resolve/main/pth/geovig_b.pth) |
38
+
39
+ Trained for 300 epochs on 8Γ— NVIDIA A100 GPUs, batch size 1024, AdamW optimizer (lr=5e-4, weight decay=0.05), cosine schedule with 5-epoch warmup. Augmentations: RandAugment, Mixup (p=0.8), CutMix (p=1.0).
40
+
41
+ For CoreML checkpoint, please check:
42
+ For IPA chechpoints, please check:
43
+ ---
44
+
45
+ ### πŸ“¦ Object Detection & Instance Segmentation β€” MS COCO 2017
46
+
47
+ Backbone used with **Mask R-CNN**, 1Γ— schedule (12 epochs), pretrained on ImageNet-1K.
48
+
49
+ | Backbone | Params (M) | Box AP | Box APβ‚…β‚€ | Box AP₇₅ | Mask AP | Mask APβ‚…β‚€ | Mask AP₇₅ | Checkpoint |
50
+ |:---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
51
+ | GeoViG-M | 10.3 | 40.7 | 62.4 | 44.1 | 37.7 | 59.6 | 40.5 | [Download](https://huggingface.co/TODO/geovig/resolve/main/detection/geovig_m_coco.pth) |
52
+ | GeoViG-B | 19.7 | 42.5 | 64.0 | 46.8 | 38.9 | 61.2 | 41.7 | [Download](https://huggingface.co/TODO/geovig/resolve/main/detection/geovig_b_coco.pth) |
53
+
54
+ ---
55
+
56
+ ### πŸ₯ Medical Image Segmentation
57
+
58
+ **Kvasir-SEG β€” Polyp Segmentation**
59
+
60
+ | Model | Params (M) | mAP | Dice ↑ | IoU ↑ | Hausdorff Dist ↓ | Checkpoint |
61
+ |:---|:---:|:---:|:---:|:---:|:---:|:---:|
62
+ | GeoViG-M | 29.57 | 0.990 | **0.945** | **0.909** | **12.94** | [Download](https://huggingface.co/TODO/geovig/resolve/main/medical/geovig_m_kvasir.pth) |
63
+
64
+ **Data Science Bowl 2018 β€” Nuclei Segmentation**
65
+
66
+ | Model | Params (M) | mAP | Dice ↑ | IoU ↑ | Hausdorff Dist ↓ | Checkpoint |
67
+ |:---|:---:|:---:|:---:|:---:|:---:|:---:|
68
+ | GeoViG-M | 29.57 | 0.859 | **0.908** | **0.839** | **5.19** | [Download](https://huggingface.co/TODO/geovig/resolve/main/medical/geovig_m_dsb2018.pth) |
69
+
70
+ ---
71
+
72
+ ## πŸš€ Usage
73
+
74
+ Please check the github repo: https://github.com/OmarAlsaqa/GeoViG
75
+
76
+ ---
77
+
78
+ ## Citation
79
+
80
+ If you use GeoViG in your research, please cite:
81
+
82
+ ```bibtex
83
+ @article{alsaqa2026geovig,
84
+ title = {GeoViG: Geometry-Aware Graph Reasoning for Mobile Vision Tasks in Natural and Medical Images},
85
+ author = {Alsaqa, Omar and Mohammed, Emad and Aleem, Saiqa},
86
+ journal = {Under Review at IEEE EMBC},
87
+ year = {2026}
88
+ }
89
+ ```
90
+
91
+ ---
92
+
93
+ ## Acknowledgements
94
+
95
+ This work builds upon [MobileViG](https://github.com/SLDGroup/MobileViG) and uses the [MMDetection](https://github.com/open-mmlab/mmdetection) framework for detection and segmentation experiments. Training was performed on the Compute Canada A100 cluster.
96
+
97
+ ---
98
+
99
+ ## License
100
+
101
+ This project is released under the [Apache 2.0 License](LICENSE).