File size: 3,829 Bytes
bfc2dc5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8e3e846
 
 
 
bfc2dc5
 
 
8e3e846
 
bfc2dc5
 
 
 
 
 
 
 
8e3e846
 
bfc2dc5
 
 
 
 
 
 
 
 
8e3e846
bfc2dc5
 
 
 
 
8e3e846
bfc2dc5
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
---
license: apache-2.0
tags:
  - image-classification
  - object-detection
  - instance-segmentation
  - medical-image-segmentation
  - graph-neural-network
  - mobile
  - vision-gnn
  - pytorch
datasets:
  - imagenet-1k
  - coco
  - kvasir-seg
  - dsb2018
metrics:
  - top_1_accuracy
  - mean_average_precision
  - dice_score
  - hausdorff_distance
library_name: pytorch
pipeline_tag: image-classification
---

# GeoViG: Geometry-Aware Graph Reasoning for Mobile Vision Tasks

## Model Variants

### πŸ–ΌοΈ Image Classification β€” ImageNet-1K

| Model | Params (M) | MACs (G) | Top-1 Acc (%) | Checkpoint (pth) |
|:---|:---:|:---:|:---:|:---:|
| GeoViG-Ti | 3.5 | 0.9 | 75.2 | [Download](https://huggingface.co/OmarAlasqa/GeoViG/blob/main/pth/geovig_ti_5e4_8G_300_75_22/checkpoint.pth) |
| GeoViG-S | 5.0 | 1.2 | 77.5 | [Download](https://huggingface.co/OmarAlasqa/GeoViG/blob/main/pth/geovig_s_5e4_8G_300_77_48/checkpoint.pth) |
| GeoViG-M | 10.3 | 2.2 | 80.7 | [Download](https://huggingface.co/OmarAlasqa/GeoViG/blob/main/pth/geovig_m_5e4_8G_300_80_70/checkpoint.pth) |
| GeoViG-B | 19.7 | 4.5 | 82.4 | [Download](https://huggingface.co/OmarAlasqa/GeoViG/blob/main/pth/geovig_b_5e4_8G_300_82_38/checkpoint.pth) |

Trained for 300 epochs on 8Γ— NVIDIA A100 GPUs, batch size 1024, AdamW optimizer (lr=5e-4, weight decay=0.05), cosine schedule with 5-epoch warmup. Augmentations: RandAugment, Mixup (p=0.8), CutMix (p=1.0).

For CoreML checkpoint, please check: https://huggingface.co/OmarAlasqa/GeoViG/tree/main/CoreML
For IPA chechpoints, please check: https://huggingface.co/OmarAlasqa/GeoViG/tree/main/IPA
---

### πŸ“¦ Object Detection & Instance Segmentation β€” MS COCO 2017

Backbone used with **Mask R-CNN**, 1Γ— schedule (12 epochs), pretrained on ImageNet-1K.

| Backbone | Params (M) | Box AP | Box APβ‚…β‚€ | Box AP₇₅ | Mask AP | Mask APβ‚…β‚€ | Mask AP₇₅ | Checkpoint |
|:---|:---:|:---:|:---:|:---:|:---:|:---:|:---:|:---:|
| GeoViG-M | 10.3 | 40.7 | 62.4 | 44.1 | 37.7 | 59.6 | 40.5 | [Download](https://huggingface.co/OmarAlasqa/GeoViG/blob/main/coco_det_seg_pth/geovig_m_det_seg/epoch_12.pth) |
| GeoViG-B | 19.7 | 42.5 | 64.0 | 46.8 | 38.9 | 61.2 | 41.7 | [Download](https://huggingface.co/OmarAlasqa/GeoViG/blob/main/coco_det_seg_pth/geovig_b_det_seg/epoch_12.pth) |

---

### πŸ₯ Medical Image Segmentation

**Kvasir-SEG β€” Polyp Segmentation**

| Model | Params (M) | mAP | Dice ↑ | IoU ↑ | Hausdorff Dist ↓ | Checkpoint |
|:---|:---:|:---:|:---:|:---:|:---:|:---:|
| GeoViG-M | 29.57 | 0.990 | **0.945** | **0.909** | **12.94** | [Download](https://huggingface.co/OmarAlasqa/GeoViG/blob/main/medical/kvasir_geovig_m/checkpoint.pth) |

**Data Science Bowl 2018 β€” Nuclei Segmentation**

| Model | Params (M) | mAP | Dice ↑ | IoU ↑ | Hausdorff Dist ↓ | Checkpoint |
|:---|:---:|:---:|:---:|:---:|:---:|:---:|
| GeoViG-M | 29.57 | 0.859 | **0.908** | **0.839** | **5.19** | [Download](https://huggingface.co/OmarAlasqa/GeoViG/blob/main/medical/dsb_geovig_m/checkpoint.pth) |

---

## πŸš€ Usage

Please check the github repo: https://github.com/OmarAlsaqa/GeoViG

---

## Citation

If you use GeoViG in your research, please cite:

```bibtex
@article{alsaqa2026geovig,
  title     = {GeoViG: Geometry-Aware Graph Reasoning for Mobile Vision Tasks in Natural and Medical Images},
  author    = {Alsaqa, Omar and Mohammed, Emad and Aleem, Saiqa},
  journal   = {Under Review at IEEE EMBC},
  year      = {2026}
}
```

---

## Acknowledgements

This work builds upon [MobileViG](https://github.com/SLDGroup/MobileViG) and uses the [MMDetection](https://github.com/open-mmlab/mmdetection) framework for detection and segmentation experiments. Training was performed on the Compute Canada A100 cluster.

---

## License

This project is released under the [Apache 2.0 License](LICENSE).