OSTswiss
/

ReGeo

 - openai/clip-vit-large-patch14
 tags:
 - Geo-Localization
+---
+# ReGeo – A Direct Regression Approach for Global Image Geo-Localization
+This paper presents a novel approach to Geo-Localization, a task
+that aims to predict geographic coordinates, i.e., latitude and
+longitude of an image based on its visual content. Traditional
+methods in this domain often rely on databases,
+complex pipelines or large-scale image classification networks.
+In contrast, we propose a direct regression approach that
+simplifies the process by predicting the geographic coordinates
+directly from the image features. We leverage a pre-trained
+Vision Transformer (ViT) model, specifically a pre-trained CLIP
+model, for feature extraction and introduce a regression head
+for coordinate prediction. Various configurations, including pre-
+training and task-specific adaptations, are tested and evaluated
+resulting in our model called ReGeo. Experimental results show
+that ReGeo offers competitive performance compared to existing
+SOTA approaches, despite being simpler and needing minimal
+supporting code pipelines.
+- **Demo:** Coming soon
+## Model Details
+- **Developed by:** Tobias Rothlin, tobias.rothlin@ost.ch
+- **Supervisor:** Mitra Purandare, mitra.purandare@ost.ch
+- **Model Card author:** Kevin Löffler, kevin.loeffler@ost.ch
+## How to Get Started with the Model
+Example inference:
+```
+# todo
+```
+[More Information Needed]