Image-to-Text
Transformers
PyTorch
English
Geo-Localization
kevinloeffler commited on
Commit
8c0a182
·
verified ·
1 Parent(s): aad006a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +40 -1
README.md CHANGED
@@ -8,4 +8,43 @@ base_model:
8
  - openai/clip-vit-large-patch14
9
  tags:
10
  - Geo-Localization
11
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  - openai/clip-vit-large-patch14
9
  tags:
10
  - Geo-Localization
11
+ ---
12
+ # ReGeo – A Direct Regression Approach for Global Image Geo-Localization
13
+
14
+ This paper presents a novel approach to Geo-Localization, a task
15
+ that aims to predict geographic coordinates, i.e., latitude and
16
+ longitude of an image based on its visual content. Traditional
17
+ methods in this domain often rely on databases,
18
+ complex pipelines or large-scale image classification networks.
19
+ In contrast, we propose a direct regression approach that
20
+ simplifies the process by predicting the geographic coordinates
21
+ directly from the image features. We leverage a pre-trained
22
+ Vision Transformer (ViT) model, specifically a pre-trained CLIP
23
+ model, for feature extraction and introduce a regression head
24
+ for coordinate prediction. Various configurations, including pre-
25
+ training and task-specific adaptations, are tested and evaluated
26
+ resulting in our model called ReGeo. Experimental results show
27
+ that ReGeo offers competitive performance compared to existing
28
+ SOTA approaches, despite being simpler and needing minimal
29
+ supporting code pipelines.
30
+
31
+ - **Demo:** Coming soon
32
+
33
+
34
+ ## Model Details
35
+
36
+ - **Developed by:** Tobias Rothlin, tobias.rothlin@ost.ch
37
+ - **Supervisor:** Mitra Purandare, mitra.purandare@ost.ch
38
+ - **Model Card author:** Kevin Löffler, kevin.loeffler@ost.ch
39
+
40
+
41
+ ## How to Get Started with the Model
42
+
43
+ Example inference:
44
+
45
+ ```
46
+ # todo
47
+ ```
48
+
49
+ [More Information Needed]
50
+