Improve model card: Add pipeline tag, library name, links, and description

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +18 -4
README.md CHANGED
@@ -1,7 +1,21 @@
1
  ---
2
- license: apache-2.0
3
- language:
4
- - en
5
  base_model:
6
  - Qwen/Qwen2.5-VL-7B-Instruct
7
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
 
 
2
  base_model:
3
  - Qwen/Qwen2.5-VL-7B-Instruct
4
+ language:
5
+ - en
6
+ license: apache-2.0
7
+ pipeline_tag: image-text-to-text
8
+ library_name: transformers
9
+ ---
10
+
11
+ # GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
12
+
13
+ This repository hosts **GeoVista**, an agentic model designed for web-augmented visual reasoning, primarily focused on the geolocalization task. GeoVista leverages deep multimodal understanding by integrating tool invocation, such as an image-zoom-in tool and a web-search tool, within its reasoning loop. This allows the model to perform nuanced visual grounding and to confirm or refine hypotheses using external web information.
14
+
15
+ GeoVista was introduced in the paper:
16
+ [**GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization**](https://huggingface.co/papers/2511.15705)
17
+
18
+ ## Links
19
+
20
+ - **Project Page:** [https://ekonwang.github.io/geo-vista/](https://ekonwang.github.io/geo-vista/)
21
+ - **Code Repository:** [https://github.com/ekonwang/GeoVista](https://github.com/ekonwang/GeoVista)