---
pipeline_tag: visual-document-retrieval
library_name: pytorch
license: mit
tags:
- visual-place-recognition
- image-retrieval
- arxiv:2502.17237
---

# MegaLoc

MegaLoc is an image retrieval model for visual place recognition (VPR) that achieves state-of-the-art results on most VPR datasets, spanning both indoor and outdoor environments.
**Paper:** [MegaLoc: One Retrieval to Place Them All](https://arxiv.org/abs/2502.17237) (CVPR 2025 Workshop)

**GitHub:** [gmberton/MegaLoc](https://github.com/gmberton/MegaLoc)
## Usage

```python
import torch

model = torch.hub.load("gmberton/MegaLoc", "get_trained_model")
model.eval()

# Extract a descriptor from an image
image = torch.randn(1, 3, 322, 322)  # [B, 3, H, W] - any size works
with torch.no_grad():
    descriptor = model(image)  # [B, 8448] L2-normalized descriptor
```

For benchmarking on VPR datasets, see [VPR-methods-evaluation](https://github.com/gmberton/VPR-methods-evaluation).
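Because the descriptors are L2-normalized, retrieval reduces to a dot-product (cosine) similarity search over the database. A minimal sketch of that step, using random stand-in descriptors (in practice both sets would come from `model(image)` as shown above; the tensor names and database size here are illustrative, not part of the model's API):

```python
import torch

dim = 8448  # MegaLoc descriptor dimensionality

# Stand-in descriptors; replace with real model outputs for actual retrieval.
database = torch.nn.functional.normalize(torch.randn(1000, dim), dim=1)
queries = torch.nn.functional.normalize(torch.randn(4, dim), dim=1)

# For L2-normalized vectors, the dot product equals cosine similarity.
similarities = queries @ database.T  # [num_queries, num_database]
top_k = similarities.topk(k=5, dim=1)
# top_k.indices holds, for each query, the 5 best-matching database images.
```

For large databases (e.g. the 2.8M-image SF-XL set below), an approximate nearest-neighbor index such as FAISS would typically replace the brute-force matrix product.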
## Qualitative Examples

Top-1 retrieved images from the SF-XL test set (2.8M database images):
## Citation

```bibtex
@InProceedings{Berton_2025_CVPR,
    author    = {Berton, Gabriele and Masone, Carlo},
    title     = {MegaLoc: One Retrieval to Place Them All},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2025},
    pages     = {2861-2867}
}
```