	---
	license: mit
	---

	# ImAge

	ImAge is an implicit aggregation method that produces robust global image descriptors for visual place recognition (VPR). It neither modifies the backbone nor requires a separate aggregator, and it outperforms previous state-of-the-art methods on several VPR benchmarks.

	Paper: [Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era](https://arxiv.org/pdf/2511.06024) (NeurIPS 2025)

	GitHub: [Lu-Feng/ImAge](https://github.com/Lu-Feng/ImAge)

	## Usage

	```python
	import torch
	model = torch.hub.load("Lu-Feng/ImAge", "ImAge")
	model.eval()

	# Extract a descriptor from an image (random tensor used as a stand-in)
	image = torch.randn(1, 3, 322, 322)  # [B, 3, H, W]
	with torch.no_grad():
	    descriptor = model(image)  # [B, 6144] L2-normalized descriptor
	```
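
Because the descriptors are L2-normalized, place retrieval reduces to a dot product between query and database descriptors. The sketch below illustrates this with NumPy on random stand-in data; the helper name `retrieve_top_k` and the synthetic database are assumptions for illustration, not part of the ImAge repository. Descriptors produced by the model could be converted with `descriptor.cpu().numpy()` before being passed in.

```python
import numpy as np


def retrieve_top_k(query_desc, db_desc, k=5):
    """Return indices and cosine similarities of the k best database matches.

    Hypothetical helper: assumes both arrays hold L2-normalized descriptors,
    shaped [Q, D] and [N, D], so the dot product equals cosine similarity.
    """
    sims = query_desc @ db_desc.T                        # [Q, N]
    k = min(k, db_desc.shape[0])
    top_idx = np.argsort(-sims, axis=1)[:, :k]           # best match first
    top_sims = np.take_along_axis(sims, top_idx, axis=1)
    return top_idx, top_sims


# Synthetic stand-in for a database of 100 descriptors of dimension 6144.
rng = np.random.default_rng(0)
db = rng.standard_normal((100, 6144)).astype(np.float32)
db /= np.linalg.norm(db, axis=1, keepdims=True)

# Queries are noisy copies of database entries 3 and 42, re-normalized.
noise = 0.01 * rng.standard_normal((2, 6144)).astype(np.float32)
queries = db[[3, 42]] + noise
queries /= np.linalg.norm(queries, axis=1, keepdims=True)

idx, sims = retrieve_top_k(queries, db, k=5)
print(idx[:, 0])  # top-1 match per query; should recover entries 3 and 42
```

For large databases, a dedicated nearest-neighbor index is usually preferable to a dense similarity matrix; the brute-force version here is only meant to show the comparison itself.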

	## Citation

	```bibtex
	@inproceedings{ImAge,
	title={Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era},
	author={Feng Lu and Tong Jin and Canming Ye and Xiangyuan Lan and Yunpeng Liu and Chun Yuan},
	booktitle={The Annual Conference on Neural Information Processing Systems},
	year={2025}
	}
	```