---
license: other
license_name: license
license_link: LICENSE
---

# WildCross: A Cross-Modal Large Scale Benchmark for Place Recognition and Metric Depth Estimation in Natural Environments

[**Joshua Knights**](https://scholar.google.com/citations?user=RxbGr2EAAAAJ&hl=en)1,2 · **Joseph Reid**1 · [**Mark Cox**](https://scholar.google.com/citations?user=Bk3UD4EAAAAJ&hl=en)1
[**Kaushik Roy**](https://bit0123.github.io/)1 · [**David Hall**](https://scholar.google.com/citations?user=dosODoQAAAAJ&hl=en)1 · [**Peyman Moghadam**](https://scholar.google.com.au/citations?user=QAVcuWUAAAAJ&hl=en)1,2

1 DATA61, CSIRO · 2 Queensland University of Technology
This repository contains pre-trained checkpoints for a variety of tasks on the WildCross benchmark.

![teaser](./teaser.png)

If you find this repository useful or use the WildCross dataset in your work, please cite us using the following:

```
@misc{knights2025wildcross,
      title={{WildCross: A Cross-Modal Large Scale Benchmark for Place Recognition and Metric Depth Estimation in Natural Environments}},
      author={Joshua Knights and Joseph Reid and Mark Cox and Kaushik Roy and David Hall and Peyman Moghadam},
      year={2025},
      eprint={xxxxxxxxx},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/xxxxxxxxxx},
}
```

## Download Instructions

Our dataset can be downloaded through the [**CSIRO Data Access Portal**](''). Detailed instructions for downloading the dataset can be found in the README file provided on the data access portal page.

## Training and Benchmarking

Here we provide pre-trained checkpoints for a variety of tasks on WildCross.

**Visual Place Recognition**

### Checkpoints

| Model | Checkpoint Folder |
|------------|------------|
| NetVLAD | [Link](https://huggingface.co/CSIRORobotics/WildCross/tree/main/VPR/NetVLAD) |
| MixVPR | [Link](https://huggingface.co/CSIRORobotics/WildCross/tree/main/VPR/MixVPR) |
| SALAD | [Link](https://huggingface.co/CSIRORobotics/WildCross/tree/main/VPR/SALAD) |
| BoQ | [Link](https://huggingface.co/CSIRORobotics/WildCross/tree/main/VPR/BoQ) |

**Cross Modal Place Recognition**

### Checkpoints

| Model | Checkpoint Folder |
|------------|------------|
| LIP-Loc (ResNet50) | [Link](https://huggingface.co/CSIRORobotics/WildCross/tree/main/crossmodal/resnet50) |
| LIP-Loc (DINOv2) | [Link](https://huggingface.co/CSIRORobotics/WildCross/tree/main/crossmodal/dinov2) |
| LIP-Loc (DINOv3) | [Link](https://huggingface.co/CSIRORobotics/WildCross/tree/main/crossmodal/dinov3) |

**Metric Depth Estimation**

### Checkpoints

| Model | Checkpoint File |
|------------|------------|
| DepthAnythingV2-vits | [Link](https://huggingface.co/CSIRORobotics/WildCross/resolve/main/DepthAnythingV2/finetuned/vits.pth) |
| DepthAnythingV2-vitb | [Link](https://huggingface.co/CSIRORobotics/WildCross/resolve/main/DepthAnythingV2/finetuned/vitb.pth) |
| DepthAnythingV2-vitl | [Link](https://huggingface.co/CSIRORobotics/WildCross/resolve/main/DepthAnythingV2/finetuned/vitl.pth) |

Instructions for using these checkpoints for training or evaluation can be found in the [WildCross GitHub repository]().
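The checkpoints listed above can also be fetched programmatically instead of through the browser links. The following is a minimal sketch, assuming the `huggingface_hub` Python package is installed and that the repository layout matches the links in the tables. The repository ID and file paths are copied from those links. The helper names (`resolve_url`, `download_depth_checkpoint`, `download_checkpoint_folder`) are hypothetical and are not part of any WildCross tooling.

```python
# Sketch only: helper names are illustrative, not part of the WildCross codebase.
# Repository ID and paths are copied from the checkpoint links in this README.
REPO_ID = "CSIRORobotics/WildCross"

# Single-file checkpoints (Metric Depth Estimation table).
DEPTH_CHECKPOINTS = {
    "vits": "DepthAnythingV2/finetuned/vits.pth",
    "vitb": "DepthAnythingV2/finetuned/vitb.pth",
    "vitl": "DepthAnythingV2/finetuned/vitl.pth",
}

# Checkpoint folders (Visual and Cross Modal Place Recognition tables).
CHECKPOINT_FOLDERS = {
    "NetVLAD": "VPR/NetVLAD",
    "MixVPR": "VPR/MixVPR",
    "SALAD": "VPR/SALAD",
    "BoQ": "VPR/BoQ",
    "LIP-Loc-resnet50": "crossmodal/resnet50",
    "LIP-Loc-dinov2": "crossmodal/dinov2",
    "LIP-Loc-dinov3": "crossmodal/dinov3",
}


def resolve_url(encoder: str) -> str:
    """Rebuild the direct download URL for a depth checkpoint."""
    return f"https://huggingface.co/{REPO_ID}/resolve/main/{DEPTH_CHECKPOINTS[encoder]}"


def folder_pattern(name: str) -> str:
    """Glob pattern that matches every file inside one checkpoint folder."""
    return f"{CHECKPOINT_FOLDERS[name]}/*"


def download_depth_checkpoint(encoder: str) -> str:
    """Download one .pth file into the local Hugging Face cache; returns its path."""
    from huggingface_hub import hf_hub_download  # imported lazily; needs network access

    return hf_hub_download(repo_id=REPO_ID, filename=DEPTH_CHECKPOINTS[encoder])


def download_checkpoint_folder(name: str) -> str:
    """Download one checkpoint folder; returns the local snapshot directory."""
    from huggingface_hub import snapshot_download  # imported lazily; needs network access

    return snapshot_download(repo_id=REPO_ID, allow_patterns=[folder_pattern(name)])
```

For example, `download_depth_checkpoint("vits")` would return the cached path of `vits.pth`, and `download_checkpoint_folder("SALAD")` would return a snapshot directory containing only the `VPR/SALAD` files. How the downloaded files are loaded into each model is not covered here and is left to the repository's own instructions.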