File size: 1,854 Bytes
4f9a223
 
9a5abe1
 
 
 
 
4f9a223
 
 
 
 
 
9a5abe1
4f9a223
 
 
 
 
 
 
 
 
 
 
9a5abe1
4f9a223
 
 
 
 
9a5abe1
4f9a223
 
 
 
 
 
 
9a5abe1
4f9a223
9a5abe1
4f9a223
 
 
 
 
 
 
 
 
 
 
 
 
 
5e19bde
 
9a5abe1
58d60ef
4f9a223
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
---
license: cc-by-4.0
pipeline_tag: robotics
tags:
- visual-navigation
- sim-to-real
- topological-navigation
---

# FAINT

Fast, Appearance-Invariant Navigation Transformer (FAINT) is a learned policy for vision-based topological navigation.

[**Paper**](https://huggingface.co/papers/2509.11791) | [**Project Page**](https://lasuomela.github.io/faint/) | [**GitHub**](https://github.com/lasuomela/FAINT)

## Model Details

The `FAINT-Real` model uses [`Theia-Tiny-CDDSV`](https://theia.theaiinstitute.com/) as backbone, and was trained for 30 epochs with the ~1.2M samples of the datasets used in the GNM/ViNT papers.

This repo contains two versions of the trained model weights.
- `model_pytorch.pt`: Weights-only state dict of the Pytorch model.
- `model_torchscript.pt`: A 'standalone' Torchscript model for deployment.

## Usage

See the main GitHub [repo](https://github.com/lasuomela/FAINT) for details regarding input preprocessing, deployment with ROS2, and training.

### Torchscript

Only dependency is Pytorch.

```python
import torch
ckpt_path = 'FAINT-Real/model_torchscript.pt'
model = torch.jit.load(ckpt_path)
```

### Pytorch

Need to have the FAINT library installed.

```python
import torch
from faint.common.models.faint import FAINT

ckpt_path = 'FAINT-Real/model_pytorch.pt'
state_dict = torch.load(ckpt_path)

model = FAINT() # The weights in this repo correspond to FAINT initialized with the default arguments
model.load_state_dict(state_dict)
```

## Citation

If you use FAINT in your research, please use the following BibTeX entry:
```bibtex
@article{suomela2025synthetic,
  title={Synthetic vs. Real Training Data for Visual Navigation},
  author={Suomela, Lauri and Kuruppu Arachchige, Sasanka and Torres, German F. and Edelman, Harry and Kämäräinen, Joni-Kristian},
  journal={arXiv:2509.11791},
  year={2025}
}
```