NÓTT — Thermal Image Refinement for Monocular ORB-SLAM3

Part of the ANIMA Perception Suite by Robot Flow Labs.

Models

v3_sol — VIVID++ pretrained + SOL synthetic fine-tuned (BEST)

Fine-tuned on robotflowlabs/anima-thermal-synthetic (26K COCO synthetic + 399 VIVID real pairs). 8-GPU DDP with frozen BatchNorm.

  • ARE: 0.174 | val_loss: 0.079

v2 — VIVID++ pretrained (baseline)

Trained on VIVID++ dataset (71,917 thermal/depth paired frames, 24 sequences).

  • ARE: 0.106 | val_loss: 0.034

Exported Formats

Model Format File Size
v3_sol PyTorch pytorch/nott_v3_sol.pth 8.2MB
v3_sol SafeTensors pytorch/nott_v3_sol.safetensors 8.2MB
v3_sol ONNX onnx/nott_v3_sol.onnx 8.2MB
v3_sol TensorRT FP16 tensorrt/nott_v3_sol_fp16.trt 4.6MB
v3_sol TensorRT FP32 tensorrt/nott_v3_sol_fp32.trt 9.2MB
v2 PyTorch pytorch/nott_v2.pth 8.2MB
v2 SafeTensors pytorch/nott_v2.safetensors 8.2MB
v2 ONNX onnx/nott_v2.onnx 8.2MB
v2 TensorRT FP16 tensorrt/nott_v2_fp16.trt 4.5MB
v2 TensorRT FP32 tensorrt/nott_v2_fp32.trt 9.1MB

Architecture

T-RefNet — U-Net encoder-decoder with ConvGRU recurrent bottleneck (2M params, 8MB).

Paper

Thermal Image Refinement with Depth Estimation using Recurrent Networks for Monocular ORB-SLAM3 arXiv:2603.14998 — Şahin, Pham, Dang, Yegenoglu, Kayacan (2026)

Training Pipeline

VIVID++ (71K real thermal) → pretrain (lr=1e-3) → v2 (ARE=0.106)
    ↓
SOL synthetic (26K COCO + 399 real) → fine-tune (lr=1e-5, frozen BN, 8 GPU DDP) → v3_sol (ARE=0.174)

Usage

import torch
from anima_nott.thermal_refinement import ThermalRefinementNet

model = ThermalRefinementNet(in_channels=1, base_channels=32, num_levels=3, gru_layers=2)
state = torch.load("pytorch/nott_v3_sol.pth", weights_only=True)
model.load_state_dict(state)
model.eval()

thermal = torch.randn(1, 1, 256, 320)  # normalized [0,1]
refined, hidden = model(thermal)

Defense Application

Low-cost thermal SLAM for GPS-denied, low-light UAV navigation. Non-radiometric thermal cameras (~$150 FLIR Lepton 3.5). Target: <0.4m trajectory error, 25+ FPS on Jetson Xavier.

License

Apache-2.0 — Robot Flow Labs / AIFLOW LABS LIMITED

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train ilessio-aiflowlab/project_nott

Paper for ilessio-aiflowlab/project_nott