TUNI: Real-time RGB-T Semantic Segmentation with Unified Multi-Modal Feature Extraction and Cross-Modal Feature Fusion
Paper โข 2509.10005 โข Published
Paper: TUNI: Real-time RGB-T Semantic Segmentation with Unified Multi-Modal Feature Extraction and Cross-Modal Feature Fusion Venue: ICRA 2026 Authors: Xiaodong Guo et al.
| Dataset | Classes | mIoU | Pixel Acc |
|---|---|---|---|
| FMB | 15 | 62.4% | 91.4% |
| PST900 | 5 | TBD | TBD |
| CART | 12 | TBD | TBD |
tuni_fmb.pth โ PyTorch state dicttuni_fmb.safetensors โ Safetensors formattuni_fmb.onnx + tuni_fmb.onnx.data โ ONNX (opset 18)tuni_fmb_fp16.trt โ TensorRT FP16 enginetuni_fmb_fp32.trt โ TensorRT FP32 enginefrom def_tuni.model import TUNIModel
model = TUNIModel(variant="384_2242", n_classes=15)
model.load_checkpoint("tuni_fmb.pth")
model.eval().cuda()
# Inference
pred = model(rgb_tensor, thermal_tensor)
segmentation = pred.argmax(dim=1)
Part of the ANIMA Defense Module ecosystem. Built with ANIMA by Robot Flow Labs.