A Dual-Modulation Framework for RGB-T Crowd Counting via Spatially Modulated Attention and Adaptive Fusion
Paper โข 2509.17079 โข Published
Dual-Modulation Framework for RGB-T Crowd Counting via Spatially Modulated Attention and Adaptive Fusion.
Paper: ArXiv 2509.17079
model.pth โ PyTorch state dictmodel.safetensors โ SafeTensors formatmodel.onnx โ ONNX (opset 17)model_fp16.trt โ TensorRT FP16model_fp32.trt โ TensorRT FP32from def_rgbtcc.serve import RGBTCCInference
model = RGBTCCInference("model.pth")
result = model.predict(rgb_image, thermal_image)
print(f"Count: {result['count']:.1f}")
Part of the ANIMA Defense Module ecosystem (Wave 8). Products: ORACLE, ATLAS, NEMESIS