RAGNAROK - Multispectral Camouflaged Object Detection

Phase 1 RGB pretraining checkpoint for the RAGNAROK module.

Model Details

  • Paper: MCOD: First Multispectral Camouflaged Object Detection Benchmark (arXiv: 2509.15753)
  • Architecture: ResNet-50 8-channel backbone + Spectral Attention + PRNet Progressive Decoder
  • Parameters: 33.3M
  • Training: Phase 1 RGB pretraining on COD10K (4,040 images)
  • Input: 8-channel tensor (3 RGB + 4 NIR + 1 thermal), 352x352
  • Output: Binary camouflage segmentation mask

Training Results (Phase 1)

Metric Value
val/loss 2.256
val/iou 0.226
Epochs 50
Batch size 44 per GPU
GPUs 2x NVIDIA L4 (DDP)
Precision FP16 mixed

Available Formats

Format File Size
PyTorch (.pth) project_ragnarok_cuda_v1_phase1_best.pth 128MB
SafeTensors project_ragnarok_cuda_v1_phase1_best.safetensors 128MB
ONNX (opset 17) project_ragnarok_v1_phase1.onnx 111MB
TensorRT FP32 project_ragnarok_v1_phase1_trt_fp32.engine 114MB
TensorRT FP16 project_ragnarok_v1_phase1_trt_fp16.engine 57MB

Usage

import torch
from anima_ragnarok.model import RagnarokModel

model = RagnarokModel(config_path="configs/model_config.yaml")
state = torch.load("project_ragnarok_cuda_v1_phase1_best.pth")
model.load_state_dict(state["model_state_dict"], strict=False)
model.eval()

# Input: 8-channel image [B, 8, 352, 352]
output = model(input_tensor)
mask = output[0]  # [B, 1, 352, 352]

ANIMA Module

Part of the ANIMA robotics perception stack. See RobotFlow-Labs/defense-project-ragnarok.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support