File size: 4,615 Bytes

---
license: apache-2.0

language:
- en

base_model:
- princeton-vl/RAFT-Stereo

pipeline_tag: depth-estimation

tags:
- Axera
- RAFT-Stereo
- NPU
- Stereo-Matching
- Depth-Estimation
- Computer-Vision
---

# RAFT-Stereo on Axera NPU

This repository contains the [RAFT-Stereo](https://arxiv.org/abs/2109.07547) model converted for high-performance inference on the Axera NPU. 

This version is optimized with **w8a16** quantization and is compatible with **Pulsar2 version 4.2**.

## Convert Tools Links

For model conversion and deployment guidance:

* [AXera Platform GitHub Repo](https://github.com/AXERA-TECH/RAFT-Stereo.axera): Sample code and optimization guides for Axera NPU.
* [Pulsar2 Documentation](https://pulsar2-docs.readthedocs.io/en/latest/pulsar2/introduction.html): Guide for converting ONNX models to `.axmodel`.

## Support Platforms

- **AX650N/AX8850**
  - [M4N-Dock(爱芯派Pro)](https://wiki.sipeed.com/hardware/zh/maixIV/m4ndock/m4ndock.html)
  - [M.2 Accelerator card](https://docs.m5stack.com/en/ai_hardware/LLM-8850_Card)
- **AX630C**
  - [爱芯派2](https://axera-pi-2-docs-cn.readthedocs.io/zh-cn/latest/index.html)
  - [Module-LLM](https://docs.m5stack.com/zh_CN/module/Module-LLM)
  - [LLM630 Compute Kit](https://docs.m5stack.com/zh_CN/core/LLM630%20Compute%20Kit)
- **AX637**


| Chips | Model Variant | Latency (ms) |
| --- | --- | --- |
| AX650 | raft_steoro256x640_r1.axmodel | 20.9 |
| AX650 | raft_steoro256x640_r4.axmodel | 111.8 |

| Chips | Model Variant | Latency (ms) |
| --- | --- | --- |
| AX630C| raft_steoro256x640_r1_npu2.axmodel | 317.765 |
| AX630C| raft_steoro256x640_r4_npu2.axmodel | 825.793 |

| Chips | Model Variant | Latency (ms) |
| --- | --- | --- |
| AX637| raft_steoro256x640_r1_npu1.axmodel | 61.033 |
| AX637| raft_steoro256x640_r4_npu1.axmodel | 99.094 |

## How to Use

Download the repository and ensure the directory structure is organized as follows:

```text
├── ax630c
│   ├── raft_steoro256x640_r1.onnx
│   ├── raft_steoro256x640_r1_npu2.axmodel
│   ├── raft_steoro256x640_r4.onnx
│   └── raft_steoro256x640_r4_npu2.axmodel
├── ax650
│   ├── raft_steoro256x640_r1.axmodel
│   ├── raft_steoro256x640_r1.onnx
│   ├── raft_steoro384x1280_r4.axmodel
│   └── raft_steoro384x1280_r4.onnx
├── ax637
│   ├── raft_steoro256x640_r1_npu1.axmodel
│   └── raft_steoro256x640_r4_npu1.axmodel
├── examples
│   ├── left
│   │   ├── 000051_11.png
│   │   ├── 000058_11.png
│   │   ├── 000059_10.png
│   │   ├── 000121_10.png
│   │   ├── 000164_10.png
│   │   ├── 000167_11.png
│   │   ├── 000172_11.png
│   │   ├── 000179_10.png
│   │   ├── 000193_10.png
│   │   └── 000195_10.png
│   └── right
│       ├── 000051_11.png
│       ├── 000058_11.png
│       ├── 000059_10.png
│       ├── 000121_10.png
│       ├── 000164_10.png
│       ├── 000167_11.png
│       ├── 000172_11.png
│       ├── 000179_10.png
│       ├── 000193_10.png
│       └── 000195_10.png
├── infer.py
└── infer_onnx.py
```

### Prerequisites

1. **Environment:** Ensure you have the required Python environment activated with the following core packages installed:
* **NPU Runtime:** [`axengine` (PyAXEngine)](https://github.com/AXERA-TECH/pyaxengine)
* **Core Libraries:** `numpy`, `opencv-python`, and `matplotlib`.


2. **Model/Data:** Ensure the compiled `.axmodel`, and input stereo pairs are available on the host.

### Inference Command

Run the inference script by providing the compiled model, and input data.

```bash
python3 infer.py --left examples/left/000051_11.png --right examples/right/000051_11.png --model raft_steoro256x640_r1_npu2.axmodel --width 640 --height 256

```

### Inference with AX630C Host

### Results

The model generates a dense disparity map representing the depth of the scene. 

```
(base) root@ax630c:~/data# /python3 infer.py --left examples/left/000051_11.png --right examples/right/000051_11.png --model raft_steoro256x640_r4_npu2.axmodel --width 640 --height 256
[INFO] Available providers:  ['AxEngineExecutionProvider']
[INFO] Using provider: AxEngineExecutionProvider
[INFO] Chip type: ChipType.MC20E
[INFO] VNPU type: VNPUType.DISABLED
[INFO] Engine version: 2.7.2a
[INFO] Model type: 1 (full core)
[INFO] Compiler version: 5.1-patch1-dirty e0fdc66b-dirty
Saved: output-ax.png

```

### Example Visualization

![](output-ax.png)