Yolo-X: Optimized for Qualcomm Devices

YoloX is a machine learning model that predicts bounding boxes and classes of objects in an image.

This is based on the implementation of Yolo-X found here. This repository contains pre-exported model files optimized for Qualcomm® devices. You can use the Qualcomm® AI Hub Models library to export with custom configurations. More details on model performance across various devices, can be found here.

Qualcomm AI Hub Models uses Qualcomm AI Hub Workbench to compile, profile, and evaluate this model. Sign up to run these models on a hosted Qualcomm® device.

Getting Started

There are two ways to deploy this model on your device:

Option 1: Download Pre-Exported Models

Below are pre-exported model assets ready for deployment.

Runtime Precision Chipset SDK Versions Download
ONNX float Universal QAIRT 2.42, ONNX Runtime 1.24.3 Download
ONNX w8a16 Universal QAIRT 2.42, ONNX Runtime 1.24.3 Download
ONNX w8a8 Universal QAIRT 2.42, ONNX Runtime 1.24.3 Download
ONNX w8a8_mixed_int16 Universal QAIRT 2.42, ONNX Runtime 1.24.3 Download
QNN_DLC w8a16 Universal QAIRT 2.45 Download
QNN_DLC w8a8 Universal QAIRT 2.45 Download
QNN_DLC w8a8_mixed_int16 Universal QAIRT 2.45 Download
TFLITE w8a8 Universal QAIRT 2.45 Download

For more device-specific assets and performance metrics, visit Yolo-X on Qualcomm® AI Hub.

Option 2: Export with Custom Configurations

Use the Qualcomm® AI Hub Models Python library to compile and export the model with your own:

  • Custom weights (e.g., fine-tuned checkpoints)
  • Custom input shapes
  • Target device and runtime configurations

This option is ideal if you need to customize the model beyond the default configuration provided here.

See our repository for Yolo-X on GitHub for usage instructions.

Model Details

Model Type: Model_use_case.object_detection

Model Stats:

  • Model checkpoint: YoloX Small
  • Input resolution: 640x640
  • Number of parameters: 8.98M
  • Model size (float): 34.3 MB
  • Model size (w8a16): 9.53 MB
  • Model size (w8a8): 8.96 MB

Performance Summary

Model Runtime Precision Chipset Inference Time (ms) Peak Memory Range (MB) Primary Compute Unit
Yolo-X ONNX float Snapdragon® 8 Elite Gen 5 Mobile 6.126 ms 5 - 166 MB NPU
Yolo-X ONNX float Snapdragon® 8 Elite Mobile 7.679 ms 3 - 163 MB NPU
Yolo-X ONNX float Snapdragon® X2 Elite 7.526 ms 14 - 14 MB NPU
Yolo-X ONNX float Snapdragon® X Elite 13.518 ms 14 - 14 MB NPU
Yolo-X ONNX float Snapdragon® X Elite 13.518 ms 14 - 14 MB NPU
Yolo-X ONNX float Snapdragon® 8 Gen 3 Mobile 9.031 ms 5 - 190 MB NPU
Yolo-X ONNX float Qualcomm® QCS8550 (Proxy) 13.785 ms 0 - 18 MB NPU
Yolo-X ONNX float Qualcomm® QCS9075 16.441 ms 5 - 7 MB NPU
Yolo-X ONNX float Snapdragon® 8 Elite For Galaxy Mobile 7.679 ms 3 - 163 MB NPU
Yolo-X ONNX w8a16 Snapdragon® 8 Elite Gen 5 Mobile 11.064 ms 3 - 204 MB NPU
Yolo-X ONNX w8a16 Snapdragon® 8 Elite Mobile 9.145 ms 0 - 199 MB NPU
Yolo-X ONNX w8a16 Snapdragon® X2 Elite 11.396 ms 8 - 8 MB NPU
Yolo-X ONNX w8a16 Snapdragon® X Elite 15.09 ms 7 - 7 MB NPU
Yolo-X ONNX w8a16 Snapdragon® X Elite 15.09 ms 7 - 7 MB NPU
Yolo-X ONNX w8a16 Snapdragon® 8 Gen 3 Mobile 9.88 ms 3 - 242 MB NPU
Yolo-X ONNX w8a16 Qualcomm® QCS6490 792.955 ms 114 - 116 MB CPU
Yolo-X ONNX w8a16 Qualcomm® QCS8550 (Proxy) 13.927 ms 0 - 12 MB NPU
Yolo-X ONNX w8a16 Qualcomm® QCS9075 14.778 ms 2 - 5 MB NPU
Yolo-X ONNX w8a16 Qualcomm® QCM6690 425.409 ms 109 - 118 MB CPU
Yolo-X ONNX w8a16 Snapdragon® 8 Elite For Galaxy Mobile 9.145 ms 0 - 199 MB NPU
Yolo-X ONNX w8a16 Snapdragon® 7 Gen 4 Mobile 396.281 ms 113 - 123 MB CPU
Yolo-X ONNX w8a16 Snapdragon® 7 Gen 4 Mobile 396.281 ms 113 - 123 MB CPU
Yolo-X ONNX w8a8 Snapdragon® 8 Elite Gen 5 Mobile 8.037 ms 1 - 59 MB NPU
Yolo-X ONNX w8a8 Snapdragon® 8 Elite Mobile 5.805 ms 1 - 60 MB NPU
Yolo-X ONNX w8a8 Snapdragon® X2 Elite 8.19 ms 8 - 8 MB NPU
Yolo-X ONNX w8a8 Snapdragon® X Elite 9.279 ms 8 - 8 MB NPU
Yolo-X ONNX w8a8 Snapdragon® X Elite 9.279 ms 8 - 8 MB NPU
Yolo-X ONNX w8a8 Snapdragon® 8 Gen 3 Mobile 6.205 ms 1 - 90 MB NPU
Yolo-X ONNX w8a8 Qualcomm® QCS6490 115.846 ms 47 - 57 MB CPU
Yolo-X ONNX w8a8 Qualcomm® QCS8550 (Proxy) 8.332 ms 0 - 94 MB NPU
Yolo-X ONNX w8a8 Qualcomm® QCS9075 9.357 ms 1 - 4 MB NPU
Yolo-X ONNX w8a8 Qualcomm® QCM6690 98.36 ms 38 - 50 MB CPU
Yolo-X ONNX w8a8 Snapdragon® 8 Elite For Galaxy Mobile 5.805 ms 1 - 60 MB NPU
Yolo-X ONNX w8a8 Snapdragon® 7 Gen 4 Mobile 94.914 ms 55 - 66 MB CPU
Yolo-X ONNX w8a8 Snapdragon® 7 Gen 4 Mobile 94.914 ms 55 - 66 MB CPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® 8 Elite Gen 5 Mobile 26.449 ms 12 - 740 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® 8 Elite Mobile 24.75 ms 10 - 722 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® X2 Elite 28.251 ms 19 - 19 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® X Elite 36.488 ms 16 - 16 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® X Elite 36.488 ms 16 - 16 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® 8 Gen 3 Mobile 27.722 ms 11 - 835 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Qualcomm® QCS6490 316.752 ms 103 - 111 MB CPU
Yolo-X ONNX w8a8_mixed_int16 Qualcomm® QCS8550 (Proxy) 38.286 ms 4 - 29 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Qualcomm® QCS9075 39.006 ms 12 - 14 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Qualcomm® QCM6690 208.411 ms 93 - 103 MB CPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® 8 Elite For Galaxy Mobile 24.75 ms 10 - 722 MB NPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® 7 Gen 4 Mobile 194.54 ms 112 - 123 MB CPU
Yolo-X ONNX w8a8_mixed_int16 Snapdragon® 7 Gen 4 Mobile 194.54 ms 112 - 123 MB CPU
Yolo-X QNN_DLC w8a16 Snapdragon® 8 Elite Gen 5 Mobile 1.931 ms 2 - 84 MB NPU
Yolo-X QNN_DLC w8a16 Snapdragon® 8 Elite Mobile 2.604 ms 2 - 83 MB NPU
Yolo-X QNN_DLC w8a16 Snapdragon® X2 Elite 2.629 ms 2 - 2 MB NPU
Yolo-X QNN_DLC w8a16 Snapdragon® X Elite 6.076 ms 2 - 2 MB NPU
Yolo-X QNN_DLC w8a16 Snapdragon® X Elite 6.076 ms 2 - 2 MB NPU
Yolo-X QNN_DLC w8a16 Snapdragon® 8 Gen 3 Mobile 3.631 ms 2 - 104 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® QCS6490 17.177 ms 2 - 6 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® QCS8275 (Proxy) 11.663 ms 2 - 72 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® QCS8550 (Proxy) 5.553 ms 2 - 4 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® SA8775P 6.137 ms 0 - 73 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® SA8775P 6.137 ms 0 - 73 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® SA8775P 6.137 ms 0 - 73 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® QCS9075 6.008 ms 1 - 5 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® QCM6690 49.558 ms 2 - 194 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® QCS8450 (Proxy) 6.911 ms 2 - 107 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® SA7255P 11.663 ms 2 - 72 MB NPU
Yolo-X QNN_DLC w8a16 Qualcomm® SA8295P 7.593 ms 2 - 72 MB NPU
Yolo-X QNN_DLC w8a16 Snapdragon® 8 Elite For Galaxy Mobile 2.604 ms 2 - 83 MB NPU
Yolo-X QNN_DLC w8a16 Snapdragon® 7 Gen 4 Mobile 7.044 ms 1 - 189 MB NPU
Yolo-X QNN_DLC w8a16 Snapdragon® 7 Gen 4 Mobile 7.044 ms 1 - 189 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® 8 Elite Gen 5 Mobile 0.906 ms 1 - 52 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® 8 Elite Mobile 1.171 ms 0 - 47 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® X2 Elite 1.225 ms 1 - 1 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® X Elite 2.727 ms 1 - 1 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® X Elite 2.727 ms 1 - 1 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® 8 Gen 3 Mobile 1.651 ms 0 - 74 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® QCS6490 7.727 ms 3 - 6 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® QCS8275 (Proxy) 5.84 ms 1 - 50 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® QCS8550 (Proxy) 2.434 ms 1 - 3 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® SA8775P 2.822 ms 1 - 53 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® SA8775P 2.822 ms 1 - 53 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® SA8775P 2.822 ms 1 - 53 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® QCS9075 3.276 ms 1 - 4 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® QCM6690 22.98 ms 3 - 165 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® QCS8450 (Proxy) 3.085 ms 1 - 76 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® SA7255P 5.84 ms 1 - 50 MB NPU
Yolo-X QNN_DLC w8a8 Qualcomm® SA8295P 3.769 ms 1 - 48 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® 8 Elite For Galaxy Mobile 1.171 ms 0 - 47 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® 7 Gen 4 Mobile 2.795 ms 1 - 49 MB NPU
Yolo-X QNN_DLC w8a8 Snapdragon® 7 Gen 4 Mobile 2.795 ms 1 - 49 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® 8 Elite Gen 5 Mobile 1.374 ms 1 - 70 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® 8 Elite Mobile 1.766 ms 1 - 67 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® X2 Elite 1.855 ms 1 - 1 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® X Elite 4.068 ms 1 - 1 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® X Elite 4.068 ms 1 - 1 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® 8 Gen 3 Mobile 2.42 ms 1 - 89 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Qualcomm® QCS8275 (Proxy) 8.13 ms 1 - 59 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Qualcomm® QCS8550 (Proxy) 3.696 ms 1 - 3 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Qualcomm® SA8775P 4.092 ms 1 - 65 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Qualcomm® SA8775P 4.092 ms 1 - 65 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Qualcomm® SA8775P 4.092 ms 1 - 65 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Qualcomm® QCS9075 4.505 ms 1 - 3 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Qualcomm® QCM6690 35.07 ms 1 - 179 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Qualcomm® SA7255P 8.13 ms 1 - 59 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® 8 Elite For Galaxy Mobile 1.766 ms 1 - 67 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® 7 Gen 4 Mobile 4.609 ms 1 - 180 MB NPU
Yolo-X QNN_DLC w8a8_mixed_int16 Snapdragon® 7 Gen 4 Mobile 4.609 ms 1 - 180 MB NPU
Yolo-X TFLITE w8a8 Snapdragon® 8 Elite Gen 5 Mobile 0.912 ms 0 - 50 MB NPU
Yolo-X TFLITE w8a8 Snapdragon® 8 Elite Mobile 1.193 ms 0 - 45 MB NPU
Yolo-X TFLITE w8a8 Snapdragon® 8 Gen 3 Mobile 1.65 ms 0 - 70 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® QCS6490 7.217 ms 0 - 12 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® QCS8275 (Proxy) 5.86 ms 0 - 46 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® QCS8550 (Proxy) 2.485 ms 0 - 2 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® SA8775P 2.901 ms 0 - 50 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® SA8775P 2.901 ms 0 - 50 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® SA8775P 2.901 ms 0 - 50 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® QCS9075 3.362 ms 0 - 12 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® QCM6690 21.738 ms 0 - 159 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® QCS8450 (Proxy) 2.944 ms 0 - 71 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® SA7255P 5.86 ms 0 - 46 MB NPU
Yolo-X TFLITE w8a8 Qualcomm® SA8295P 3.814 ms 0 - 44 MB NPU
Yolo-X TFLITE w8a8 Snapdragon® 8 Elite For Galaxy Mobile 1.193 ms 0 - 45 MB NPU
Yolo-X TFLITE w8a8 Snapdragon® 7 Gen 4 Mobile 2.78 ms 0 - 45 MB NPU
Yolo-X TFLITE w8a8 Snapdragon® 7 Gen 4 Mobile 2.78 ms 0 - 45 MB NPU

License

  • The license for the original implementation of Yolo-X can be found here.

References

Community

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support