soprano-onnx / README.md
KevinAHM's picture
Contents update
a566983
metadata
license: apache-2.0
base_model: ekwek/Soprano-80M
tags:
  - onnx

Soprano ONNX (KV Cache)

This repository hosts ONNX exports of the Soprano 80M model with KV caching.

Contents

  • onnx/soprano_backbone_kv.onnx (backbone with past_key_values)
  • onnx/soprano_decoder.onnx + onnx/soprano_decoder.onnx.data (vocoder decoder)
  • / (tokenizer assets)

Inference & demo

See the streaming inference code here: https://github.com/KevinAHM/soprano-web-onnx

Not compatible with WebGPU via onnxruntime-web as of January 2026.

Upstream

Original project: https://github.com/ekwek1/soprano