Upload folder using huggingface_hub
- test_outputs/README_tests.md +33 -0
- test_outputs/fp16/fp16_duration_predictor.fp16_out0.npy +3 -0
- test_outputs/fp16/fp16_duration_predictor.fp16_out0.wav +0 -0
- test_outputs/fp16/fp16_text_encoder.fp16_out0.npy +3 -0
- test_outputs/fp16/fp16_vocoder.fp16_out0.wav +0 -0
- test_outputs/int8_dynamic/int8_dynamic_duration_predictor.int8_out0.npy +3 -0
- test_outputs/int8_dynamic/int8_dynamic_text_encoder.int8_out0.npy +3 -0
- test_outputs/int8_dynamic/int8_dynamic_vector_estimator.int8_out0.npy +3 -0
- test_outputs/int8_dynamic/int8_dynamic_vocoder.int8_out0.wav +0 -0
test_outputs/README_tests.md
ADDED
@@ -0,0 +1,33 @@
# Supertonic Quantized – Test Outputs

This folder was generated automatically by a Colab script that:

1. Downloaded the Hugging Face repo **Shadow0482/supertonic-quantized**
2. Located all `*.onnx` models (both `fp16/` and `int8_dynamic/`)
3. Ran each model once with dummy inputs using ONNX Runtime
4. Saved:
   - `.wav` files for audio-like tensors (1D or 2D, 1–2 channels, >=16 samples)
   - `.npy` files for all other outputs
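
Steps 3 and 4 could be sketched roughly as follows. This is a minimal illustration, not the original Colab script: the helper names (`concrete_shape`, `is_audio_like`, `write_wav`, `run_once`), the use of all-zero dummy inputs, the choice of the shorter axis as the channel axis, and the 44100 Hz sample rate are all assumptions made for the example.

```python
import wave
from pathlib import Path

# Assumed mapping from ONNX Runtime type strings to NumPy dtype names.
ORT_TO_NUMPY = {
    "tensor(float)": "float32",
    "tensor(float16)": "float16",
    "tensor(int64)": "int64",
    "tensor(int32)": "int32",
    "tensor(bool)": "bool",
}


def concrete_shape(shape, default=1):
    """Replace symbolic or unknown dimensions (names, None) with `default`."""
    return tuple(d if isinstance(d, int) and d > 0 else default for d in shape)


def is_audio_like(shape, min_samples=16):
    """README rule: 1D or 2D, 1-2 channels, at least `min_samples` samples.

    Assumption: in the 2D case the shorter axis is the channel axis.
    """
    if len(shape) == 1:
        return shape[0] >= min_samples
    if len(shape) == 2:
        channels, samples = min(shape), max(shape)
        return 1 <= channels <= 2 and samples >= min_samples
    return False


def write_wav(path, audio, sample_rate):
    """Write a float array in [-1, 1] as 16-bit PCM using the stdlib."""
    import numpy as np

    data = np.asarray(audio, dtype=np.float32)
    if data.ndim == 2 and data.shape[0] < data.shape[1]:
        data = data.T  # frames first, so samples interleave per channel
    pcm = (np.clip(np.nan_to_num(data), -1.0, 1.0) * 32767).astype("<i2")
    with wave.open(str(path), "wb") as wav:
        wav.setnchannels(1 if pcm.ndim == 1 else pcm.shape[1])
        wav.setsampwidth(2)
        wav.setframerate(sample_rate)
        wav.writeframes(pcm.tobytes())


def run_once(model_path, out_dir, sample_rate=44100):
    """Run one model on all-zero inputs and save every output.

    The sample rate is a placeholder, not a value taken from the README.
    """
    import numpy as np
    import onnxruntime as ort

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    session = ort.InferenceSession(str(model_path), providers=["CPUExecutionProvider"])
    feeds = {
        arg.name: np.zeros(concrete_shape(arg.shape), dtype=ORT_TO_NUMPY.get(arg.type, "float32"))
        for arg in session.get_inputs()
    }
    saved = []
    for index, output in enumerate(session.run(None, feeds)):
        output = np.asarray(output)
        base = Path(out_dir) / f"{Path(model_path).stem}_out{index}"
        if is_audio_like(output.shape):
            target = Path(f"{base}.wav")
            write_wav(target, output, sample_rate)
        else:
            target = Path(f"{base}.npy")
            np.save(target, output)
        saved.append(target)
    return saved
```

Under these assumptions, `is_audio_like((1, 4096))` is true while `is_audio_like((1, 128, 20))` is false, so the second tensor would be written as `.npy`.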

All paths below are relative to the `test_outputs/` directory.

## Per-model results

- `fp16/duration_predictor.fp16.onnx -> fp16/fp16_duration_predictor.fp16_out0.npy`
- `fp16/text_encoder.fp16.onnx -> fp16/fp16_text_encoder.fp16_out0.npy`
- `fp16/vector_estimator.fp16.onnx -> FAILED`
- `fp16/vocoder.fp16.onnx -> fp16/fp16_vocoder.fp16_out0.wav`
- `int8_dynamic/duration_predictor.int8.onnx -> int8_dynamic/int8_dynamic_duration_predictor.int8_out0.npy`
- `int8_dynamic/text_encoder.int8.onnx -> int8_dynamic/int8_dynamic_text_encoder.int8_out0.npy`
- `int8_dynamic/vector_estimator.int8.onnx -> int8_dynamic/int8_dynamic_vector_estimator.int8_out0.npy`
- `int8_dynamic/vocoder.int8.onnx -> int8_dynamic/int8_dynamic_vocoder.int8_out0.wav`
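
The output names in this list appear to follow the pattern `<folder>/<folder>_<model stem>_out<index>.<ext>`. A small sketch of that mapping, inferred only from the entries above (the function name `output_path` is hypothetical and the real script may build names differently):

```python
from pathlib import PurePosixPath


def output_path(model_rel_path, index, ext):
    """Derive an output path from a model path, following the list above."""
    model = PurePosixPath(model_rel_path)
    folder = model.parent.name  # e.g. "fp16" or "int8_dynamic"
    # model.stem drops only the final ".onnx", keeping ".fp16" / ".int8".
    return str(model.parent / f"{folder}_{model.stem}_out{index}.{ext}")


print(output_path("fp16/vocoder.fp16.onnx", 0, "wav"))
```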

## Models that failed to load / run

- `fp16/vector_estimator.fp16.onnx` -> **FAILED**: [ONNXRuntimeError] : 1 : FAIL : Load model from /content/supertonic/supertonic_quantized/fp16/vector_estimator.fp16.onnx failed:Type Error: Type (tensor(float16)) of output arg (/vector_field/main_blocks.3/attn/Cast_output_0) of node (/vector_field/main_blocks.3/attn/Cast) does not match expected type (tensor(float)).
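
One possible first step when investigating this failure is to pull the node name and the two types out of the message, then look at that node in the graph. The sketch below is illustrative only: `parse_type_mismatch` and `describe_node` are hypothetical helpers, the regular expression is fitted to the message quoted above rather than to any documented ONNX Runtime format, and `describe_node` assumes the `onnx` package is installed.

```python
import re

# Pattern fitted to the type-mismatch message quoted above (an assumption,
# not a documented ONNX Runtime message format).
TYPE_MISMATCH = re.compile(
    r"Type \((?P<actual>.+?)\) of output arg \((?P<arg>.+?)\) "
    r"of node \((?P<node>.+?)\) does not match expected type \((?P<expected>.+?)\)\."
)


def parse_type_mismatch(message):
    """Return the actual/expected types, output arg and node name, or None."""
    match = TYPE_MISMATCH.search(message)
    return match.groupdict() if match else None


def describe_node(model_path, node_name):
    """Return (op_type, attributes) for the named node, or None if absent."""
    import onnx
    from onnx import helper

    model = onnx.load(str(model_path))
    for node in model.graph.node:
        if node.name == node_name:
            attrs = {a.name: helper.get_attribute_value(a) for a in node.attribute}
            if node.op_type == "Cast" and "to" in attrs:
                # Show the target element type by name, e.g. "FLOAT16".
                attrs["to"] = onnx.TensorProto.DataType.Name(attrs["to"])
            return node.op_type, attrs
    return None
```

Calling `parse_type_mismatch` on the message above yields the node `/vector_field/main_blocks.3/attn/Cast`, which can then be passed to `describe_node` together with the path to `fp16/vector_estimator.fp16.onnx`.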

> Note:
> These tests use synthetic dummy inputs. They only check whether each
> quantized ONNX graph loads and executes, so they are **not**
> a replacement for real end-to-end TTS quality evaluation.
test_outputs/fp16/fp16_duration_predictor.fp16_out0.npy
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:adae1157c6f7966fff1b160279a438f80e8121d75fb1ae7fd23929f75ce7bdf1
size 132
test_outputs/fp16/fp16_duration_predictor.fp16_out0.wav
ADDED
Binary file (46 Bytes).
test_outputs/fp16/fp16_text_encoder.fp16_out0.npy
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ee27b7abb5e574f8eb582405d7a0ba92b526420d3bca2e0b89e3394409b73879
size 10368
test_outputs/fp16/fp16_vocoder.fp16_out0.wav
ADDED
Binary file (61.5 kB).
test_outputs/int8_dynamic/int8_dynamic_duration_predictor.int8_out0.npy
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:584b0425a13c8ee9a35808a806fb4bd86264bda3ed799425caa7c9644e5a79dc
size 132
test_outputs/int8_dynamic/int8_dynamic_text_encoder.int8_out0.npy
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:ee27b7abb5e574f8eb582405d7a0ba92b526420d3bca2e0b89e3394409b73879
size 10368
test_outputs/int8_dynamic/int8_dynamic_vector_estimator.int8_out0.npy
ADDED
@@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:5856cde54e8cc4b4dff2f218d3cc8834aaa204a38b96ebf0b168ea54ce010040
size 5888
test_outputs/int8_dynamic/int8_dynamic_vocoder.int8_out0.wav
ADDED
Binary file (61.5 kB).