Shadow0482 commited on
Commit
faa0459
·
verified ·
1 Parent(s): c1e5443

Upload folder using huggingface_hub

Browse files
test_outputs/README_tests.md ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Supertonic Quantized – Test Outputs
2
+
3
+ This folder was generated automatically by a Colab script that:
4
+
5
+ 1. Downloaded the Hugging Face repo **Shadow0482/supertonic-quantized**
6
+ 2. Located all `*.onnx` models (both `fp16/` and `int8_dynamic/`)
7
+ 3. Ran each model once with dummy inputs using ONNX Runtime
8
+ 4. Saved:
9
+ - `.wav` files for audio-like tensors (1D or 2D, 1–2 channels, >=16 samples)
10
+ - `.npy` files for all other outputs
11
+
12
+ All paths below are relative to the `test_outputs/` directory.
13
+
14
+ ## Per-model results
15
+
16
+ - `fp16/duration_predictor.fp16.onnx -> fp16/fp16_duration_predictor.fp16_out0.npy`
17
+ - `fp16/text_encoder.fp16.onnx -> fp16/fp16_text_encoder.fp16_out0.npy`
18
+ - `fp16/vector_estimator.fp16.onnx -> FAILED`
19
+ - `fp16/vocoder.fp16.onnx -> fp16/fp16_vocoder.fp16_out0.wav`
20
+ - `int8_dynamic/duration_predictor.int8.onnx -> int8_dynamic/int8_dynamic_duration_predictor.int8_out0.npy`
21
+ - `int8_dynamic/text_encoder.int8.onnx -> int8_dynamic/int8_dynamic_text_encoder.int8_out0.npy`
22
+ - `int8_dynamic/vector_estimator.int8.onnx -> int8_dynamic/int8_dynamic_vector_estimator.int8_out0.npy`
23
+ - `int8_dynamic/vocoder.int8.onnx -> int8_dynamic/int8_dynamic_vocoder.int8_out0.wav`
24
+
25
+ ## Models that failed to load / run
26
+
27
+ - `fp16/vector_estimator.fp16.onnx` -> **FAILED**: [ONNXRuntimeError] : 1 : FAIL : Load model from /content/supertonic/supertonic_quantized/fp16/vector_estimator.fp16.onnx failed:Type Error: Type (tensor(float16)) of output arg (/vector_field/main_blocks.3/attn/Cast_output_0) of node (/vector_field/main_blocks.3/attn/Cast) does not match expected type (tensor(float)).
28
+
29
+
30
+ > Note:
31
+ > These tests use synthetic dummy inputs. They confirm that the
32
+ > quantized ONNX graphs load and execute, but they are **not**
33
+ > a replacement for real end-to-end TTS quality evaluation.
test_outputs/fp16/fp16_duration_predictor.fp16_out0.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:adae1157c6f7966fff1b160279a438f80e8121d75fb1ae7fd23929f75ce7bdf1
3
+ size 132
test_outputs/fp16/fp16_duration_predictor.fp16_out0.wav ADDED
Binary file (46 Bytes). View file
 
test_outputs/fp16/fp16_text_encoder.fp16_out0.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee27b7abb5e574f8eb582405d7a0ba92b526420d3bca2e0b89e3394409b73879
3
+ size 10368
test_outputs/fp16/fp16_vocoder.fp16_out0.wav ADDED
Binary file (61.5 kB). View file
 
test_outputs/int8_dynamic/int8_dynamic_duration_predictor.int8_out0.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:584b0425a13c8ee9a35808a806fb4bd86264bda3ed799425caa7c9644e5a79dc
3
+ size 132
test_outputs/int8_dynamic/int8_dynamic_text_encoder.int8_out0.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee27b7abb5e574f8eb582405d7a0ba92b526420d3bca2e0b89e3394409b73879
3
+ size 10368
test_outputs/int8_dynamic/int8_dynamic_vector_estimator.int8_out0.npy ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5856cde54e8cc4b4dff2f218d3cc8834aaa204a38b96ebf0b168ea54ce010040
3
+ size 5888
test_outputs/int8_dynamic/int8_dynamic_vocoder.int8_out0.wav ADDED
Binary file (61.5 kB). View file