docs: fp16/int8 full benchmark + RTFx
Browse files
README.md
CHANGED
|
@@ -62,9 +62,16 @@ the encoder/decoder pad-masks from the **input tensor's seq dim** (so
|
|
| 62 |
|
| 63 |
## Benchmark — AISHELL-1 test (CoreML on ANE)
|
| 64 |
|
| 65 |
-
|
| 66 |
-
|
| 67 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 68 |
|
| 69 |
Reproduces the published Paraformer-large AISHELL-1 number — confirming the
|
| 70 |
conversion (front-end + encoder + CIF + decoder) is faithful.
|
|
|
|
| 62 |
|
| 63 |
## Benchmark — AISHELL-1 test (CoreML on ANE)
|
| 64 |
|
| 65 |
+
Full test set (7,176 utts), full-CoreML pipeline on M5 Pro ANE:
|
| 66 |
+
|
| 67 |
+
| Precision | size (enc+dec) | CER | median RTFx | peak RAM |
|
| 68 |
+
|-----------|----------------|-----|-------------|----------|
|
| 69 |
+
| fp16 (default) | 411 MB | **2.12%** | 85× | 0.38 GB |
|
| 70 |
+
| int8 | 207 MB | **2.12%** | 84× | 0.24 GB |
|
| 71 |
+
|
| 72 |
+
Official Paraformer-large AISHELL-1 ≈ 1.95% CER (the ~0.17 pp gap is fp16 + the
|
| 73 |
+
fixed-shape decoder padding). int8 weight quantization is accuracy-neutral (CER
|
| 74 |
+
unchanged), ~half the size/memory.
|
| 75 |
|
| 76 |
Reproduces the published Paraformer-large AISHELL-1 number — confirming the
|
| 77 |
conversion (front-end + encoder + CIF + decoder) is faithful.
|