alexwengg commited on
Commit
5dd557b
·
verified ·
1 Parent(s): 01f0617

docs: fp16/int8 full benchmark + RTFx

Browse files
Files changed (1) hide show
  1. README.md +10 -3
README.md CHANGED
@@ -62,9 +62,16 @@ the encoder/decoder pad-masks from the **input tensor's seq dim** (so
62
 
63
  ## Benchmark — AISHELL-1 test (CoreML on ANE)
64
 
65
- | Metric | full-CoreML (ANE) | Official Paraformer-large |
66
- |--------|-------------------|---------------------------|
67
- | **CER** | **2.12%** (full test, 7,176 utts) | ~1.95% |
 
 
 
 
 
 
 
68
 
69
  Reproduces the published Paraformer-large AISHELL-1 number — confirming the
70
  conversion (front-end + encoder + CIF + decoder) is faithful.
 
62
 
63
  ## Benchmark — AISHELL-1 test (CoreML on ANE)
64
 
65
+ Full test set (7,176 utts), full-CoreML pipeline on M5 Pro ANE:
66
+
67
+ | Precision | size (enc+dec) | CER | median RTFx | peak RAM |
68
+ |-----------|----------------|-----|-------------|----------|
69
+ | fp16 (default) | 411 MB | **2.12%** | 85× | 0.38 GB |
70
+ | int8 | 207 MB | **2.12%** | 84× | 0.24 GB |
71
+
72
+ Official Paraformer-large AISHELL-1 ≈ 1.95% CER (the ~0.17 pp gap is fp16 + the
73
+ fixed-shape decoder padding). int8 weight quantization is accuracy-neutral (CER
74
+ unchanged), ~half the size/memory.
75
 
76
  Reproduces the published Paraformer-large AISHELL-1 number — confirming the
77
  conversion (front-end + encoder + CIF + decoder) is faithful.