| # Endpointing Model Benchmark Report | |
| **Model:** `/data/smart-turn-v3.0.onnx` | |
| **Generated:** 2025-12-03 16:04:09 UTC | |
| ## Accuracy Results | |
| **Total Samples:** 31,473 | |
| **Unique Languages:** ๐ธ๐ฆ Arabic, ๐ง๐ฉ Bengali, ๐ฉ๐ฐ Danish, ๐ฉ๐ช German, ๐ฌ๐ง ๐บ๐ธ English, ๐ซ๐ฎ Finnish, ๐ซ๐ท French, ๐ฎ๐ณ Hindi, ๐ฎ๐ฉ Indonesian, ๐ฎ๐น Italian, ๐ฏ๐ต Japanese, ๐ฐ๐ท Korean, ๐ฎ๐ณ Marathi, ๐ณ๐ฑ Dutch, ๐ณ๐ด Norwegian, ๐ต๐ฑ Polish, ๐ต๐น Portuguese, ๐ท๐บ Russian, ๐ช๐ธ Spanish, ๐น๐ท Turkish, ๐บ๐ฆ Ukrainian, ๐ป๐ณ Vietnamese, ๐จ๐ณ Chinese | |
| **Unique Datasets:** chirp3_1, chirp3_2, human_5, human_convcollector_1, liva_1, midcentury_1, mundo_1, orpheus_endfiller_1, orpheus_grammar_1, orpheus_midfiller_1, rime_2 | |
| ### Overall Performance | |
| | Metric | Sample Count | Accuracy (%) | False Positives (%) | False Negatives (%) | | |
| |--------|--------------|--------------|---------------------|---------------------| | |
| | Overall | 31,473 | 91.60 | 4.68 | 3.72 | | |
| ### Performance by Language | |
| | Language | Sample Count | Accuracy (%) | False Positives (%) | False Negatives (%) | | |
| |----------|--------------|--------------|---------------------|---------------------| | |
| | ๐น๐ท Turkish | 966 | 97.10 | 1.66 | 1.24 | | |
| | ๐ฏ๐ต Japanese | 834 | 96.88 | 1.92 | 1.20 | | |
| | ๐ฐ๐ท Korean | 890 | 96.74 | 1.12 | 2.13 | | |
| | ๐ฉ๐ช German | 1,322 | 96.22 | 2.42 | 1.36 | | |
| | ๐ซ๐ท French | 1,253 | 96.17 | 1.52 | 2.31 | | |
| | ๐ณ๐ฑ Dutch | 1,401 | 96.15 | 2.00 | 1.86 | | |
| | ๐ต๐น Portuguese | 1,398 | 95.42 | 2.79 | 1.79 | | |
| | ๐ฎ๐น Italian | 782 | 94.88 | 3.07 | 2.05 | | |
| | ๐ซ๐ฎ Finnish | 1,010 | 94.85 | 3.17 | 1.98 | | |
| | ๐ฎ๐ฉ Indonesian | 971 | 94.54 | 4.12 | 1.34 | | |
| | ๐บ๐ฆ Ukrainian | 929 | 94.51 | 2.80 | 2.69 | | |
| | ๐ต๐ฑ Polish | 976 | 94.47 | 2.87 | 2.66 | | |
| | ๐ณ๐ด Norwegian | 1,014 | 93.98 | 3.55 | 2.47 | | |
| | ๐ท๐บ Russian | 1,470 | 93.54 | 3.33 | 3.13 | | |
| | ๐ฎ๐ณ Hindi | 1,295 | 93.36 | 4.40 | 2.24 | | |
| | ๐ฉ๐ฐ Danish | 779 | 93.07 | 4.88 | 2.05 | | |
| | ๐ธ๐ฆ Arabic | 947 | 88.60 | 6.97 | 4.44 | | |
| | ๐จ๐ณ Chinese | 945 | 88.57 | 4.76 | 6.67 | | |
| | ๐ฌ๐ง ๐บ๐ธ English | 7,722 | 88.31 | 6.00 | 5.70 | | |
| | ๐ฎ๐ณ Marathi | 774 | 87.47 | 8.53 | 4.01 | | |
| | ๐ช๐ธ Spanish | 1,791 | 86.71 | 4.69 | 8.60 | | |
| | ๐ง๐ฉ Bengali | 1,000 | 84.10 | 10.90 | 5.00 | | |
| | ๐ป๐ณ Vietnamese | 1,004 | 81.57 | 14.94 | 3.49 | | |
| ### Performance by Dataset | |
| | Dataset | Sample Count | Accuracy (%) | False Positives (%) | False Negatives (%) | | |
| |---------|--------------|--------------|---------------------|---------------------| | |
| | rime_2 | 396 | 99.75 | 0.00 | 0.25 | | |
| | human_5 | 402 | 96.27 | 1.00 | 2.74 | | |
| | chirp3_1 | 16,300 | 94.53 | 2.93 | 2.53 | | |
| | orpheus_endfiller_1 | 182 | 94.51 | 0.00 | 5.49 | | |
| | orpheus_grammar_1 | 163 | 92.64 | 3.68 | 3.68 | | |
| | orpheus_midfiller_1 | 140 | 91.43 | 3.57 | 5.00 | | |
| | human_convcollector_1 | 90 | 91.11 | 3.33 | 5.56 | | |
| | chirp3_2 | 8,428 | 90.27 | 6.68 | 3.05 | | |
| | midcentury_1 | 1,044 | 85.44 | 11.78 | 2.78 | | |
| | liva_1 | 3,832 | 84.68 | 6.92 | 8.40 | | |
| | mundo_1 | 496 | 72.78 | 5.24 | 21.98 | | |