Automatic Speech Recognition
Transformers
Safetensors
Khmer
English
troryongasr
custom_code
Kimang18 committed · Commit 1286163 (verified) · 1 Parent(s): 7fc219a

Update README.md

Files changed (1)
  1. README.md +5 -0
README.md CHANGED
@@ -108,6 +108,7 @@ The evaluation assesses two capabilities — language detection and transcriptio
 **Task:** Given audio input, detect the language.
 
 <div align="center">
+
 | Metric | Description |
 |--------|-------------|
 | **Precision** | Proportion of predicted languages that are correct |
@@ -121,6 +122,7 @@ The evaluation assesses two capabilities — language detection and transcriptio
 **Task:** Convert audio to text (transcription).
 
 <div align="center">
+
 | Metric | Description |
 |--------|-------------|
 | **Token Error Rate** | Proportion of incorrectly transcribed tokens |
@@ -138,6 +140,7 @@ The evaluation assesses two capabilities — language detection and transcriptio
 #### Language Detection Results
 
 <div align="center">
+
 | Dataset | Precision | Recall | Accuracy | F1-score |
 |---------|-----------|--------|----------|----------|
 | google/fleurs (Khmer) | 100% | 100% | 100% | 100% |
@@ -152,6 +155,7 @@ The evaluation assesses two capabilities — language detection and transcriptio
 #### Transcription Results
 
 <div align="center">
+
 | Metric | Combined (Khmer + English) | Khmer | English |
 |--------|---------------------------|-------|---------|
 | Token Error Rate | 29% | 56% | 19% |
@@ -254,6 +258,7 @@ Khmer datasets include [`DDD-Cambodia/khm-asr-cultural`](https://huggingface.co/
 Split `clean.100` of [`openslr/librispeech_asr`](https://huggingface.co/datasets/openslr/librispeech_asr) was used as English dataset.
 
 <div align="center">
+
 | Dataset | Language | Training examples | Validation examples | Description |
 | --------- | ---------- | ----------------- | ------------------- |- |
 | **openslr/openslr** | Khmer | 2906 | 0 | Multi-speaker TTS data for Khmer language (split `SLR42`) |