Instructions to use KrorngAI/TrorYongASR-tiny with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use KrorngAI/TrorYongASR-tiny with Transformers:
    # Use a pipeline as a high-level helper
    from transformers import pipeline

    pipe = pipeline("automatic-speech-recognition", model="KrorngAI/TrorYongASR-tiny", trust_remote_code=True)

    # Load model directly
    from transformers import AutoModel

    model = AutoModel.from_pretrained("KrorngAI/TrorYongASR-tiny", trust_remote_code=True, dtype="auto")
- Notebooks
- Google Colab
- Kaggle
Update README.md
README.md
CHANGED
@@ -108,6 +108,7 @@ The evaluation assesses two capabilities — language detection and transcriptio
 **Task:** Given audio input, detect the language.
 
 <div align="center">
+
 | Metric | Description |
 |--------|-------------|
 | **Precision** | Proportion of predicted languages that are correct |
@@ -121,6 +122,7 @@ The evaluation assesses two capabilities — language detection and transcriptio
 **Task:** Convert audio to text (transcription).
 
 <div align="center">
+
 | Metric | Description |
 |--------|-------------|
 | **Token Error Rate** | Proportion of incorrectly transcribed tokens |
@@ -138,6 +140,7 @@ The evaluation assesses two capabilities — language detection and transcriptio
 #### Language Detection Results
 
 <div align="center">
+
 | Dataset | Precision | Recall | Accuracy | F1-score |
 |---------|-----------|--------|----------|----------|
 | google/fleurs (Khmer) | 100% | 100% | 100% | 100% |
@@ -152,6 +155,7 @@ The evaluation assesses two capabilities — language detection and transcriptio
 #### Transcription Results
 
 <div align="center">
+
 | Metric | Combined (Khmer + English) | Khmer | English |
 |--------|---------------------------|-------|---------|
 | Token Error Rate | 29% | 56% | 19% |
@@ -254,6 +258,7 @@ Khmer datasets include [`DDD-Cambodia/khm-asr-cultural`](https://huggingface.co/
 Split `clean.100` of [`openslr/librispeech_asr`](https://huggingface.co/datasets/openslr/librispeech_asr) was used as English dataset.
 
 <div align="center">
+
 | Dataset | Language | Training examples | Validation examples | Description |
 | --------- | ---------- | ----------------- | ------------------- |- |
 | **openslr/openslr** | Khmer | 2906 | 0 | Multi-speaker TTS data for Khmer language (split `SLR42`) |
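
The metric tables touched by this change describe Precision as the proportion of predicted languages that are correct and Token Error Rate as the proportion of incorrectly transcribed tokens. The excerpts shown here do not say how those scores were computed, so the following is only a minimal sketch under stated assumptions: Token Error Rate is taken to be the edit distance between token sequences divided by the reference length, the tokenization is left to the caller, and the helper names `token_error_rate` and `detection_scores` are hypothetical, not part of the model's code.

```python
def token_error_rate(reference, hypothesis):
    """Edit distance between two token sequences divided by the reference length.

    Assumption: this is one common reading of "proportion of incorrectly
    transcribed tokens"; the README's actual scoring code is not shown.
    """
    ref, hyp = list(reference), list(hypothesis)
    if not ref:
        raise ValueError("reference must contain at least one token")
    # prev[j] is the edit distance between the reference prefix processed
    # so far and the first j hypothesis tokens.
    prev = list(range(len(hyp) + 1))
    for i, ref_token in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_token in enumerate(hyp, start=1):
            cost = 0 if ref_token == hyp_token else 1
            curr.append(min(prev[j] + 1,          # reference token dropped
                            curr[j - 1] + 1,      # extra hypothesis token
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def detection_scores(true_labels, predicted_labels, target):
    """Precision, recall and F1 for one target language, plus overall accuracy."""
    pairs = list(zip(true_labels, predicted_labels))
    if not pairs:
        raise ValueError("at least one labelled example is required")
    tp = sum(1 for t, p in pairs if t == target and p == target)
    fp = sum(1 for t, p in pairs if t != target and p == target)
    fn = sum(1 for t, p in pairs if t == target and p != target)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    return {"precision": precision, "recall": recall, "f1": f1, "accuracy": accuracy}
```

For example, `token_error_rate("a b c d".split(), "a x c".split())` counts one substitution and one missing token against four reference tokens. Whitespace splitting is used here purely for illustration; Khmer text is not whitespace-delimited, so a real evaluation would need whatever tokenizer the model uses.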