Commit 9603413
Parent(s): 52867a6

Update README.md

README.md CHANGED
@@ -32,7 +32,11 @@ model-index:
 
 # Sharif-wav2vec2
 
-This is the fine-tuned version of Sharif Wav2vec2 for Farsi. The base model was fine-tuned on 108 hours of Commonvoice's Farsi samples with a sampling rate equal to 16kHz. Afterward, we trained a 5gram using [kenlm](https://github.com/kpu/kenlm) toolkit and used it in the processor which increased our accuracy on online ASR.
+This is the fine-tuned version of Sharif Wav2vec2 for Farsi. The base model was fine-tuned on 108 hours of Commonvoice's Farsi samples with a sampling rate equal to 16kHz. Afterward, we trained a 5gram using [kenlm](https://github.com/kpu/kenlm) toolkit and used it in the processor which increased our accuracy on online ASR.
+
+## Usage
+
+When using the model make sure that your speech input is sampled at 16Khz. Prior to the usage, you may need to install the below dependencies:
 
 ```shell
 pip install pyctcdecode
@@ -74,11 +78,18 @@ print(prediction[0])
 
 *Result (WER)*:
 
-
+| clean | other |
 |---|---|
 | 3.4 | 8.6 |
 
 
+## Evaluation
+
+For the evaluation use the code below:
+```python
+?
+```
+
 ## Citation
 If you want to cite this model you can use this:
 
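The Evaluation section added in this commit leaves its code block as a placeholder (`?`). As a rough illustration only, and not the authors' evaluation script, the sketch below shows one way a corpus-level word error rate (WER) could be computed from reference transcripts and model predictions. The function name `word_error_rate` is hypothetical, and the sketch assumes transcripts are plain strings that can be split on whitespace. It returns a fraction, whereas the table above presumably reports WER as a percentage.

```python
def word_error_rate(references, hypotheses):
    """Corpus-level WER: total word edits divided by total reference words.

    Hypothetical helper for illustration; not part of the model repository.
    """
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")

    total_edits = 0
    total_words = 0
    for ref, hyp in zip(references, hypotheses):
        ref_words, hyp_words = ref.split(), hyp.split()

        # Word-level Levenshtein distance, keeping only one row at a time.
        prev = list(range(len(hyp_words) + 1))
        for i, r in enumerate(ref_words, start=1):
            curr = [i]
            for j, h in enumerate(hyp_words, start=1):
                cost = 0 if r == h else 1
                curr.append(min(prev[j] + 1,          # deletion
                                curr[j - 1] + 1,      # insertion
                                prev[j - 1] + cost))  # substitution or match
            prev = curr

        total_edits += prev[-1]
        total_words += len(ref_words)

    if total_words == 0:
        raise ValueError("references contain no words")
    return total_edits / total_words


# One substitution and one deletion against four reference words.
print(word_error_rate(["a b c d"], ["a x c"]))  # 0.5
```

In practice the `hypotheses` list would hold the model's decoded predictions for each evaluation utterance, and any text normalization applied to the references should be applied to the predictions as well.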