legacy-datasets/common_voice
Updated • 1.5k • 144
How to use Siyam/SKYLy with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="Siyam/SKYLy") # Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("Siyam/SKYLy")
model = AutoModelForCTC.from_pretrained("Siyam/SKYLy")This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the common_voice dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 4.4215 | 4.26 | 400 | 1.6323 | 0.9857 |
| 0.5716 | 8.51 | 800 | 0.6679 | 0.5107 |
| 0.1721 | 12.77 | 1200 | 0.6935 | 0.4632 |
| 0.1063 | 17.02 | 1600 | 0.7533 | 0.4432 |
| 0.0785 | 21.28 | 2000 | 0.7208 | 0.4255 |
| 0.0608 | 25.53 | 2400 | 0.7481 | 0.4117 |
| 0.0493 | 29.79 | 2800 | 0.7645 | 0.4083 |