AILAB-VNUHCM/vivos
Updated • 520 • 16
How to use Thienpkae/working with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="Thienpkae/working") # Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("Thienpkae/working")
model = AutoModelForCTC.from_pretrained("Thienpkae/working")# Load model directly
from transformers import AutoProcessor, AutoModelForCTC
processor = AutoProcessor.from_pretrained("Thienpkae/working")
model = AutoModelForCTC.from_pretrained("Thienpkae/working")This model is a fine-tuned version of facebook/wav2vec2-base on the vivos dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 6.3485 | 2.0 | 292 | 3.7183 | 1.0 |
| 3.4479 | 4.0 | 584 | 3.5977 | 1.0 |
| 2.948 | 6.0 | 876 | 1.7093 | 0.8420 |
| 1.2556 | 8.0 | 1168 | 1.0140 | 0.5846 |
| 0.9216 | 10.0 | 1460 | 0.8558 | 0.5142 |
| 0.7769 | 12.0 | 1752 | 0.7731 | 0.4643 |
| 0.6968 | 14.0 | 2044 | 0.7458 | 0.4394 |
| 0.6813 | 16.0 | 2336 | 0.7549 | 0.4385 |
| 0.5996 | 18.0 | 2628 | 0.7186 | 0.4128 |
| 0.572 | 20.0 | 2920 | 0.7102 | 0.4126 |
Base model
facebook/wav2vec2-base
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="Thienpkae/working")