Dev372/Medical_STT_Dataset_1.1
Viewer • Updated • 7.96k • 66 • 2
How to use Dev372/Medical_tiny_en_1_1v with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("automatic-speech-recognition", model="Dev372/Medical_tiny_en_1_1v") # Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("Dev372/Medical_tiny_en_1_1v")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Dev372/Medical_tiny_en_1_1v")# Load model directly
from transformers import AutoProcessor, AutoModelForSpeechSeq2Seq
processor = AutoProcessor.from_pretrained("Dev372/Medical_tiny_en_1_1v")
model = AutoModelForSpeechSeq2Seq.from_pretrained("Dev372/Medical_tiny_en_1_1v")This model is a fine-tuned version of openai/whisper-tiny.en on the Medical dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 1.8857 | 0.1554 | 55 | 1.6694 | 13.1520 |
| 1.3264 | 0.3107 | 110 | 1.0577 | 11.8358 |
| 0.9159 | 0.4661 | 165 | 0.8809 | 10.3857 |
| 0.8292 | 0.6215 | 220 | 0.7654 | 9.8893 |
| 0.641 | 0.7768 | 275 | 0.6364 | 9.2557 |
| 0.5445 | 0.9322 | 330 | 0.4931 | 8.6417 |
| 0.4072 | 1.0876 | 385 | 0.3397 | 8.2759 |
| 0.2378 | 1.2429 | 440 | 0.2414 | 8.1322 |
| 0.2109 | 1.3983 | 495 | 0.2116 | 7.6684 |
| 0.1641 | 1.5537 | 550 | 0.1940 | 7.6423 |
| 0.1498 | 1.7090 | 605 | 0.1819 | 7.1198 |
| 0.1445 | 1.8644 | 660 | 0.1752 | 6.8095 |
| 0.1349 | 2.0198 | 715 | 0.1679 | 6.7181 |
| 0.1032 | 2.1751 | 770 | 0.1661 | 6.7344 |
| 0.0898 | 2.3305 | 825 | 0.1632 | 6.8291 |
| 0.1032 | 2.4859 | 880 | 0.1606 | 6.7278 |
| 0.0845 | 2.6412 | 935 | 0.1592 | 6.7083 |
| 0.0958 | 2.7966 | 990 | 0.1578 | 6.5743 |
| 0.097 | 2.9520 | 1045 | 0.1570 | 6.5515 |
| 0.0689 | 3.1073 | 1100 | 0.1566 | 6.5482 |
Base model
openai/whisper-tiny.en
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("automatic-speech-recognition", model="Dev372/Medical_tiny_en_1_1v")