compulsi0n/heart-failure-audio
Viewer • Updated • 1.77k • 31
How to use compulsi0n/whisper-hf-rslora with PEFT:
Task type is invalid.
This model is a fine-tuned version of openai/whisper-large-v3-turbo on compulsion/heart-failure-audio. It achieves the following results on the evaluation set:
A PEFT rank-stablized rank-stabilized LoRA adapter of whisper-large-v3-turbo finetuned on heart failure audio data that is conversational, longitudinal, and focused on chronic illness management and care coordination in a community-based healthcare setting.
To be used in ASR tasks specifically in the heart failure domain.
Normalized for PHI redactions and throught Transformer's BasicTextNormalizer.
| Model | Raw WER (%) | Normalised WER (%) |
|---|---|---|
| Baseline | 35.00 | 26.71 |
| rsLoRA | 26.18 | 20.71 |
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Wer |
|---|---|---|---|---|
| 2.3062 | 1.0 | 92 | 1.1343 | 0.2388 |
| 1.0317 | 2.0 | 184 | 0.7145 | 0.2620 |
| 0.6833 | 3.0 | 276 | 0.6606 | 0.2105 |
| 0.5934 | 4.0 | 368 | 0.6292 | 0.2122 |
| 0.5104 | 5.0 | 460 | 0.6347 | 0.2521 |
| 0.4392 | 6.0 | 552 | 0.6444 | 0.2729 |
| 0.3653 | 7.0 | 644 | 0.6701 | 0.2198 |
| 0.3178 | 8.0 | 736 | 0.6919 | 0.2424 |