|
|
--- |
|
|
library_name: transformers |
|
|
license: apache-2.0 |
|
|
datasets: |
|
|
- mozilla-foundation/common_voice_17_0 |
|
|
language: |
|
|
- bn |
|
|
metrics: |
|
|
- wer |
|
|
base_model: |
|
|
- banglabridge/base-bn-lora-adapter |
|
|
model-index: |
|
|
- name: Whisper Base Bn - BanglaBridge |
|
|
results: |
|
|
- task: |
|
|
name: Automatic Speech Recognition |
|
|
type: automatic-speech-recognition |
|
|
dataset: |
|
|
name: Common Voice 17.0 |
|
|
type: mozilla-foundation/common_voice_17_0 |
|
|
config: bn |
|
|
split: None |
|
|
args: 'config: bn, split: test' |
|
|
metrics: |
|
|
- name: Wer |
|
|
type: wer |
|
|
value: 22.56397 |
|
|
--- |
|
|
|
|
|
# Whisper Base Bn - by BanglaBridge |
|
|
|
|
|
This model is a fine-tuned version of [openai/whisper-base](https://huggingface.co/openai/whisper-base) on the Common Voice 17.0 dataset. |
|
|
|
|
|
It is the merged model from this fine-tuned PEFT LoRA adapter: [banglabridge/base-bn-lora-adapter](https://huggingface.co/banglabridge/base-bn-lora-adapter) |
|
|
|
|
|
It achieves the following results on the test set: |
|
|
- Wer: 44.93734 |
|
|
- Normalized Wer: 22.56397 |
|
|
|
|
|
Refer to the adapter repository for more details on the finetuning: [banglabridge/base-bn-lora-adapter](https://huggingface.co/banglabridge/base-bn-lora-adapter) |
|
|
|
|
|
|
|
|
### Framework versions |
|
|
|
|
|
- Transformers 4.40.2 |
|
|
- Pytorch 2.6.0+cu124 |
|
|
- Tokenizers 0.19.1 |
|
|
- Peft 0.10.0 |