---
library_name: transformers
license: apache-2.0
datasets:
- mozilla-foundation/common_voice_17_0
language:
- bn
metrics:
- wer
base_model:
- banglabridge/base-bn-lora-adapter
model-index:
- name: Whisper Base Bn - BanglaBridge
  results:
  - task:
      name: Automatic Speech Recognition
      type: automatic-speech-recognition
    dataset:
      name: Common Voice 17.0
      type: mozilla-foundation/common_voice_17_0
      config: bn
      split: test
      args: 'config: bn, split: test'
    metrics:
    - name: Normalized WER
      type: wer
      value: 22.56397
---

# Whisper Base Bn - by BanglaBridge

This model is a fine-tuned version of [openai/whisper-base](https://huggingface.co/openai/whisper-base), trained on the Bengali (`bn`) subset of the Common Voice 17.0 dataset.

It was produced by merging this fine-tuned PEFT LoRA adapter into the base model: [banglabridge/base-bn-lora-adapter](https://huggingface.co/banglabridge/base-bn-lora-adapter).
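For readers who want to reproduce a merged checkpoint, the sketch below shows one common way to fold a LoRA adapter into its base model with PEFT's `merge_and_unload`. It is not necessarily the exact procedure used for this repository: the function name `merge_lora_adapter` and the default output directory are hypothetical, and the heavy imports are kept inside the function so the definition itself has no side effects.

```python
def merge_lora_adapter(
    base_id: str = "openai/whisper-base",
    adapter_id: str = "banglabridge/base-bn-lora-adapter",
    output_dir: str = "whisper-base-bn-merged",  # hypothetical local path
) -> str:
    """Merge a LoRA adapter into its base Whisper model and save the result."""
    # Imported lazily so that defining this function does not require the libraries.
    from peft import PeftModel
    from transformers import WhisperForConditionalGeneration, WhisperProcessor

    # Load the base model, then attach the adapter weights on top of it.
    base_model = WhisperForConditionalGeneration.from_pretrained(base_id)
    peft_model = PeftModel.from_pretrained(base_model, adapter_id)

    # Fold the low-rank updates into the base weights and drop the PEFT wrappers.
    merged_model = peft_model.merge_and_unload()

    # Save the merged weights together with the processor (tokenizer + feature extractor).
    merged_model.save_pretrained(output_dir)
    WhisperProcessor.from_pretrained(base_id).save_pretrained(output_dir)
    return output_dir
```

Calling `merge_lora_adapter()` downloads both repositories and writes a standalone checkpoint that can be loaded without PEFT installed.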

It achieves the following results on the test set:
- WER: 44.93734
- Normalized WER: 22.56397
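The gap between the two figures comes from text normalization applied before scoring. The sketch below illustrates the idea with a word-level edit distance and a simple normalizer that strips punctuation and symbols. The `normalize` function is an assumption made for illustration only; the normalizer actually used for the reported numbers is not stated here and may differ.

```python
import unicodedata


def normalize(text: str) -> str:
    """Illustrative normalizer (an assumption, not the one used for this model)."""
    # Keep letters, combining marks (Bengali vowel signs) and digits;
    # replace punctuation and symbols with spaces.
    cleaned = "".join(
        ch if unicodedata.category(ch)[0] in ("L", "M", "N") else " "
        for ch in text
    )
    # Lowercase (a no-op for Bengali) and collapse runs of whitespace.
    return " ".join(cleaned.lower().split())


def wer(reference: str, hypothesis: str) -> float:
    """Word error rate in percent: word-level edit distance / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # Standard dynamic-programming edit distance, one row at a time.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, substitution))
        prev = curr
    return 100.0 * prev[-1] / len(ref)


reference = "আমি ভাত খাই।"
hypothesis = "আমি ভাত খাই"
raw_wer = wer(reference, hypothesis)
normalized_wer = wer(normalize(reference), normalize(hypothesis))
```

In this toy pair the only difference is the trailing danda, so the raw score counts one substituted word out of three while the normalized score counts none.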

Refer to the adapter repository for more details on the fine-tuning: [banglabridge/base-bn-lora-adapter](https://huggingface.co/banglabridge/base-bn-lora-adapter).


### Framework versions

- Transformers 4.40.2
- PyTorch 2.6.0+cu124
- Tokenizers 0.19.1
- PEFT 0.10.0