SPEAK-ASR/whisper-si-exp-1
Updated
•
94
Note Used dataset: 1/3 OpenSLR Dataset (~60h) Lora Projection Layers to: Query (Q) and Value (V) layers
Note Use `pranay-j/whisper-small-hindi` as base model
Note Used dataset: Full OpenSLR Dataset (~180h)
Note Used dataset: OpenSLR + YouTube Dataset (~185h)
Note Used WER to choose the best model instead of loss Used dataset: OpenSLR + YouTube Dataset (~185h) Changed Lora Projection Layers to: Q, V, Key (K), and Output (O) layers
Note Revert Lora Projection Layers to: Q, V layers Change num_epoch from 5 to 15