northtech commited on
Commit
9c4a9a0
·
verified ·
1 Parent(s): f706f91

Add model card

Browse files
Files changed (1) hide show
  1. README.md +74 -0
README.md ADDED
@@ -0,0 +1,74 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: tr
3
+ license: mit
4
+ base_model: openai/whisper-large-v3
5
+ tags:
6
+ - automatic-speech-recognition
7
+ - whisper
8
+ - finetuned
9
+ - lora
10
+ - turkish
11
+ ---
12
+
13
+ # northtechai/northstt-stageb-300
14
+
15
+ This is a fine-tuned version of openai/whisper-large-v3 using LoRA (Low-Rank Adaptation) for Turkish speech recognition.
16
+
17
+ ## Model Details
18
+
19
+ - **Base Model**: openai/whisper-large-v3
20
+ - **Fine-tuning Method**: LoRA (Low-Rank Adaptation)
21
+ - **Language**: Turkish
22
+ - **Task**: Automatic Speech Recognition
23
+
24
+ ## Usage
25
+
26
+ ```python
27
+ from transformers import pipeline
28
+
29
+ # Load the model
30
+ asr = pipeline(
31
+ "automatic-speech-recognition",
32
+ model="northtechai/northstt-stageb-300",
33
+ generate_kwargs={
34
+ "condition_on_prev_tokens": True,
35
+ "no_repeat_ngram_size": 2,
36
+ "temperature": 0.0,
37
+ "do_sample": False,
38
+ "language": "tr",
39
+ }
40
+ )
41
+
42
+ # Transcribe audio
43
+ result = asr("path/to/audio.wav")
44
+ print(result["text"])
45
+ ```
46
+
47
+ ## Training Data
48
+
49
+ Training data information not provided.
50
+
51
+ ## Performance
52
+
53
+ This model has been fine-tuned for improved performance on Turkish speech recognition tasks.
54
+
55
+ ## Limitations
56
+
57
+ - Optimized for Turkish language
58
+ - Performance may vary on other languages
59
+ - Requires appropriate audio preprocessing (16kHz sample rate)
60
+
61
+ ## Citation
62
+
63
+ If you use this model, please cite the original Whisper paper:
64
+
65
+ ```bibtex
66
+ @misc{radford2022robust,
67
+ title={Robust Speech Recognition via Large-Scale Weak Supervision},
68
+ author={Radford, Alec and Kim, Jong Wook and Xu, Tao and Brockman, Greg and McLeavey, Christine and Sutskever, Ilya},
69
+ year={2022},
70
+ eprint={2212.04356},
71
+ archivePrefix={arXiv},
72
+ primaryClass={cs.CL}
73
+ }
74
+ ```