Update README.md
README.md CHANGED
@@ -29,6 +29,25 @@ model-index:
 value: 12.33
 name: WER
 
+
+## Usage
+
+To transcribe a single audio file with this model, the following code snippet can be used:
+
+```python
+>>> import torch
+>>> from transformers import pipeline
+
+>>> # path to the audio file to be transcribed
+>>> audio = "/path/to/audio.format"
+>>> device = "cuda:0" if torch.cuda.is_available() else "cpu"
+
+>>> transcribe = pipeline(task="automatic-speech-recognition", model="mananvh/LLM_GUJARATI", chunk_length_s=30, device=device)
+>>> transcribe.model.config.forced_decoder_ids = transcribe.tokenizer.get_decoder_prompt_ids(language="gu", task="transcribe")
+
+>>> print("Transcription: ", transcribe(audio)["text"])
+```
+
 ## Training and evaluation data
 
 Training Data:
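The model-index metadata in the hunk above reports a WER of 12.33. As a minimal, self-contained sketch of what that metric means, the following computes word error rate via word-level Levenshtein distance; the sample strings are hypothetical and not taken from the model's actual evaluation set.

```python
# Illustrative sketch of how a word error rate (WER) like the 12.33 reported
# in the model-index is computed. The reference/hypothesis pair below is
# hypothetical, chosen only to demonstrate the formula.

def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count,
    computed with word-level Levenshtein distance, returned as a percentage."""
    ref, hyp = reference.split(), hypothesis.split()
    # dp[i][j] = minimum edits needed to turn ref[:i] into hyp[:j]
    dp = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        dp[i][0] = i  # delete all i reference words
    for j in range(len(hyp) + 1):
        dp[0][j] = j  # insert all j hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            dp[i][j] = min(
                dp[i - 1][j] + 1,         # deletion
                dp[i][j - 1] + 1,         # insertion
                dp[i - 1][j - 1] + cost,  # substitution or match
            )
    return 100.0 * dp[len(ref)][len(hyp)] / len(ref)

# One substitution ("world" -> "word") and one deletion ("a") against a
# 6-word reference: 2 / 6 = 33.33% WER
print(round(word_error_rate("hello world this is a test",
                            "hello word this is test"), 2))
```

In practice a library such as `jiwer` or Hugging Face `evaluate` is used, typically with text normalization applied before scoring.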