sha1779
/

Faster_BengaliRegionalASR

Automatic Speech Recognition

	---
	license: apache-2.0
	language:
	- bn
	base_model:
	- sha1779/BengaliRegionalASR
	pipeline_tag: automatic-speech-recognition
	---

	This is the CTranslate2 conversion of sha1779/BengaliRegionalASR. It runs inference faster than the base model.

	## Requirements
	```bash
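	# The converter below also needs transformers and torch to load the base model
	pip install ctranslate2 "transformers[torch]"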
	```
	## Converting the base model to CTranslate2 format
	```bash
	# In a Jupyter/Colab cell, prefix the command with "!"
	ct2-transformers-converter --model sha1779/BengaliRegionalASR --output_dir sha1779/Faster_BengaliRegionalASR --copy_files tokenizer.json preprocessor_config.json --quantization float16
	```
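The same conversion can also be scripted from Python. The following is a minimal sketch, assuming the `TransformersConverter` class from `ctranslate2.converters`; the helper names `convert_to_ct2` and `missing_files`, the default local output directory, and the list of files checked afterwards are illustrative assumptions, not part of the original card.

```python
from pathlib import Path

# Files the converter command above copies next to the converted weights.
COPY_FILES = ["tokenizer.json", "preprocessor_config.json"]


def convert_to_ct2(
    model_id="sha1779/BengaliRegionalASR",
    output_dir="Faster_BengaliRegionalASR",  # hypothetical local directory
    quantization="float16",
):
    """Convert a Transformers Whisper checkpoint to CTranslate2 format."""
    # Imported lazily so this module can be imported without ctranslate2 installed.
    from ctranslate2.converters import TransformersConverter

    converter = TransformersConverter(model_id, copy_files=COPY_FILES)
    # Returns the output directory; raises if it already exists (force=False).
    return converter.convert(output_dir, quantization=quantization, force=False)


def missing_files(output_dir, expected=("model.bin", *COPY_FILES)):
    """Return the expected files that are absent from a converted model directory.

    The expected list is an assumption: the converted weights (model.bin)
    plus the files copied from the base repository.
    """
    out = Path(output_dir)
    return [name for name in expected if not (out / name).is_file()]
```

Calling `convert_to_ct2()` downloads the base model and writes the converted files; `missing_files("Faster_BengaliRegionalASR")` should then return an empty list if the conversion completed.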

	## Run the model
	```bash
	pip install faster-whisper
	```

	```python
	from faster_whisper import WhisperModel

	model_size = "sha1779/Faster_BengaliRegionalASR"

	model = WhisperModel(model_size, device="cuda", compute_type="float16")
	# language="bn" (Bengali) matches the language this model was fine-tuned on
	segments, info = model.transcribe("audio.mp3", beam_size=5, language="bn", condition_on_previous_text=False)

	# segments is a generator; transcription runs as it is iterated
	for segment in segments:
	    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
	```
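The snippet above assumes a CUDA GPU. For machines without one, a minimal sketch of a CPU fallback might look like the following. The helper names (`pick_runtime`, `format_segment`, `transcribe`) are hypothetical, the `int8` compute type on CPU is one reasonable choice rather than the author's recommendation, and `language="bn"` assumes the audio is Bengali, in line with the card's metadata.

```python
def pick_runtime(cuda_available: bool) -> tuple:
    """Map GPU availability to a (device, compute_type) pair."""
    # float16 on GPU mirrors the card's example; int8 on CPU is an assumption.
    return ("cuda", "float16") if cuda_available else ("cpu", "int8")


def format_segment(start: float, end: float, text: str) -> str:
    """Format one segment the same way the card's print statement does."""
    return "[%.2fs -> %.2fs] %s" % (start, end, text.strip())


def transcribe(audio_path, model_id="sha1779/Faster_BengaliRegionalASR", language="bn"):
    """Transcribe a file and return one formatted line per segment."""
    # Imported lazily so the helpers above work without these packages installed.
    import ctranslate2
    from faster_whisper import WhisperModel

    device, compute_type = pick_runtime(ctranslate2.get_cuda_device_count() > 0)
    model = WhisperModel(model_id, device=device, compute_type=compute_type)
    segments, _info = model.transcribe(
        audio_path,
        beam_size=5,
        language=language,
        condition_on_previous_text=False,
    )
    # segments is a generator, so the actual decoding happens in this loop.
    return [format_segment(s.start, s.end, s.text) for s in segments]
```

For example, `print("\n".join(transcribe("audio.mp3")))` prints the transcript with timestamps on either kind of machine.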