	---
	language: he
	license: apache-2.0
	library_name: transformers
	tags:
	- whisper
	- audio
	- automatic-speech-recognition
	- hebrew
	datasets:
	- ivrit-ai/whisper-training
	base_model: openai/whisper-tiny
	pipeline_tag: automatic-speech-recognition
	---

	# whisper-tiny-he

	A Hebrew fine-tune of [Whisper Tiny](https://huggingface.co/openai/whisper-tiny) for automatic speech recognition (ASR).

	## Training

	- Base model: [openai/whisper-tiny](https://huggingface.co/openai/whisper-tiny)
	- Dataset: [ivrit-ai/whisper-training](https://huggingface.co/datasets/ivrit-ai/whisper-training) (~400 hours of Hebrew audio)
	- Method: Supervised fine-tuning with `Seq2SeqTrainer`
	- Steps: 5,000 (streaming, effective batch size 16)
	- Hardware: Apple M4 (MPS), fp32
	- Final eval WER: 0.659 (65.9% word error rate, measured on a 200-sample test split)
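
For readers unfamiliar with the metric, WER is the word-level edit distance (substitutions + deletions + insertions) divided by the number of reference words. The sketch below is the textbook definition for a single utterance, not the evaluation code used for this model; the function name `word_error_rate` is illustrative, and any text normalization applied before scoring here is unknown.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


# One substitution (b -> x) and one deletion (d) over four reference words.
score = word_error_rate("a b c d", "a x c")
```

A WER of 0.659 therefore means roughly two word errors for every three reference words, which is worth keeping in mind when choosing between this model and larger Whisper variants.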

	## Usage

	```python
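	from transformers import WhisperProcessor, WhisperForConditionalGeneration

	# Load the feature extractor + tokenizer and the fine-tuned weights from the Hub.
	processor = WhisperProcessor.from_pretrained("amitkot/whisper-tiny-he")
	model = WhisperForConditionalGeneration.from_pretrained("amitkot/whisper-tiny-he")

	# Force Hebrew transcription rather than language auto-detection or translation.
	model.generation_config.language = "he"
	model.generation_config.task = "transcribe"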
	```
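
The snippet above loads the model and sets the decoding language, but stops before inference. One way to go from an audio file to text is the `pipeline` helper in transformers, sketched below. The wrapper name `transcribe` is illustrative and not part of this repository, and decoding from a file path assumes `ffmpeg` is available on the system.

```python
def transcribe(audio_path: str, model_id: str = "amitkot/whisper-tiny-he") -> str:
    """Transcribe a Hebrew audio file and return the recognized text."""
    # Imported lazily so the heavy dependency loads only when transcription runs.
    from transformers import pipeline

    # Downloads the model from the Hub on first use.
    asr = pipeline("automatic-speech-recognition", model=model_id)

    # Pin the language and task at generation time, mirroring the
    # generation_config settings shown above.
    result = asr(audio_path, generate_kwargs={"language": "he", "task": "transcribe"})
    return result["text"]
```

Calling `transcribe("sample.wav")` with a path to a local recording (the file name is a placeholder) would return the transcript as a string. The pipeline resamples the input to the 16 kHz rate Whisper expects, so no manual preprocessing should be needed for common audio formats.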

	## Training pipeline

	Trained using [whisper-acft-pipeline](https://github.com/amitkot/whisper-acft-pipeline):

	```bash
	uv run python scripts/finetune.py --config configs/hebrew_tiny_finetune.yaml
	```
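
The contents of `configs/hebrew_tiny_finetune.yaml` are not reproduced in this card. As a rough orientation only, the hyperparameters listed under Training could map onto `Seq2SeqTrainingArguments` as sketched below. The split of the effective batch size into 4 x 4, the `output_dir` value, and the helper names are assumptions made for illustration, not values taken from the pipeline.

```python
# Assumed split: the card states only the effective batch size (16).
PER_DEVICE_BATCH = 4
GRAD_ACCUM_STEPS = 4
MAX_STEPS = 5000  # from the card; streaming datasets have no length, so steps are set explicitly


def effective_batch_size(per_device: int, grad_accum: int, devices: int = 1) -> int:
    """Number of samples contributing to each optimizer step."""
    return per_device * grad_accum * devices


def build_training_args(output_dir: str = "whisper-tiny-he-out"):
    """Build training arguments matching the card's stated settings (hypothetical helper)."""
    # Imported lazily so the arithmetic above can be used without transformers installed.
    from transformers import Seq2SeqTrainingArguments

    return Seq2SeqTrainingArguments(
        output_dir=output_dir,  # hypothetical path
        max_steps=MAX_STEPS,
        per_device_train_batch_size=PER_DEVICE_BATCH,
        gradient_accumulation_steps=GRAD_ACCUM_STEPS,
        fp16=False,  # the card reports fp32 training on MPS
        bf16=False,
        predict_with_generate=True,  # decode during evaluation so WER can be computed
    )


batch = effective_batch_size(PER_DEVICE_BATCH, GRAD_ACCUM_STEPS)
samples_seen = batch * MAX_STEPS
```

Under these assumptions the run processes 16 samples per optimizer step, or 80,000 samples over 5,000 steps.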

	## See also

	- [amitkot/whisper-tiny-he-acft](https://huggingface.co/amitkot/whisper-tiny-he-acft): ACFT-optimized version of this model for short audio (FUTO Keyboard)