Spaces:

Rahul2020
/

EAG_s9_sample

Sleeping

EAG_s9_sample / README.md

add eng only

1095508 4 months ago

1.25 kB

	---
	license: apache-2.0
	sdk: gradio
	emoji: 🚀
	colorFrom: red
	---
	---
	license: apache-2.0
	sdk: gradio
	emoji: 🚀
	colorFrom: red
	---
	# Whisper-Small Speech-to-English (Gradio)

	Drop these files into a Hugging Face Space (Gradio template):
	- `app.py`
	- `requirements.txt`

	This app uses `openai/whisper-small` in translate mode to convert spoken audio into English text (Whisper's `translate` task). The model runs CPU-only by default and is suitable for small/medium audio files.

	## Usage
	- Click the microphone recorder to record or upload an audio file.
	- Click Transcribe to get English text output (the app translates input speech into English).

	## Debug
	Set `DEBUG = True` in `app.py` to enable logging and save resampled WAVs (written to your system temp directory) for inspection.

	## Run locally
	```powershell
	# Windows PowerShell
	python -m venv venv_hf
	venv_hf\Scripts\Activate.ps1
	pip install -r requirements.txt
	python app.py
	```

	Open the Gradio URL shown in the console (usually http://0.0.0.0:7860).

	## Notes
	- The `openai/whisper-small` model runs on CPU and may take time for longer files.
	- For other target languages or lower latency consider using the Hugging Face Inference API or a separate text translation pipeline.