	---
	license: open-mdw
	language:
	- lb
	base_model:
	- openai/whisper-medium
	pipeline_tag: automatic-speech-recognition
	---

	# unilux/whisper-medium-v1-luxembourgish

	## Model Card

	### 🧠 Model Details
	- Model name: whisper-medium-v1-luxembourgish
	- Organization: University of Luxembourg — Department of Humanities
	- Project: [Luxembourgish Automatic Speech Recognition (LuxASR)](https://luxasr.uni.lu/)
	- Type: Speech-to-Text (ASR)
	- Language: Luxembourgish (`lb`)
	- Architecture: Whisper (Medium)
	- Model size: ~764M parameters
	- License: [Open Model, Data & Weights (open-mdw)](https://www.openmdw.org)

	This model is part of the LuxASR open model family for Luxembourgish speech recognition. It is fine-tuned from `openai/whisper-medium` on Luxembourgish audio–text pairs (roughly 150 hours or more).

	The tiny, base, small, and medium models are open source; the larger flagship LuxASR model, which powers the web service, the API, and the iOS and Android apps, remains closed source.

	---

	### 🚀 Intended Use
	- Transcribe Luxembourgish speech into text.
	- Research and development of Luxembourgish ASR.
	- Accessibility and media transcription.
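
For the accessibility and media transcription use case, timestamped output can be turned into subtitles. The following is a minimal sketch, not part of the LuxASR project: it assumes the chunk format that the `transformers` ASR pipeline returns with `return_timestamps=True` (a list of dicts with `timestamp` and `text` keys). The helper names `format_srt_time` and `chunks_to_srt` are hypothetical, as is the choice to reuse the start time when a chunk has no end time.

```python
def format_srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def chunks_to_srt(chunks: list) -> str:
    """Convert pipeline-style chunks into SRT subtitle text."""
    blocks = []
    index = 1
    for chunk in chunks:
        start, end = chunk["timestamp"]
        text = chunk["text"].strip()
        if not text:
            continue  # skip empty segments
        if end is None:
            # Assumption: an open-ended last chunk reuses its start time.
            end = start
        blocks.append(
            f"{index}\n{format_srt_time(start)} --> {format_srt_time(end)}\n{text}\n"
        )
        index += 1
    return "\n".join(blocks)


# Possible usage with the pipeline from the usage example (not run here):
# result = pipe("example.wav", return_timestamps=True)
# print(chunks_to_srt(result["chunks"]))
```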

	---

	### ⚙️ Usage Example

	```python
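	from transformers import pipeline

	# Load the fine-tuned Whisper checkpoint from the Hugging Face Hub.
	pipe = pipeline("automatic-speech-recognition", model="unilux/whisper-medium-v1-luxembourgish")

	# Transcribe a local audio file; the result is a dict with a "text" key.
	result = pipe("example.wav")
	print(result["text"])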
	```
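
Whisper processes audio in 30-second windows, so longer recordings are usually transcribed in chunks. The sketch below shows one way to do that with the `chunk_length_s` option of the `transformers` pipeline. The helper names (`build_pipeline_kwargs`, `build_generate_kwargs`, `transcribe`) are hypothetical. Forcing the language through `generate_kwargs` is optional and assumes the checkpoint ships a multilingual generation config, so it is off by default.

```python
from typing import Any, Dict, Optional

MODEL_ID = "unilux/whisper-medium-v1-luxembourgish"


def build_pipeline_kwargs(chunk_length_s: int = 30, device: str = "cpu") -> Dict[str, Any]:
    """Collect the arguments passed to transformers.pipeline()."""
    return {
        "task": "automatic-speech-recognition",
        "model": MODEL_ID,
        "chunk_length_s": chunk_length_s,  # split long audio into windows
        "device": device,  # e.g. "cpu" or "cuda:0"
    }


def build_generate_kwargs(language: Optional[str] = None) -> Dict[str, str]:
    """Optionally force the decoding language; empty dict means no forcing."""
    if language is None:
        return {}
    return {"task": "transcribe", "language": language}


def transcribe(path: str, language: Optional[str] = None, device: str = "cpu") -> str:
    """Transcribe an audio file of arbitrary length and return the text."""
    # Imported lazily so the helpers above work without transformers installed.
    from transformers import pipeline

    pipe = pipeline(**build_pipeline_kwargs(device=device))
    result = pipe(path, generate_kwargs=build_generate_kwargs(language))
    return result["text"]


# Possible usage (downloads the model on first call, not run here):
# print(transcribe("long_recording.wav"))
```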
	---

	### 🧡 Acknowledgements
	Developed by the LuxASR team, University of Luxembourg.
	See [luxasr.uni.lu](https://luxasr.uni.lu/) for project details.