| --- |
| license: open-mdw |
| language: |
| - lb |
| base_model: |
| - openai/whisper-medium |
| pipeline_tag: automatic-speech-recognition |
| --- |
| |
| # unilux/whisper-medium-v1-luxembourgish |
|
|
| ## Model Card |
|
|
| ### 🧠 Model Details |
| - **Model name:** whisper-medium-v1-luxembourgish |
| - **Organization:** University of Luxembourg — Department of Humanities |
| - **Project:** [Luxembourgish Automatic Speech Recognition (LuxASR)](https://luxasr.uni.lu/) |
| - **Type:** Speech-to-Text (ASR) |
| - **Language:** Luxembourgish (`lb`) |
| - **Architecture:** Whisper (Medium) |
| - **Model size:** ~764M parameters |
| - **License:** [Open Model, Data & Weights (open-mdw)](https://www.openmdw.org) |
|
|
| This model is part of the **LuxASR** open model family for Luxembourgish speech recognition. Fine-tuned on Luxembourgish audio–text pairs (≈150+ hours). |
|
|
| The *tiny*, *base*, *small*, and *medium* models are open-sourced; the larger flagship LuxASR model, used in the webservice, the API and the iOS and Android apps, remains closed-source. |
|
|
| --- |
|
|
| ### 🚀 Intended Use |
| - Transcribe Luxembourgish speech into text. |
| - Research and development of Luxembourgish ASR. |
| - Accessibility and media transcription. |
|
|
| --- |
|
|
| ### ⚙️ Usage Example |
|
|
| ```python |
| from transformers import pipeline |
| |
| pipe = pipeline("automatic-speech-recognition", model="unilux/whisper-medium-v1-luxembourgish") |
| result = pipe("example.wav") |
| print(result["text"]) |
| ``` |
| --- |
|
|
| ### 🧡 Acknowledgements |
| Developed by the **LuxASR** team, University of Luxembourg. |
| See [luxasr.uni.lu](https://luxasr.uni.lu/) for project details. |
|
|