File size: 1,552 Bytes
545cc1b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
---
license: open-mdw
language:
- lb
base_model:
- openai/whisper-medium
pipeline_tag: automatic-speech-recognition
---

# unilux/whisper-medium-v1-luxembourgish

## Model Card

### 🧠 Model Details
- **Model name:** whisper-medium-v1-luxembourgish  
- **Organization:** University of Luxembourg — Department of Humanities  
- **Project:** [Luxembourgish Automatic Speech Recognition (LuxASR)](https://luxasr.uni.lu/)  
- **Type:** Speech-to-Text (ASR)  
- **Language:** Luxembourgish (`lb`)  
- **Architecture:** Whisper (Medium)  
- **Model size:** ~764M parameters  
- **License:** [Open Model, Data & Weights (open-mdw)](https://www.openmdw.org)

This model is part of the **LuxASR** open model family for Luxembourgish speech recognition. Fine-tuned on Luxembourgish audio–text pairs (≈150+ hours).

The *tiny*, *base*, *small*, and *medium* models are open-sourced; the larger flagship LuxASR model, used in the webservice, the API and the iOS and Android apps, remains closed-source.

---

### 🚀 Intended Use
- Transcribe Luxembourgish speech into text.  
- Research and development of Luxembourgish ASR.  
- Accessibility and media transcription.

---

### ⚙️ Usage Example

```python
from transformers import pipeline

pipe = pipeline("automatic-speech-recognition", model="unilux/whisper-medium-v1-luxembourgish")
result = pipe("example.wav")
print(result["text"])
```
---

### 🧡 Acknowledgements
Developed by the **LuxASR** team, University of Luxembourg.  
See [luxasr.uni.lu](https://luxasr.uni.lu/) for project details.