Update README.md
README.md
CHANGED
|
@@ -1,5 +1,12 @@
|
|
| 1 |
---
|
| 2 |
-
license:
| 3 |
---
|
| 4 |
|
| 5 |
# Stremma ELM(extendable language model)
|
|
@@ -8,11 +15,10 @@ The initial model that was used for https://stremma.ai service, serves as a base
|
|
| 8 |
|
| 9 |
## Details
|
| 10 |
|
| 11 |
-
|
| 12 |
|
| 13 |
The models were trained on either English or Danish data. The English and Danish models were trained on the task of speech recognition. The multilingual extensions were trained on both speech recognition and speech translation.
|
| 14 |
|
| 15 |
## Usage
|
| 16 |
|
| 17 |
-
TBD
|
| 18 |
-
|
|
|
|
| 1 |
---
|
| 2 |
+
license: mit
|
| 3 |
+
language:
|
| 4 |
+
- en
|
| 5 |
+
- de
|
| 6 |
+
- da
|
| 7 |
+
- 'no'
|
| 8 |
+
- sv
|
| 9 |
+
pipeline_tag: automatic-speech-recognition
|
| 10 |
---
|
| 11 |
|
| 12 |
# Stremma ELM(extendable language model)
|
|
|
|
| 15 |
|
| 16 |
## Details
|
| 17 |
|
| 18 |
+
Stremma ELM is a transformer-based encoder-decoder (sequence-to-sequence) model. It was trained on 0.1M hours of labeled audio and 0.4M hours of pseudo-labeled audio, collected through the Stremma SaaS during manual transcription work.
|
| 19 |
|
| 20 |
The models were trained on either English or Danish data. The English and Danish models were trained on the task of speech recognition. The multilingual extensions were trained on both speech recognition and speech translation.
|
| 21 |
|
| 22 |
## Usage
|
| 23 |
|
| 24 |
+
TBD