Update README.md
#107
by Sci-fi-vy - opened
README.md
CHANGED
|
@@ -1,5 +1,5 @@
|
|
| 1 |
---
|
| 2 |
-
language:
|
| 3 |
- en
|
| 4 |
- zh
|
| 5 |
- de
|
|
@@ -29,7 +29,7 @@ language:
|
|
| 29 |
- da
|
| 30 |
- hu
|
| 31 |
- ta
|
| 32 |
-
- no
|
| 33 |
- th
|
| 34 |
- ur
|
| 35 |
- hr
|
|
@@ -99,10 +99,14 @@ language:
|
|
| 99 |
- ba
|
| 100 |
- jw
|
| 101 |
- su
|
|
|
|
| 102 |
tags:
|
| 103 |
- audio
|
| 104 |
- automatic-speech-recognition
|
| 105 |
- hf-asr-leaderboard
|
|
|
|
|
|
|
|
|
|
| 106 |
widget:
|
| 107 |
- example_title: Librispeech sample 1
|
| 108 |
src: https://cdn-media.huggingface.co/speech_samples/sample1.flac
|
|
@@ -188,6 +192,7 @@ Which forces the model to predict in English under the task of speech recognitio
|
|
| 188 |
## Transcription
|
| 189 |
|
| 190 |
### English to English
|
|
|
|
| 191 |
In this example, the context tokens are 'unforced', meaning the model automatically predicts the output language
|
| 192 |
(English) and task (transcribe).
|
| 193 |
|
|
@@ -217,7 +222,8 @@ In this example, the context tokens are 'unforced', meaning the model automatica
|
|
| 217 |
The context tokens can be removed from the start of the transcription by setting `skip_special_tokens=True`.
|
| 218 |
|
| 219 |
### French to French
|
| 220 |
-
|
|
|
|
| 221 |
|
| 222 |
```python
|
| 223 |
>>> from transformers import WhisperProcessor, WhisperForConditionalGeneration
|
|
@@ -245,6 +251,7 @@ The following example demonstrates French to French transcription by setting the
|
|
| 245 |
```
|
| 246 |
|
| 247 |
## Translation
|
|
|
|
| 248 |
Setting the task to "translate" forces the Whisper model to perform speech translation.
|
| 249 |
|
| 250 |
### French to English
|
|
@@ -392,4 +399,4 @@ There are also potential dual use concerns that come with releasing Whisper. Whi
|
|
| 392 |
year = {2022},
|
| 393 |
copyright = {arXiv.org perpetual, non-exclusive license}
|
| 394 |
}
|
| 395 |
-
```
|
|
|
|
| 1 |
---
|
| 2 |
+
language:
|
| 3 |
- en
|
| 4 |
- zh
|
| 5 |
- de
|
|
|
|
| 29 |
- da
|
| 30 |
- hu
|
| 31 |
- ta
|
| 32 |
+
- 'no'
|
| 33 |
- th
|
| 34 |
- ur
|
| 35 |
- hr
|
|
|
|
| 99 |
- ba
|
| 100 |
- jw
|
| 101 |
- su
|
| 102 |
+
|
| 103 |
tags:
|
| 104 |
- audio
|
| 105 |
- automatic-speech-recognition
|
| 106 |
- hf-asr-leaderboard
|
| 107 |
+
- code
|
| 108 |
+
- legal
|
| 109 |
+
|
| 110 |
widget:
|
| 111 |
- example_title: Librispeech sample 1
|
| 112 |
src: https://cdn-media.huggingface.co/speech_samples/sample1.flac
|
|
|
|
| 192 |
## Transcription
|
| 193 |
|
| 194 |
### English to English
|
| 195 |
+
|
| 196 |
In this example, the context tokens are 'unforced', meaning the model automatically predicts the output language
|
| 197 |
(English) and task (transcribe).
|
| 198 |
|
|
|
|
| 222 |
The context tokens can be removed from the start of the transcription by setting `skip_special_tokens=True`.
|
| 223 |
|
| 224 |
### French to French
|
| 225 |
+
|
| 226 |
+
The following example demonstrates French to French transcription by setting the appropriate decoder ids.
|
| 227 |
|
| 228 |
```python
|
| 229 |
>>> from transformers import WhisperProcessor, WhisperForConditionalGeneration
|
|
|
|
| 251 |
```
|
| 252 |
|
| 253 |
## Translation
|
| 254 |
+
|
| 255 |
Setting the task to "translate" forces the Whisper model to perform speech translation.
|
| 256 |
|
| 257 |
### French to English
|
|
|
|
| 399 |
year = {2022},
|
| 400 |
copyright = {arXiv.org perpetual, non-exclusive license}
|
| 401 |
}
|
| 402 |
+
```
|