Update README.md
Browse files
README.md
CHANGED
|
@@ -102,43 +102,12 @@ language:
|
|
| 102 |
tags:
|
| 103 |
- audio
|
| 104 |
- automatic-speech-recognition
|
|
|
|
| 105 |
license: mit
|
| 106 |
library_name: ctranslate2
|
|
|
|
| 107 |
---
|
| 108 |
|
| 109 |
-
# Whisper base model for CTranslate2
|
| 110 |
-
|
| 111 |
-
This repository contains the conversion of [openai/whisper-base](https://huggingface.co/openai/whisper-base) to the [CTranslate2](https://github.com/OpenNMT/CTranslate2) model format.
|
| 112 |
-
|
| 113 |
-
This model can be used in CTranslate2 or projects based on CTranslate2 such as [faster-whisper](https://github.com/guillaumekln/faster-whisper).
|
| 114 |
-
|
| 115 |
-
## Example
|
| 116 |
-
|
| 117 |
-
```python
|
| 118 |
-
from faster_whisper import WhisperModel
|
| 119 |
-
|
| 120 |
-
model = WhisperModel("base")
|
| 121 |
-
|
| 122 |
-
segments, info = model.transcribe("audio.mp3")
|
| 123 |
-
for segment in segments:
|
| 124 |
-
print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
|
| 125 |
-
```
|
| 126 |
-
|
| 127 |
-
## Conversion details
|
| 128 |
-
|
| 129 |
-
The original model was converted with the following command:
|
| 130 |
-
|
| 131 |
-
```
|
| 132 |
-
ct2-transformers-converter --model openai/whisper-base --output_dir faster-whisper-base \
|
| 133 |
-
--copy_files tokenizer.json --quantization float16
|
| 134 |
-
```
|
| 135 |
-
|
| 136 |
-
Note that the model weights are saved in FP16. This type can be changed when the model is loaded using the [`compute_type` option in CTranslate2](https://opennmt.net/CTranslate2/quantization.html).
|
| 137 |
-
|
| 138 |
-
## More information
|
| 139 |
-
|
| 140 |
-
**For more information about the original model, see its [model card](https://huggingface.co/openai/whisper-base).**
|
| 141 |
-
|
| 142 |
A copy of `rhasspy/faster-whisper-base-int8` , with metadata added for `speaches` to recognize.
|
| 143 |
|
| 144 |
---
|
|
|
|
| 102 |
tags:
|
| 103 |
- audio
|
| 104 |
- automatic-speech-recognition
|
| 105 |
+
- ctranslate2
|
| 106 |
license: mit
|
| 107 |
library_name: ctranslate2
|
| 108 |
+
pipeline_tag: automatic-speech-recognition
|
| 109 |
---
|
| 110 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 111 |
A copy of `rhasspy/faster-whisper-base-int8` , with metadata added for `speaches` to recognize.
|
| 112 |
|
| 113 |
---
|