CrazyMonkey0 commited on
Commit
9ef2068
Β·
1 Parent(s): 4d18a16

docs(readme): add Faster Whisper usage and base model attribution

Browse files
Files changed (1) hide show
  1. README.md +17 -13
README.md CHANGED
@@ -10,7 +10,7 @@ app_port: 7860
10
  short_description: "English learning API"
11
  models:
12
  - Qwen/Qwen2.5-1.5B-Instruct
13
- - openai/whisper-small.en
14
  - facebook/mms-tts-eng
15
  - allegro/BiDi-eng-pol
16
  tags:
@@ -43,9 +43,12 @@ This project uses several open-source AI models from Hugging Face.
43
  Each model retains its original license as listed below:
44
 
45
  ### πŸ”Š Speech Recognition
46
- - [**Whisper Small (English)**](https://huggingface.co/openai/whisper-small.en)
47
- Licensed under the [Apache License 2.0](http://www.apache.org/licenses/LICENSE-2.0).
48
- Developed by [**OpenAI**](https://openai.com/)
 
 
 
49
 
50
  ### πŸ—£οΈ Text-to-Speech (TTS)
51
  - [**facebook/mms-tts-eng**](https://huggingface.co/facebook/mms-tts-eng)
@@ -74,15 +77,14 @@ The source code of this application is distributed separately under the license
74
 
75
  ## πŸ“š References
76
 
77
- ### 1. Whisper Small (English) β€” OpenAI
78
- @misc{radford2022whisper,
79
- doi = {10.48550/ARXIV.2212.04356},
80
- url = {https://arxiv.org/abs/2212.04356},
81
- author = {Radford, Alec and Kim, Jong Wook and Xu, Tao and Brockman, Greg and McLeavey, Christine and Sutskever, Ilya},
82
- title = {Robust Speech Recognition via Large-Scale Weak Supervision},
83
- publisher = {arXiv},
84
- year = {2022},
85
- copyright = {arXiv.org perpetual, non-exclusive license}
86
  }
87
 
88
  ### 2. facebook/mms-tts-eng -- AI at Meta
@@ -123,6 +125,8 @@ This project would not be possible without the amazing work of the open-source c
123
  Special thanks to the teams and organizations that created and maintain the following models and tools:
124
 
125
  - **[OpenAI](https://openai.com/)** for [**Whisper Small (English)**](https://huggingface.co/openai/whisper-small.en) β€” Licensed under [Apache License 2.0](http://www.apache.org/licenses/LICENSE-2.0).
 
 
126
  - **[Facebook AI Research (FAIR)](https://ai.facebook.com/)** for [**facebook/mms-tts-eng**](https://huggingface.co/facebook/mms-tts-eng) β€” Licensed under [Creative Commons Attribution Non Commercial 4.0 (CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/).
127
  - **[Qwen Team](https://qwen.ai/)** for [**Qwen/Qwen2.5-1.5B-Instruct**](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) β€” Licensed under [Apache License 2.0](http://www.apache.org/licenses/LICENSE-2.0).
128
  - **[Allegro ML Research](https://ml.allegro.tech/)** for [**BiDi-eng-pol**](https://huggingface.co/allegro/BiDi-eng-pol) β€” Licensed under [Creative Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/).
 
10
  short_description: "English learning API"
11
  models:
12
  - Qwen/Qwen2.5-1.5B-Instruct
13
+ - faster_whisper/faster-whisper-small-en
14
  - facebook/mms-tts-eng
15
  - allegro/BiDi-eng-pol
16
  tags:
 
43
  Each model retains its original license as listed below:
44
 
45
  ### πŸ”Š Speech Recognition
46
+ - **Faster Whisper Small (English)**
47
+ Efficient CPU/GPU implementation of OpenAI Whisper Small.
48
+ Base model: **openai/whisper-small.en**
49
+ Licensed under [mit](https://choosealicense.com/licenses/mit/).
50
+ Implementation by [**Faster Whisper**](https://huggingface.co/Systran/faster-whisper-small.en)
51
+
52
 
53
  ### πŸ—£οΈ Text-to-Speech (TTS)
54
  - [**facebook/mms-tts-eng**](https://huggingface.co/facebook/mms-tts-eng)
 
77
 
78
  ## πŸ“š References
79
 
80
+ ### 1. Faster Whisper Small (English) β€” Systran
81
+ @misc{faster_whisper_small,
82
+ title = {Faster Whisper Small English},
83
+ author = {Systran AI},
84
+ note = {CTranslate2-converted version of [**OpenAI Whisper Small**](https://huggingface.co/openai/whisper-small.en) for fast inference with faster_whisper},
85
+ url = {https://huggingface.co/Systran/faster-whisper-small.en},
86
+ year = {2024},
87
+ license = {MIT (conversion), original model Apache 2.0}
 
88
  }
89
 
90
  ### 2. facebook/mms-tts-eng -- AI at Meta
 
125
  Special thanks to the teams and organizations that created and maintain the following models and tools:
126
 
127
  - **[OpenAI](https://openai.com/)** for [**Whisper Small (English)**](https://huggingface.co/openai/whisper-small.en) β€” Licensed under [Apache License 2.0](http://www.apache.org/licenses/LICENSE-2.0).
128
+ - **[Systran / Faster Whisper](https://huggingface.co/Systran/faster-whisper-small.en)** for [**Faster Whisper Small (English)**](https://huggingface.co/openai/whisper-small.en) β€” a CTranslate2-converted version of **OpenAI Whisper Small**, optimized for fast CPU/GPU inference.
129
+ Licensed under [MIT](https://choosealicense.com/licenses/mit/) and Apache 2.0 (original model)
130
  - **[Facebook AI Research (FAIR)](https://ai.facebook.com/)** for [**facebook/mms-tts-eng**](https://huggingface.co/facebook/mms-tts-eng) β€” Licensed under [Creative Commons Attribution Non Commercial 4.0 (CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/).
131
  - **[Qwen Team](https://qwen.ai/)** for [**Qwen/Qwen2.5-1.5B-Instruct**](https://huggingface.co/Qwen/Qwen2.5-1.5B-Instruct) β€” Licensed under [Apache License 2.0](http://www.apache.org/licenses/LICENSE-2.0).
132
  - **[Allegro ML Research](https://ml.allegro.tech/)** for [**BiDi-eng-pol**](https://huggingface.co/allegro/BiDi-eng-pol) β€” Licensed under [Creative Commons Attribution 4.0 International (CC BY 4.0)](https://creativecommons.org/licenses/by/4.0/).