update readme
README.md CHANGED

@@ -1,52 +1,27 @@
 ---
-
-- other
-license: Apache License 2.0
-tags: []
-tasks:
-- auto-speech-recognition
-
-#model-type:
-## e.g. gpt, phi, llama, chatglm, baichuan
-#- gpt
-
-#domain:
-## e.g. nlp, cv, audio, multi-modal
-#- nlp
-
-#language:
-## list of language codes: https://help.aliyun.com/document_detail/215387.html?spm=a2c4g.11186623.0.0.9f8d7467kni6Aa
-#- cn
-
-#metrics:
-## e.g. CIDEr, BLEU, ROUGE
-#- CIDEr
-
-#tags:
-## any custom tags, including training methods such as pretrained, fine-tuned, instruction-tuned, RL-tuned, and others
-#- pretrained
-
-#tools:
-## e.g. vllm, fastchat, llamacpp, AdaSeq
-#- vllm
+license: apache-2.0
 ---
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
+
+# RapidSpeech.cpp (https://github.com/RapidAI/RapidSpeech.cpp)
+
+**RapidSpeech.cpp** is a high-performance, **edge-native speech intelligence framework** built on top of **ggml**.
+It aims to provide **pure-C++**, **zero-dependency**, **on-device inference** for large-scale ASR (Automatic Speech Recognition) and TTS (Text-to-Speech) models.
+
+------
+
+## 🌟 Key Differentiators
+
+While the open-source ecosystem already offers powerful cloud-side frameworks such as **vLLM-omni**, as well as mature on-device solutions like **sherpa-onnx**, **RapidSpeech.cpp** introduces a new generation of design choices focused on edge deployment.
+
+### 1. vs. vLLM: Edge-first, not cloud-throughput-first
+
+- **vLLM**
+  - Designed for data centers and cloud environments
+  - Tightly coupled with Python and CUDA
+  - Maximizes GPU throughput via techniques such as PagedAttention
+
+- **RapidSpeech.cpp**
+  - Designed specifically for **edge and on-device inference**
+  - Optimized for **low latency, low memory footprint, and lightweight deployment**
+  - Runs on embedded devices, mobile platforms, laptops, and even NPU-only systems
+  - **No Python runtime required**