Update README.md
README.md
CHANGED
@@ -13,7 +13,7 @@ license_link: >-
<p align="center">
- 👨💻 <a href="https://github.com/SkyworkAI/Skywork" target="_blank">Github</a> • 🤗 <a href="https://huggingface.co/Skywork" target="_blank">Hugging Face</a>• 🤖 <a href="https://modelscope.cn/organization/Skywork" target="_blank">ModelScope</a> • 💬 <a href="https://github.com/SkyworkAI/Skywork/blob/main/misc/wechat.png?raw=true" target="_blank">WeChat</a>• 📜<a href="
</p>
@@ -48,9 +48,9 @@ license_link: >-
**Skywork-13B-Base-8bits** is a quantized version of **Skywork-13B-Base** that supports deployment and inference on consumer-grade GPUs.
- 如果您希望了解更多的信息,如训练方案,评估方法,请参考我们的[技术报告](
- If you are interested in more training and evaluation details, please refer to our [technical report](
## 训练数据(Training Data)
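We built a careful data-cleaning pipeline to filter low-quality data, harmful information, and sensitive information out of the text. Our Skywork-13B-Base model was trained on 3.2TB of cleaned, high-quality Chinese, English, and code data, of which English accounts for 52.2%, Chinese for 39.6%, and code for 8%. This mix balances performance in Chinese and English while also preserving coding ability.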
<p align="center">
+ 👨💻 <a href="https://github.com/SkyworkAI/Skywork" target="_blank">Github</a> • 🤗 <a href="https://huggingface.co/Skywork" target="_blank">Hugging Face</a>• 🤖 <a href="https://modelscope.cn/organization/Skywork" target="_blank">ModelScope</a> • 💬 <a href="https://github.com/SkyworkAI/Skywork/blob/main/misc/wechat.png?raw=true" target="_blank">WeChat</a>• 📜<a href="http://arxiv.org/abs/2310.19341" target="_blank">Tech Report</a>
</p>
**Skywork-13B-Base-8bits** is a quantized version of **Skywork-13B-Base** that supports deployment and inference on consumer-grade GPUs.
+ 如果您希望了解更多的信息,如训练方案,评估方法,请参考我们的[技术报告](http://arxiv.org/abs/2310.19341),[Skymath](https://arxiv.org/abs/2310.16713)论文,[SkyworkMM](https://github.com/will-singularity/Skywork-MM/blob/main/skywork_mm.pdf)论文。
+ If you are interested in more training and evaluation details, please refer to our [technical report](http://arxiv.org/abs/2310.19341), the [Skymath](https://arxiv.org/skywork-tech-report) paper, and the [SkyworkMM](https://github.com/will-singularity/Skywork-MM/blob/main/skywork_mm.pdf) paper.
## 训练数据(Training Data)
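We built a careful data-cleaning pipeline to filter low-quality data, harmful information, and sensitive information out of the text. Our Skywork-13B-Base model was trained on 3.2TB of cleaned, high-quality Chinese, English, and code data, of which English accounts for 52.2%, Chinese for 39.6%, and code for 8%. This mix balances performance in Chinese and English while also preserving coding ability.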