DataoceanAI1/dolphin-small

Automatic Speech Recognition

DataoceanAI committed 2 days ago

Commit f51368a · verified · 1 parent: e709b04

Update README.md

Files changed (1)

README.md +14 -0

@@ -10,6 +10,20 @@ language: multilingual
 [Huggingface](https://huggingface.co/DataoceanAI)
 [Modelscope](https://www.modelscope.cn/organization/DataoceanAI)
+# Repository Notice
+This model is officially maintained by **Dataocean AI**.
+To ensure compatibility with existing user code and download links, we keep two official repositories for the same model:
+- Original / legacy repository: DataoceanAI
+- Organization / enterprise repository: DataoceanAI1
+Both repositories are maintained by the same team and contain the same model files.
+DataoceanAI1 is the newly created enterprise organization account, while DataoceanAI is kept to avoid breaking existing user download scripts and links.
+Please do not regard either repository as an unofficial copy or unauthorized redistribution.
 Dolphin is a multilingual, multitask ASR model developed through a collaboration between Dataocean AI and Tsinghua University. It supports 40 Eastern languages across East Asia, South Asia, Southeast Asia, and the Middle East, while also supporting 22 Chinese dialects. It is trained on over 210,000 hours of data, which includes both DataoceanAI's proprietary datasets and open-source datasets. The model can perform speech recognition, voice activity detection (VAD), segmentation, and language identification (LID).
 ## Approach