# Digital Human Intelligent Dialogue System - Linly-Talker: "Digital Human Interaction, Interact with Your Virtual Self"
<div align="center">
<h1>Linly-Talker WebUI</h1>
[![madewithlove](https://img.shields.io/badge/made_with-%E2%9D%A4-red?style=for-the-badge&labelColor=orange)](https://github.com/Kedreamix/Linly-Talker)
<img src="docs/linly_logo.png" /><br>
[![Open In Colab](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/Kedreamix/Linly-Talker/blob/main/colab_webui.ipynb)
[![Licence](https://img.shields.io/badge/LICENSE-MIT-green.svg?style=for-the-badge)](https://github.com/Kedreamix/Linly-Talker/blob/main/LICENSE)
[![Huggingface](https://img.shields.io/badge/🤗%20-Models%20Repo-yellow.svg?style=for-the-badge)](https://huggingface.co/Kedreamix/Linly-Talker)
[**English**](./README.md) | [**中文简体**](./README_zh.md)
</div>
**2023.12 Update** 📆

**Users can upload any image for conversation**

**2024.01 Update** 📆

- **Exciting news! I have integrated the powerful GeminiPro and Qwen large models into our dialogue scenarios. Users can now upload any image during a conversation, adding a new dimension to the interaction.**
- **Updated the FastAPI deployment and invocation method.**
- **Updated the advanced settings for Microsoft TTS to offer a wider variety of voices, and added video subtitles for better visualization.**
- **Updated the GPT multi-turn dialogue system so that conversations keep their context, improving the interactivity and realism of the digital human.**

**2024.02 Update** 📆

- **Upgraded Gradio to version 4.16.0, the latest at the time, which gives the interface more features, such as building a digital human from a photo taken with the camera.**
- **Updated ASR and THG: ASR now includes Alibaba's FunASR, which is faster; THG now includes the Wav2Lip model, and ER-NeRF is in preparation (Coming Soon).**
- **Added the GPT-SoVITS voice cloning model, which can clone a voice by fine-tuning on one minute of the target speaker's audio. The results are quite good and it is recommended.**
- **Integrated a WebUI for running Linly-Talker more conveniently.**

**2024.04 Update** 📆

- **Added Paddle TTS as an offline alternative to Edge TTS.**
- **Added ER-NeRF as one of the options for avatar generation.**
- **Added app_talk.py, which generates videos from freely uploaded audio and images or videos, without a dialogue scenario.**

**2024.05 Update** 📆

- **Updated the beginner-friendly AutoDL deployment tutorial and the codewithgpu image, which can be used for one-click trial and learning.**
- **Updated webui.py; the Linly-Talker WebUI now supports multiple modules, multiple models, and multiple options.**

**2024.06 Update** 📆

- **Integrated MuseTalk into Linly-Talker and into the WebUI, which largely enables real-time conversation.**
- **The improved WebUI does not load the LLM by default, which reduces GPU memory usage, and it can produce a narrated video directly from a question's reply. The refined WebUI has three main features: personalized character generation, multi-turn intelligent dialogue with a digital human, and real-time dialogue with MuseTalk. These changes remove the earlier GPU memory redundancy and add more hints to make the system easier to use.**
---
<details>
<summary>Table of Contents</summary>

<!-- TOC -->
- [Digital Human Intelligent Dialogue System - Linly-Talker: "Digital Human Interaction, Interact with Your Virtual Self"](#digital-human-intelligent-dialogue-system---linly-talker-digital-human-interaction-interact-with-your-virtual-self)
  - [Introduction](#introduction)
  - [TO DO LIST](#to-do-list)
  - [Examples](#examples)
  - [Environment Setup](#environment-setup)
  - [ASR - Speech Recognition](#asr---speech-recognition)
    - [Whisper](#whisper)
    - [FunASR](#funasr)
    - [Coming Soon](#coming-soon)
  - [TTS Text To Speech](#tts-text-to-speech)
    - [Edge TTS](#edge-tts)
    - [PaddleTTS](#paddletts)
    - [Coming Soon](#coming-soon-1)
  - [Voice Clone](#voice-clone)
    - [GPT-SoVITS (Recommended)](#gpt-sovits-recommended)
    - [XTTS](#xtts)
    - [Coming Soon](#coming-soon-2)
  - [THG - Avatar](#thg---avatar)
    - [SadTalker](#sadtalker)
    - [Wav2Lip](#wav2lip)
    - [ER-NeRF](#er-nerf)
    - [MuseTalk](#musetalk)
    - [Coming Soon](#coming-soon-3)
  - [LLM - Conversation](#llm---conversation)
    - [Linly-AI](#linly-ai)
    - [Qwen](#qwen)
    - [Gemini-Pro](#gemini-pro)
    - [ChatGPT](#chatgpt)
    - [ChatGLM](#chatglm)
    - [GPT4Free](#gpt4free)
    - [LLM Multi-Model Selection](#llm-multi-model-selection)
    - [Coming Soon](#coming-soon-4)
  - [Optimizations](#optimizations)
  - [Gradio](#gradio)
  - [Launching the WebUI](#launching-the-webui)
    - [WebUI](#webui)
    - [Old Version](#old-version)
  - [Folder Structure](#folder-structure)
  - [Sponsorship](#sponsorship)
  - [References](#references)
  - [Star History](#star-history)
<!-- /TOC -->
</details>
## Introduction
Linly-Talker is an innovative digital human dialogue system that combines recent AI technologies, including large language models (LLM) 🤖, automatic speech recognition (ASR) 🎙️, text-to-speech (TTS) 🗣️, and voice cloning 🎤. Through Gradio it provides an interactive web interface where users can upload an image 📷 and hold a personalized conversation with the AI 💬.

The core features of the system are:

1. **Multi-model integration**: Linly-Talker integrates large models such as Linly, GeminiPro, and Qwen, together with the Whisper speech recognition model and visual models such as SadTalker, to deliver high-quality dialogue and visual generation.
2. **Multi-turn dialogue**: With a GPT-based multi-turn dialogue system, Linly-Talker can understand and maintain coherent, context-aware conversations, which greatly improves the realism of the interaction.
3. **Voice cloning**: Using techniques such as GPT-SoVITS, users can upload a one-minute voice sample for fine-tuning. The system then clones the user's voice so that the digital human can speak with it.
4. **Real-time interaction**: The system supports real-time speech recognition and video subtitles, so users can talk with the digital human naturally by voice.
5. **Visual enhancement**: Using digital human generation techniques, Linly-Talker can produce realistic avatars for a more immersive experience.

The design goal of Linly-Talker is a new form of human-computer interaction that goes beyond simple question answering: a tightly integrated system that provides an intelligent digital human able to understand, respond to, and simulate human communication.
![The system architecture of multimodal human–computer interaction.](docs/HOI.png)
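
The flow in the architecture figure can be sketched as a chain of four stages. This is a minimal sketch rather than the project's actual code: `TalkerPipeline`, `run_turn`, and the `asr`, `llm`, `tts`, and `thg` callables are hypothetical names standing in for the ASR, LLM, TTS, and talking-head modules described later in this README.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class TalkerPipeline:
    """Hypothetical wiring of the four stages; each stage is a plain callable."""
    asr: Callable[[str], str]                          # audio path -> recognized text
    llm: Callable[[str, List[Tuple[str, str]]], str]   # question + history -> answer
    tts: Callable[[str], str]                          # answer text -> audio path
    thg: Callable[[str, str], str]                     # portrait + audio -> video path
    history: List[Tuple[str, str]] = field(default_factory=list)

    def run_turn(self, portrait: str, user_audio: str) -> str:
        question = self.asr(user_audio)
        answer = self.llm(question, self.history)
        self.history.append((question, answer))  # keep context for the next turn
        speech = self.tts(answer)
        return self.thg(portrait, speech)


# Stub stages so the sketch runs without any model installed.
pipeline = TalkerPipeline(
    asr=lambda audio: f"text from {audio}",
    llm=lambda question, hist: f"answer {len(hist) + 1} to: {question}",
    tts=lambda text: "answer.wav",
    thg=lambda image, audio: f"video({image}, {audio})",
)
print(pipeline.run_turn("face.png", "question.wav"))
```

Keeping each stage behind a simple callable is one way to let models be swapped independently, which matches the multi-model design described above.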
> Watch our introduction video: [demo video](https://www.bilibili.com/video/BV1rN4y1a76x/)
>
> I have recorded a series of videos on Bilibili that cover each update and how to use it. For details, see the [Digital Human Intelligent Dialogue System - Linly-Talker collection](https://space.bilibili.com/241286257/channel/collectiondetail?sid=2065753)
>
> - [🔥🔥🔥 Digital Human Dialogue System Linly-Talker 🔥🔥🔥](https://www.bilibili.com/video/BV1rN4y1a76x/)
> - [🚀 The Future of Digital Humans: Empowering Linly-Talker with GPT-SoVITS Voice Cloning](https://www.bilibili.com/video/BV1S4421A7gh/)
> - [Deploying Linly-Talker on the AutoDL Platform (a very detailed tutorial for complete beginners)](https://www.bilibili.com/video/BV1uT421m74z/)
> - [Linly-Talker Update: Offline TTS Integration and Custom Digital Human Solutions](https://www.bilibili.com/video/BV1Mr421u7NN/)
## TO DO LIST
- [x] Completed the basic dialogue pipeline, enabling `voice conversation`
- [x] Added LLM support, including `Linly`, `Qwen`, and `GeminiPro`
- [x] Support uploading `any digital human photo` for conversation
- [x] Added `FastAPI` invocation for Linly
- [x] Added advanced options for Microsoft `TTS`: the voice, pitch, and other parameters can be set for more vocal variety
- [x] Added `subtitles` to generated videos for better visualization
- [x] GPT `multi-turn dialogue` system (improves the interactivity and realism of the digital human and makes it smarter)
- [x] Optimized the Gradio interface and added more models, such as Wav2Lip and FunASR
- [x] `Voice cloning` with GPT-SoVITS, which needs only a simple fine-tune on one minute of speech (clone and synthesize your own voice to make your digital double more realistic and interactive)
- [x] Added offline TTS and NeRF-based methods and models
- [x] Linly-Talker WebUI supports multiple modules, multiple models, and multiple options
- [x] Added MuseTalk to Linly-Talker, reaching close to real-time speed for fast interaction
- [x] Integrated MuseTalk into the Linly-Talker WebUI
- [ ] `Real-time` speech recognition (so that people and digital humans can converse by voice)

🔆 The Linly-Talker project is ongoing, and pull requests are welcome! If you have suggestions about new models, methods, research, or techniques, or if you find a runtime error, feel free to edit and submit a PR. You can also open an issue or contact me directly by email. 📩⭐ If you find this GitHub project useful, please give it a star! 🤩

> If you run into any problems during deployment, see [常见问题汇总.md](https://github.com/Kedreamix/Linly-Talker/blob/main/常见问题汇总.md) (the FAQ summary). I have collected the problems that are likely to come up, and the discussion group is listed there as well. I will update it regularly. Thank you for your interest and support!
## Examples
| Text/Voice Dialogue | Digital Human Response |
| :----------------------------------------------------------: | :----------------------------------------------------------: |
| What is the most effective way to cope with stress? | <video src="https://github.com/Kedreamix/Linly-Talker/assets/61195303/f1deb189-b682-4175-9dea-7eeb0fb392ca"></video> |
| How do you manage your time? | <video src="https://github.com/Kedreamix/Linly-Talker/assets/61195303/968b5c43-4dce-484b-b6c6-0fd4d621ac03"></video> |
| Write a review of a symphony concert, discussing the orchestra's performance and the audience's overall experience. | <video src="https://github.com/Kedreamix/Linly-Talker/assets/61195303/f052820f-6511-4cf0-a383-daf8402630db"></video> |
| Translate into Chinese: Luck is a dividend of sweat. The more you sweat, the luckier you get. | <video src="https://github.com/Kedreamix/Linly-Talker/assets/61195303/118eec13-a9f7-4c38-b4ad-044d36ba9776"></video> |
## Environment Setup
An image has been published on AutoDL and can be used directly: [https://www.codewithgpu.com/i/Kedreamix/Linly-Talker/Kedreamix-Linly-Talker](https://www.codewithgpu.com/i/Kedreamix/Linly-Talker/Kedreamix-Linly-Talker). You can also create the environment directly with Docker. I will keep the image updated.
```bash
docker pull registry.cn-beijing.aliyuncs.com/codewithgpu2/kedreamix-linly-talker:cMDvNE4RYl
```
For Windows, I have added a one-click Python bundle. Run its scripts in order, install the dependencies you need, and download the corresponding models, and it is ready to run. In short, after installing conda, start from step 02 (installing PyTorch). If you run into problems, feel free to contact me.

[Windows one-click bundle](https://pan.quark.cn/s/cc8f19c45a15)

Download the code:
```bash
git clone https://github.com/Kedreamix/Linly-Talker.git --depth 1
```

To use Linly-Talker, you can set up the environment directly with Anaconda. It covers the dependencies for almost all of the models. The steps are as follows:

```bash
conda create -n linly python=3.10
conda activate linly
# PyTorch install option 1: conda
# CUDA 11.7
# conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.7 -c pytorch -c nvidia
# CUDA 11.8
# conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.8 -c pytorch -c nvidia
# PyTorch install option 2: pip
# CUDA 11.7
# pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2
# CUDA 11.8
pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cu118
conda install -q ffmpeg # ffmpeg==4.2.2
# Upgrade pip
python -m pip install --upgrade pip
# Switch the PyPI index to a mirror to speed up package installation
pip config set global.index-url https://pypi.tuna.tsinghua.edu.cn/simple
pip install tb-nightly -i https://mirrors.aliyun.com/pypi/simple
pip install -r requirements_webui.txt
# Install the MuseTalk dependencies
pip install --no-cache-dir -U openmim
mim install mmengine
mim install "mmcv>=2.0.1"
mim install "mmdet>=3.1.0"
mim install "mmpose>=1.1.0"
# Install the NeRF-based dependencies; this step is error-prone and can be skipped at first
pip install "git+https://github.com/facebookresearch/pytorch3d.git"
pip install -r TFG/requirements_nerf.txt
# If pyaudio fails to install, install the system dependencies below
# sudo apt-get install libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0
# If any of the following modules fails to install, enter its directory and build it with `pip install .` or `python setup.py install`
# NeRF/freqencoder
# NeRF/gridencoder
# NeRF/raymarching
# NeRF/shencoder
```
Below are installation methods from older versions. They may cause some dependency conflicts, although usually not many bugs. I switched to the procedure above to make installation easier, so the steps below can be ignored or used as a reference if you run into problems.

> First, use Anaconda to create the environment and install PyTorch as follows:
>
> ```bash
> conda create -n linly python=3.10
> conda activate linly
>
> # PyTorch install option 1: conda (recommended)
> conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch
>
> # PyTorch install option 2: pip
> pip install torch==1.12.1+cu113 torchvision==0.13.1+cu113 torchaudio==0.12.1 --extra-index-url https://download.pytorch.org/whl/cu113
>
> conda install -q ffmpeg # ffmpeg==4.2.2
>
> pip install -r requirements_app.txt
> ```
>
> Voice cloning and similar models need a newer version of PyTorch, which also brings more features, but the driver may need to support CUDA 11.8. This optional setup is:
>
> ```bash
> conda create -n linly python=3.10
> conda activate linly
>
> pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cu118
>
> conda install -q ffmpeg # ffmpeg==4.2.2
>
> pip install -r requirements_app.txt
>
> # Install the voice cloning dependencies
> pip install -r VITS/requirements_gptsovits.txt
> ```
>
> To use NeRF-based models, you may need to install the corresponding environment:
>
> ```bash
> # Install the NeRF dependencies
> pip install "git+https://github.com/facebookresearch/pytorch3d.git"
> pip install -r TFG/requirements_nerf.txt
>
> # If pyaudio fails to install, install the system dependencies below
> # sudo apt-get update
> # sudo apt-get install libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0
>
> # If any of the following modules fails to install, enter its directory and build it with `pip install .` or `python setup.py install`
> # NeRF/freqencoder
> # NeRF/gridencoder
> # NeRF/raymarching
> # NeRF/shencoder
> ```
>
> To use PaddleTTS, install its environment:
>
> ```bash
> pip install -r TTS/requirements_paddle.txt
> ```
>
> To use the FunASR speech recognition model, install its environment:
>
> ```bash
> pip install -r ASR/requirements_funasr.txt
> ```
>
> To use the MuseTalk model, install its environment:
>
> ```bash
> pip install --no-cache-dir -U openmim
> mim install mmengine
> mim install "mmcv>=2.0.1"
> mim install "mmdet>=3.1.0"
> mim install "mmpose>=1.1.0"
> pip install -r TFG/requirements_musetalk.txt
> ```
>
Next, download the corresponding models. The download options are listed below. After downloading, place the files according to the folder structure described at the end of this document. Downloading from Quark Netdisk is recommended because it is updated first.

- [Baidu (百度云盘)](https://pan.baidu.com/s/1eF13O-8wyw4B3MtesctQyg?pwd=linl) (Password: `linl`)
- [huggingface](https://huggingface.co/Kedreamix/Linly-Talker)
- [modelscope](https://www.modelscope.cn/models/Kedreamix/Linly-Talker/summary)
- [Quark (夸克网盘)](https://pan.quark.cn/s/f48f5e35796b)

I have written a script that downloads all of the models below with little user effort. It works best on a stable network and is particularly suited to Linux users. Windows users can also download the models with Git. If your network is unstable, you can download the files manually or retry the shell script. The script does the following:

1. **Choose the download source**: Users can download the models from one of three sources: ModelScope, Huggingface, or the Huggingface mirror site.
2. **Download the models**: Runs the download command that matches the user's choice.
3. **Move the model files**: After the download finishes, moves the model files to the expected directories.
4. **Error handling**: Every step includes an error check. If a step fails, the script prints an error message and stops.
```bash
sh scripts/download_models.sh
```
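
For readers who prefer Python, the same flow (pick a source, run the matching command, stop on failure) could look roughly like this. It is a sketch under assumptions, not a port of `scripts/download_models.sh`: the helper names and the source labels `"modelscope"`, `"huggingface"`, and `"hf-mirror"` are made up for illustration, and the commands are the ones shown in the HuggingFace and ModelScope sections of this README.

```python
import os
import subprocess
from typing import Dict, List, Tuple

HF_REPO = "Kedreamix/Linly-Talker"


def build_download(source: str) -> Tuple[List[str], Dict[str, str]]:
    """Return (command, extra environment variables) for a download source."""
    hf_cmd = ["huggingface-cli", "download", "--resume-download",
              HF_REPO, "--local-dir", "Linly-Talker"]
    if source == "modelscope":
        return (["git", "clone", "--depth", "1",
                 "https://www.modelscope.cn/Kedreamix/Linly-Talker.git"], {})
    if source == "huggingface":
        return hf_cmd, {}
    if source == "hf-mirror":
        # Same command, but pointed at the mirror through HF_ENDPOINT.
        return hf_cmd, {"HF_ENDPOINT": "https://hf-mirror.com"}
    raise ValueError(f"unknown download source: {source!r}")


def download(source: str) -> None:
    """Run the download; check=True raises CalledProcessError on failure."""
    cmd, extra_env = build_download(source)
    subprocess.run(cmd, env={**os.environ, **extra_env}, check=True)


cmd, env = build_download("hf-mirror")
print(" ".join(cmd), env)
```

Calling `download("hf-mirror")` would then start the actual transfer; building the command separately from running it keeps the selection logic easy to check without network access.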
**HuggingFace download**

If the download is too slow, consider using a mirror. See [A Quick and Easy Way to Get Hugging Face Models (Using a Mirror Site)](https://kedreamix.github.io/2024/01/05/Note/HuggingFace/?highlight=镜像)

```bash
# Download the pretrained models from Hugging Face
git lfs install
git clone https://huggingface.co/Kedreamix/Linly-Talker --depth 1
# git lfs clone https://huggingface.co/Kedreamix/Linly-Talker
# pip install -U huggingface_hub
# export HF_ENDPOINT=https://hf-mirror.com # use the mirror site
huggingface-cli download --resume-download --local-dir-use-symlinks False Kedreamix/Linly-Talker --local-dir Linly-Talker
```

**ModelScope download**

```bash
# Download the pretrained models from ModelScope
# 1. Using git
git lfs install
git clone https://www.modelscope.cn/Kedreamix/Linly-Talker.git --depth 1
# git lfs clone https://www.modelscope.cn/Kedreamix/Linly-Talker.git --depth 1
# 2. Using Python (the snapshot_download call is Python code, so it is run through `python -c`)
pip install modelscope
python -c "from modelscope import snapshot_download; model_dir = snapshot_download('Kedreamix/Linly-Talker', resume_download=True, cache_dir='./', revision='master')"
```

**Move all models into the current directory**

If you downloaded from Baidu Netdisk, refer to the directory structure at the end of this document when moving the files.

```bash
# Move all models into the current directory
# checkpoints contains the SadTalker, Wav2Lip, and other weights
mv Linly-Talker/checkpoints/* ./checkpoints
# To use GFPGAN enhancement, install the corresponding library
# pip install gfpgan
# mv Linly-Talker/gfpgan ./
# Voice cloning models
mv Linly-Talker/GPT_SoVITS/pretrained_models/* ./GPT_SoVITS/pretrained_models/
# Qwen large language model
mv Linly-Talker/Qwen ./
# MuseTalk models
mkdir -p ./Musetalk/models
mv Linly-Talker/MuseTalk/* ./Musetalk/models
```
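
After moving the files, a quick check can confirm that the weights ended up where the folder structure at the end of this document expects them. The snippet below is only a sketch: `missing_weights` is a hypothetical helper, and `EXPECTED` lists just a few of the paths from that folder structure, so extend it with the models you actually use.

```python
from pathlib import Path
from typing import Iterable, List

# A few paths taken from the folder structure at the end of this document.
EXPECTED = [
    "checkpoints/wav2lip.pth",
    "checkpoints/SadTalker_V0.0.2_256.safetensors",
    "GPT_SoVITS/pretrained_models/s2G488k.pth",
    "Qwen/Qwen-1_8B-Chat/config.json",
]


def missing_weights(root: str, expected: Iterable[str] = EXPECTED) -> List[str]:
    """Return the expected relative paths that do not exist under `root`."""
    base = Path(root)
    return [rel for rel in expected if not (base / rel).is_file()]


# Report anything missing relative to the current directory.
for rel in missing_weights("."):
    print(f"missing: {rel}")
```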
To make deployment easier, a `configs.py` file has been added. Adjust the few settings in it as needed:

```python
# Device running port
port = 6006
# API running port and IP
mode = 'api' # 'api' mode requires running Linly-api-fast.py first; currently it applies only to Linly
# Local access: 127.0.0.1 (localhost); for global port forwarding use "0.0.0.0"
ip = '127.0.0.1'
api_port = 7871
# LLM model path (Linly model path)
mode = 'offline' # this later assignment overrides 'api' above
model_path = 'Qwen/Qwen-1_8B-Chat'
# SSL certificate; required for microphone conversation
# Absolute paths are recommended
ssl_certfile = "./https_cert/cert.pem"
ssl_keyfile = "./https_cert/key.pem"
```
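
As an illustration of how such settings might be consumed, the sketch below builds the API base URL and resolves the certificate paths to absolute ones. The helper names `api_base_url` and `resolve_ssl` are hypothetical and not part of the project; the values are copied from the excerpt above.

```python
from pathlib import Path
from typing import Optional, Tuple

# Values copied from the configs.py excerpt above.
ip = "127.0.0.1"
api_port = 7871
ssl_certfile = "./https_cert/cert.pem"
ssl_keyfile = "./https_cert/key.pem"


def api_base_url(host: str, port: int, use_https: bool = False) -> str:
    """Build the base URL of the LLM API server (hypothetical helper)."""
    scheme = "https" if use_https else "http"
    return f"{scheme}://{host}:{port}"


def resolve_ssl(cert: str, key: str) -> Optional[Tuple[str, str]]:
    """Return absolute cert/key paths, or None if either file is missing."""
    cert_path, key_path = Path(cert).resolve(), Path(key).resolve()
    if cert_path.is_file() and key_path.is_file():
        return str(cert_path), str(key_path)
    return None


print(api_base_url(ip, api_port))
print(resolve_ssl(ssl_certfile, ssl_keyfile))
```

Gradio's `launch()` accepts `ssl_certfile` and `ssl_keyfile` arguments, so the resolved pair could be passed straight through when HTTPS is needed for microphone input.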
## ASR - Speech Recognition
For a detailed **usage guide** and **code implementation** of speech recognition, see [ASR - The Bridge for Communicating with Digital Humans](./ASR/README.md)
### Whisper
Speech recognition is implemented with OpenAI's Whisper. For usage details, see [https://github.com/openai/whisper](https://github.com/openai/whisper)
### FunASR
Alibaba's `FunASR` also gives quite good recognition results, runs faster than Whisper, and in practice works better for Chinese.

FunASR is also better suited to real-time use, so it has been added as well. You can try it through the FunASR file in the ASR folder. See [https://github.com/alibaba-damo-academy/FunASR](https://github.com/alibaba-damo-academy/FunASR).
### Coming Soon
Suggestions are welcome. They motivate me to keep adding models and enriching Linly-Talker's features.
## TTS Text To Speech
For a detailed **usage guide** and **code implementation** of text-to-speech, see [TTS - Giving Digital Humans Realistic Voice Interaction](./TTS/README.md)
### Edge TTS
Edge TTS uses Microsoft's speech service. For usage details, see [https://github.com/rany2/edge-tts](https://github.com/rany2/edge-tts)
### PaddleTTS
In practice you may need to work offline. Edge TTS requires an online environment to generate speech, so we chose PaddleSpeech, which is also open source, as an alternative for text-to-speech (TTS). The results may differ somewhat, but PaddleSpeech supports offline operation. For more information, see the PaddleSpeech GitHub page: [PaddleSpeech](https://github.com/PaddlePaddle/PaddleSpeech).
### Coming Soon
Suggestions are welcome. They motivate me to keep adding models and enriching Linly-Talker's features.
## Voice Clone
For a detailed **usage guide** and **code implementation** of voice cloning, see [Voice Clone - Quietly Stealing Your Voice During Conversation](./VITS/README.md)
### GPT-SoVITS (Recommended)
Thanks to the open-source community, I was able to build on the open-source voice cloning model `GPT-SoVITS`, which I find works quite well. The project is at [https://github.com/RVC-Boss/GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS)

I have put some trained cloning weights on [Quark (夸克网盘)](https://pan.quark.cn/s/f48f5e35796b). Feel free to take the weights and reference audio from there.
### XTTS
Coqui XTTS is a leading deep-learning toolkit for text-to-speech (TTS) generation. With a voice clip of 5 seconds or longer it can clone a voice and *carry that voice over to different languages*.

🐸TTS is a library for advanced text-to-speech generation.

🚀 Pretrained models for more than 1100 languages.

🛠️ Tools for training new models and fine-tuning existing models in any language.

📚 Utilities for dataset analysis and curation.

- Try XTTS online: [https://huggingface.co/spaces/coqui/xtts](https://huggingface.co/spaces/coqui/xtts)
- Official GitHub repository: https://github.com/coqui-ai/TTS
### Coming Soon
Suggestions are welcome. They motivate me to keep adding models and enriching Linly-Talker's features.
## THG - Avatar
For a detailed **usage guide** and **code implementation** of digital human generation, see [THG - Building Intelligent Digital Humans](./TFG/README.md)
### SadTalker
Digital humans can be generated with SadTalker (CVPR 2023). For details, see [https://sadtalker.github.io](https://sadtalker.github.io)

Download the SadTalker models before use:
```bash
bash scripts/sadtalker_download_models.sh
```
[Baidu (百度云盘)](https://pan.baidu.com/s/1eF13O-8wyw4B3MtesctQyg?pwd=linl) (Password: `linl`)

[Quark (夸克网盘)](https://pan.quark.cn/s/f48f5e35796b)

> If you download from Baidu Netdisk, remember to place the files under the checkpoints folder. The Baidu download is named sadtalker by default and should be renamed to checkpoints.
### Wav2Lip
Digital humans can also be generated with Wav2Lip (ACM MM 2020). For details, see [https://github.com/Rudrabha/Wav2Lip](https://github.com/Rudrabha/Wav2Lip)

Download the Wav2Lip models before use:
| Model | Description | Link to the model |
| ---------------------------- | ----------------------------------------------------- | ------------------------------------------------------------ |
| Wav2Lip | Highly accurate lip-sync | [Link](https://iiitaphyd-my.sharepoint.com/:u:/g/personal/radrabha_m_research_iiit_ac_in/Eb3LEzbfuKlJiR600lQWRxgBIY27JZg80f7V9jtMfbNDaQ?e=TBFBVW) |
| Wav2Lip + GAN | Slightly inferior lip-sync, but better visual quality | [Link](https://iiitaphyd-my.sharepoint.com/:u:/g/personal/radrabha_m_research_iiit_ac_in/EdjI7bZlgApMqsVoEUUXpLsBxqXbn5z8VTmoxp55YNDcIA?e=n9ljGW) |
| Expert Discriminator | Weights of the expert discriminator | [Link](https://iiitaphyd-my.sharepoint.com/:u:/g/personal/radrabha_m_research_iiit_ac_in/EQRvmiZg-HRAjvI6zqN9eTEBP74KefynCwPWVmF57l-AYA?e=ZRPHKP) |
| Visual Quality Discriminator | Weights of the visual disc trained in a GAN setup | [Link](https://iiitaphyd-my.sharepoint.com/:u:/g/personal/radrabha_m_research_iiit_ac_in/EQVqH88dTm1HjlK11eNba5gBbn15WMS0B0EZbDBttqrqkg?e=ic0ljo) |
### ER-NeRF
ER-NeRF (ICCV 2023) builds digital humans with recent NeRF techniques and supports customized avatars: about five minutes of video of one person is enough to reconstruct them. For details, see [https://github.com/Fictionarry/ER-NeRF](https://github.com/Fictionarry/ER-NeRF)

ER-NeRF has been added, with an Obama avatar as the reference. For better results, consider also cloning the voice of the customized digital human.
### MuseTalk
MuseTalk is a real-time, high-quality, audio-driven lip-sync model that runs at more than 30 frames per second on an NVIDIA Tesla V100 GPU. It can be combined with input videos generated by MuseV as part of a complete virtual human solution. For details, see [https://github.com/TMElyralab/MuseTalk](https://github.com/TMElyralab/MuseTalk)

MuseTalk is trained to work in the latent space of ft-mse-vae and has the following features:

- **Sync for unseen faces**: Modifies previously unseen faces according to the input audio, with a face region of 256 x 256.
- **Multilingual support**: Accepts audio input in multiple languages, including Chinese, English, and Japanese.
- **High-performance real-time inference**: Achieves real-time inference at more than 30 frames per second on an NVIDIA Tesla V100.
- **Face center point adjustment**: Supports changing the center point of the face region, which has a significant effect on the generated result.
- **Trained on the HDTF dataset**: Provides model checkpoints trained on the HDTF dataset.
- **Training code coming soon**: The training code will be released soon to support further development and research.

MuseTalk is an efficient and flexible tool for synchronizing a virtual human's facial expressions precisely with audio, an important step toward fully interactive virtual humans.

MuseTalk has been integrated into Linly-Talker. Inference runs on MuseV videos and is fast enough for conversation, reaching close to real-time performance. Streaming-based inference is also possible.
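
To make the "30 frames per second" figure concrete: at that rate each frame has a budget of 1000 / 30 ≈ 33.3 ms. The small sketch below checks a measured per-frame latency against that budget. The function names and the sample latencies are illustrative assumptions, not measurements from MuseTalk.

```python
def frame_budget_ms(fps: float) -> float:
    """Time available per frame, in milliseconds, at a given frame rate."""
    return 1000.0 / fps


def is_real_time(per_frame_ms: float, fps: float = 30.0) -> bool:
    """True if the measured per-frame latency fits within the frame budget."""
    return per_frame_ms <= frame_budget_ms(fps)


print(round(frame_budget_ms(30), 1))           # 33.3
print(is_real_time(25.0), is_real_time(40.0))  # True False
```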
### Coming Soon
Suggestions are welcome. They motivate me to keep adding models and enriching Linly-Talker's features.
## LLM - Conversation
For a detailed **usage guide** and **code implementation** of the large language models, see [LLM - Empowering Digital Humans with Large Language Models](./LLM/README.md)
### Linly-AI
Linly comes from the National Key Laboratory of Data Engineering at Shenzhen University. See [https://github.com/CVI-SZU/Linly](https://github.com/CVI-SZU/Linly)
### Qwen
Qwen comes from Alibaba Cloud. See [https://github.com/QwenLM/Qwen](https://github.com/QwenLM/Qwen)

For a quick start, choose the 1.8B model: it has fewer parameters and runs normally on GPUs with limited memory. This model can of course be swapped for another.

Download the Qwen1.8B model: [https://huggingface.co/Qwen/Qwen-1_8B-Chat](https://huggingface.co/Qwen/Qwen-1_8B-Chat)
### Gemini-Pro
Gemini-Pro comes from Google. Learn more at [https://deepmind.google/technologies/gemini/](https://deepmind.google/technologies/gemini/)

Request an API key: [https://makersuite.google.com/](https://makersuite.google.com/)
### ChatGPT
ChatGPT comes from OpenAI and requires an API key. Learn more at [https://platform.openai.com/docs/introduction](https://platform.openai.com/docs/introduction)
### ChatGLM
ChatGLM comes from Tsinghua University. Learn more at [https://github.com/THUDM/ChatGLM3](https://github.com/THUDM/ChatGLM3)
### GPT4Free
See [https://github.com/xtekky/gpt4free](https://github.com/xtekky/gpt4free) for free access to models such as GPT-4.
### LLM Multi-Model Selection
In the webui.py file you can easily select the model you need. ⚠️ The model must be downloaded before the first run; see the Qwen1.8B download above.
### Coming Soon
Suggestions are welcome. They motivate me to keep adding models and enriching Linly-Talker's features.
## Optimizations
Some optimizations:

- Use a fixed input face image and extract its features in advance, so they are not re-read on every request
- Remove unnecessary libraries to shorten the total run time
- Save only the final video output, not intermediate results, to improve performance
- Use OpenCV to generate the final video, which is faster than mimwrite
## Gradio
Gradio is a Python library that offers a simple way to deploy machine learning models as interactive web applications.

For Linly-Talker, Gradio serves two main purposes:

1. **Visualization and demonstration**: Gradio gives the model a simple web GUI. After uploading an image and text, you can see the result directly. This is an effective way to show what the system can do.
2. **User interaction**: The Gradio GUI can act as the front end, letting users hold an interactive conversation with Linly-Talker. Users can upload their own image, enter a question, and get an answer in real time, which makes voice interaction feel more natural.

Specifically, app.py creates a Gradio Interface that takes an image and text as input, calls a function to generate the response video, and displays it in the GUI. This gives browser-based interaction without writing a complex front end.

In short, Gradio provides the visualization and user interaction layer for Linly-Talker, and it is an effective way to demonstrate the system and make it available to end users.
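
A stripped-down version of such an interface might look like the sketch below. It is not the code in app.py: `respond` is a stub that stands in for the real generation pipeline, the output path is a placeholder, and Gradio is imported inside `build_demo` so that the stub can be exercised without Gradio installed.

```python
def respond(image_path: str, question: str) -> str:
    """Stub for the real pipeline: returns the path of the generated video."""
    if not question.strip():
        raise ValueError("question must not be empty")
    # A real implementation would run LLM -> TTS -> talking-head generation here.
    return "results/answer.mp4"


def build_demo():
    """Create the Gradio interface (requires `pip install gradio`)."""
    import gradio as gr

    return gr.Interface(
        fn=respond,
        inputs=[
            gr.Image(type="filepath", label="Portrait"),
            gr.Textbox(label="Question"),
        ],
        outputs=gr.Video(label="Digital human response"),
        title="Linly-Talker (sketch)",
    )


print(respond("face.png", "How do you manage your time?"))
```

Calling `build_demo().launch(server_name="127.0.0.1", server_port=6006)` would then serve the page on the port used in `configs.py`.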
> For real-time conversation, it may be necessary to switch to another framework or to modify Gradio heavily. I hope we can work on this together.

## Launching the WebUI
Previously the different versions were separate apps, and running several of them was cumbersome. I have since combined them into a single WebUI, which will continue to be updated.
### WebUI
The WebUI currently includes the following features:

- [x] Text/voice conversation with a digital human (fixed avatars, with male and female characters)
- [x] Conversation with a digital human built from any image (upload any avatar image)
- [x] Multi-turn GPT dialogue (uses the conversation history to keep context)
- [x] Voice-cloned dialogue (voice cloning is configured through GPT-SoVITS; the voice from the spoken conversation can also be cloned)
- [x] Digital human text/voice broadcast (narrates the given text or speech)
- [x] Multiple modules ➕ multiple models ➕ multiple choices
  - [x] Character options: female character / male character / custom character (each allows an image to be uploaded) / Coming Soon
  - [x] TTS model options: EdgeTTS / PaddleTTS / GPT-SoVITS / Coming Soon
  - [x] LLM model options: Linly / Qwen / ChatGLM / GeminiPro / ChatGPT / Coming Soon
  - [x] Talker model options: Wav2Lip / SadTalker / ERNeRF / MuseTalk / Coming Soon
  - [x] ASR model options: Whisper / FunASR / Coming Soon
![](docs/WebUI2.png)

Run the WebUI directly to get results. The page looks like this:
```bash
# WebUI
python webui.py
```
![](docs/WebUI.png)

This update refreshes the interface: you can freely select a fine-tuned GPT-SoVITS model and upload a reference audio clip to clone a voice well.
![](docs/WebUI3.png)
### Old Version
> This section exists to verify that each part of the code is correct, so every module is tested and improved individually first.

There are several launch modes. Choose the one that fits your scenario.

The first mode offers Q&A with a fixed, preconfigured character, which saves preprocessing time.
```bash
python app.py
```
![](docs/UI.png)

The first mode was recently updated to add the Wav2Lip model for conversation.
```bash
python appv2.py
```
The second mode lets you upload any image for conversation.
```bash
python app_img.py
```
![](docs/UI2.png)

The third mode builds on the first by adding a large language model with multi-turn GPT dialogue.
```bash
python app_multi.py
```
![](docs/UI3.png)

Voice cloning has now been added. You can freely switch to your own cloned voice model and a matching portrait. Here I chose a husky voice and a male image.
```bash
python app_vits.py
```
A fourth mode has been added that is not tied to a dialogue scenario: supply speech directly, or generate it, to produce a digital human video. SadTalker, Wav2Lip, and ER-NeRF are built in.

> ER-NeRF is trained on video of a single person, so the matching model must be swapped in to render correct results. Obama weights are built in and can be used directly.
```bash
python app_talk.py
```
![](docs/UI4.png)

A MuseTalk mode has been added. It preprocesses a MuseV video and then holds the conversation at a speed that essentially meets real-time requirements. MuseTalk is also included in the WebUI.
```bash
python app_musetalk.py
```
![](docs/UI5.png)
## Folder Structure
All weights can be downloaded from the links below. Baidu Netdisk may sometimes lag behind, so downloading from Quark Netdisk is recommended because it is updated first.

- [Baidu (百度云盘)](https://pan.baidu.com/s/1eF13O-8wyw4B3MtesctQyg?pwd=linl) (Password: `linl`)
- [huggingface](https://huggingface.co/Kedreamix/Linly-Talker)
- [modelscope](https://www.modelscope.cn/models/Kedreamix/Linly-Talker/files)
- [Quark (夸克网盘)](https://pan.quark.cn/s/f48f5e35796b)

The weights folder structure is as follows:
```bash
Linly-Talker/
├── checkpoints
│   ├── audio_visual_encoder.pth
│   ├── hub
│   │   └── checkpoints
│   │       └── s3fd-619a316812.pth
│   ├── lipsync_expert.pth
│   ├── mapping_00109-model.pth.tar
│   ├── mapping_00229-model.pth.tar
│   ├── May.json
│   ├── May.pth
│   ├── Obama_ave.pth
│   ├── Obama.json
│   ├── Obama.pth
│   ├── ref_eo.npy
│   ├── ref.npy
│   ├── ref.wav
│   ├── SadTalker_V0.0.2_256.safetensors
│   ├── visual_quality_disc.pth
│   ├── wav2lip_gan.pth
│   └── wav2lip.pth
├── gfpgan
│   └── weights
│       ├── alignment_WFLW_4HG.pth
│       └── detection_Resnet50_Final.pth
├── GPT_SoVITS
│   └── pretrained_models
│       ├── chinese-hubert-base
│       │   ├── config.json
│       │   ├── preprocessor_config.json
│       │   └── pytorch_model.bin
│       ├── chinese-roberta-wwm-ext-large
│       │   ├── config.json
│       │   ├── pytorch_model.bin
│       │   └── tokenizer.json
│       ├── README.md
│       ├── s1bert25hz-2kh-longer-epoch=68e-step=50232.ckpt
│       ├── s2D488k.pth
│       ├── s2G488k.pth
│       └── speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch
├── MuseTalk
│   ├── models
│   │   ├── dwpose
│   │   │   └── dw-ll_ucoco_384.pth
│   │   ├── face-parse-bisent
│   │   │   ├── 79999_iter.pth
│   │   │   └── resnet18-5c106cde.pth
│   │   ├── musetalk
│   │   │   ├── musetalk.json
│   │   │   └── pytorch_model.bin
│   │   ├── README.md
│   │   ├── sd-vae-ft-mse
│   │   │   ├── config.json
│   │   │   └── diffusion_pytorch_model.bin
│   │   └── whisper
│   │       └── tiny.pt
├── Qwen
│   └── Qwen-1_8B-Chat
│       ├── assets
│       │   ├── logo.jpg
│       │   ├── qwen_tokenizer.png
│       │   ├── react_showcase_001.png
│       │   ├── react_showcase_002.png
│       │   └── wechat.png
│       ├── cache_autogptq_cuda_256.cpp
│       ├── cache_autogptq_cuda_kernel_256.cu
│       ├── config.json
│       ├── configuration_qwen.py
│       ├── cpp_kernels.py
│       ├── examples
│       │   └── react_prompt.md
│       ├── generation_config.json
│       ├── LICENSE
│       ├── model-00001-of-00002.safetensors
│       ├── model-00002-of-00002.safetensors
│       ├── modeling_qwen.py
│       ├── model.safetensors.index.json
│       ├── NOTICE
│       ├── qwen_generation_utils.py
│       ├── qwen.tiktoken
│       ├── README.md
│       ├── tokenization_qwen.py
│       └── tokenizer_config.json
├── Whisper
│   ├── base.pt
│   └── tiny.pt
├── FunASR
│   ├── punc_ct-transformer_zh-cn-common-vocab272727-pytorch
│   │   ├── configuration.json
│   │   ├── config.yaml
│   │   ├── example
│   │   │   └── punc_example.txt
│   │   ├── fig
│   │   │   └── struct.png
│   │   ├── model.pt
│   │   ├── README.md
│   │   └── tokens.json
│   ├── speech_fsmn_vad_zh-cn-16k-common-pytorch
│   │   ├── am.mvn
│   │   ├── configuration.json
│   │   ├── config.yaml
│   │   ├── example
│   │   │   └── vad_example.wav
│   │   ├── fig
│   │   │   └── struct.png
│   │   ├── model.pt
│   │   └── README.md
│   └── speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch
│       ├── am.mvn
│       ├── asr_example_hotword.wav
│       ├── configuration.json
│       ├── config.yaml
│       ├── example
│       │   ├── asr_example.wav
│       │   └── hotword.txt
│       ├── fig
│       │   ├── res.png
│       │   └── seaco.png
│       ├── model.pt
│       ├── README.md
│       ├── seg_dict
│       └── tokens.json
└── README.md
```
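After downloading, it can help to confirm that the weights ended up where the tree above expects them. The following is a minimal sketch, not part of the project: the helper name `find_missing_weights` and the `EXPECTED_WEIGHTS` list are hypothetical, and the list covers only a handful of paths copied from the tree, so extend it for the models you actually use.

```python
from pathlib import Path

# Hypothetical subset of the files shown in the tree above.
# Paths are relative to the Linly-Talker/ root directory.
EXPECTED_WEIGHTS = [
    "checkpoints/wav2lip.pth",
    "checkpoints/SadTalker_V0.0.2_256.safetensors",
    "gfpgan/weights/alignment_WFLW_4HG.pth",
    "MuseTalk/models/musetalk/pytorch_model.bin",
    "Whisper/tiny.pt",
]


def find_missing_weights(root, expected=EXPECTED_WEIGHTS):
    """Return the relative paths in `expected` that are not files under `root`."""
    root = Path(root)
    return [rel for rel in expected if not (root / rel).is_file()]


if __name__ == "__main__":
    # Assumes the script is run from inside the Linly-Talker/ directory.
    missing = find_missing_weights(".")
    if missing:
        print("Missing weight files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All listed weight files were found.")
```

Run it from the repository root before launching any of the `app_*.py` scripts. An empty result only means the listed files exist; it does not validate their contents.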
## Sponsorship
| Alipay | WeChat |
| -------------------- | ----------------------- |
| ![](docs/Alipay.jpg) | ![](docs/WeChatpay.jpg) |
## ๅ‚่€ƒ
**ASR**
- [https://github.com/openai/whisper](https://github.com/openai/whisper)
- [https://github.com/alibaba-damo-academy/FunASR](https://github.com/alibaba-damo-academy/FunASR)
**TTS**
- [https://github.com/rany2/edge-tts](https://github.com/rany2/edge-tts)
- [https://github.com/PaddlePaddle/PaddleSpeech](https://github.com/PaddlePaddle/PaddleSpeech)
**LLM**
- [https://github.com/CVI-SZU/Linly](https://github.com/CVI-SZU/Linly)
- [https://github.com/QwenLM/Qwen](https://github.com/QwenLM/Qwen)
- [https://deepmind.google/technologies/gemini/](https://deepmind.google/technologies/gemini/)
- [https://github.com/THUDM/ChatGLM3](https://github.com/THUDM/ChatGLM3)
- [https://openai.com](https://openai.com)
**THG**
- [https://github.com/OpenTalker/SadTalker](https://github.com/OpenTalker/SadTalker)
- [https://github.com/Rudrabha/Wav2Lip](https://github.com/Rudrabha/Wav2Lip)
- [https://github.com/Fictionarry/ER-NeRF](https://github.com/Fictionarry/ER-NeRF)
**Voice Clone**
- [https://github.com/RVC-Boss/GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS)
- [https://github.com/coqui-ai/TTS](https://github.com/coqui-ai/TTS)
## Star History
[![Star History Chart](https://api.star-history.com/svg?repos=Kedreamix/Linly-Talker&type=Date)](https://star-history.com/#Kedreamix/Linly-Talker&Date)