Spaces:
Runtime error
Runtime error
| # ๆฐๅญไบบๆบ่ฝๅฏน่ฏ็ณป็ป - Linly-Talker โ โๆฐๅญไบบไบคไบ๏ผไธ่ๆ็่ชๅทฑไบๅจโ | |
| <div align="center"> | |
| <h1>Linly-Talker WebUI</h1> | |
| [](https://github.com/Kedreamix/Linly-Talker) | |
| <img src="docs/linly_logo.png" /><br> | |
| [](https://colab.research.google.com/github/Kedreamix/Linly-Talker/blob/main/colab_webui.ipynb) | |
| [](https://github.com/Kedreamix/Linly-Talker/blob/main/LICENSE) | |
| [](https://huggingface.co/Kedreamix/Linly-Talker) | |
| [**English**](./README.md) | [**ไธญๆ็ฎไฝ**](./README_zh.md) | |
| </div> | |
| **2023.12 ๆดๆฐ** ๐ | |
| **็จๆทๅฏไปฅไธไผ ไปปๆๅพ็่ฟ่กๅฏน่ฏ** | |
| **2024.01 ๆดๆฐ** ๐ | |
| - **ไปคไบบๅ ดๅฅ็ๆถๆฏ๏ผๆ็ฐๅจๅทฒ็ปๅฐๅผบๅคง็GeminiProๅQwenๅคงๆจกๅ่ๅ ฅๅฐๆไปฌ็ๅฏน่ฏๅบๆฏไธญใ็จๆท็ฐๅจๅฏไปฅๅจๅฏน่ฏไธญไธไผ ไปปไฝๅพ็๏ผไธบๆไปฌ็ไบๅจๅขๆทปไบๅ จๆฐ็ๅฑ้ขใ** | |
| - **ๆดๆฐไบFastAPI็้จ็ฝฒ่ฐ็จๆนๆณใ** | |
| - **ๆดๆฐไบๅพฎ่ฝฏTTS็้ซ็บง่ฎพ็ฝฎ้้กน๏ผๅขๅ ๅฃฐ้ณ็ง็ฑป็ๅคๆ ทๆง๏ผไปฅๅๅ ๅ ฅ่ง้ขๅญๅนๅ ๅผบๅฏ่งๅใ** | |
| - **ๆดๆฐไบGPTๅค่ฝฎๅฏน่ฏ็ณป็ป๏ผไฝฟๅพๅฏน่ฏๆไธไธๆ่็ณป๏ผๆ้ซๆฐๅญไบบ็ไบคไบๆงๅ็ๅฎๆใ** | |
| **2024.02 ๆดๆฐ** ๐ | |
| - **ๆดๆฐไบGradio็็ๆฌไธบๆๆฐ็ๆฌ4.16.0๏ผไฝฟๅพ็้ขๆฅๆๆดๅค็ๅ่ฝ๏ผๆฏๅฆๅฏไปฅๆๅๅคดๆๆๅพ็ๆๅปบๆฐๅญไบบ็ญใ** | |
| - **ๆดๆฐไบASRๅTHG๏ผๅ ถไธญASRๅ ๅ ฅไบ้ฟ้็FunASR๏ผๅ ทไฝๆดๅฟซ็้ๅบฆ๏ผTHG้จๅๅ ๅ ฅไบWav2Lipๆจกๅ๏ผER-NeRFๅจๅๅคไธญ(Comming Soon)ใ** | |
| - **ๅ ๅ ฅไบ่ฏญ้ณๅ ้ๆนๆณGPT-SoVITSๆจกๅ๏ผ่ฝๅค้่ฟๅพฎ่ฐไธๅ้ๅฏนๅบไบบ็่ฏญๆ่ฟ่กๅ ้๏ผๆๆ่ฟๆฏ็ธๅฝไธ้็๏ผๅผๅพๆจ่ใ** | |
| - **้ๆไธไธชWebUI็้ข๏ผ่ฝๅคๆดๅฅฝ็่ฟ่กLinly-Talkerใ** | |
| **2024.04 ๆดๆฐ** ๐ | |
| - **ๆดๆฐไบ้ค Edge TTS็ Paddle TTS็็ฆป็บฟๆนๅผใ** | |
| - **ๆดๆฐไบER-NeRFไฝไธบAvatar็ๆ็้ๆฉไนไธใ** | |
| - **ๆดๆฐไบapp_talk.py๏ผๅจไธๅบไบๅฏน่ฏๅบๆฏๅฏ่ช็ฑไธไผ ่ฏญ้ณๅๅพ็่ง้ข็ๆใ** | |
| **2024.05 ๆดๆฐ** ๐ | |
| - **ๆดๆฐ้ถๅบ็กๅฐ็ฝ้จ็ฝฒ AutoDL ๆ็จ๏ผๅนถไธๆดๆฐไบcodewithgpu็้ๅ๏ผๅฏไปฅไธ้ฎ่ฟ่กไฝ้ชๅๅญฆไน ใ** | |
| - **ๆดๆฐไบWebUI.py๏ผLinly-Talker WebUIๆฏๆๅคๆจกๅใๅคๆจกๅๅๅค้้กน** | |
| **2024.06 ๆดๆฐ** ๐ | |
| - **ๆดๆฐMuseTalkๅ ๅ ฅLinly-Talkerไนไธญ๏ผๅนถไธๆดๆฐไบWebUIไธญ๏ผ่ฝๅคๅบๆฌๅฎ็ฐๅฎๆถๅฏน่ฏใ** | |
| - **ๆน่ฟ็WebUIๅจ้ป่ฎค่ฎพ็ฝฎไธไธๅ ่ฝฝLLMๆจกๅ๏ผไปฅๅๅฐๆพๅญไฝฟ็จ๏ผๅนถไธๅฏไปฅ็ดๆฅ้่ฟ้ฎ้ขๅๅคๅฎๆๅฃๆญๅ่ฝใ็ฒพ็ปๅๅ็WebUIๅ ๅซไปฅไธไธไธชไธป่ฆๅ่ฝ๏ผไธชๆงๅ่ง่ฒ็ๆใๆฐๅญไบบๅค่ฝฎๆบ่ฝๅฏน่ฏไปฅๅMuseTalkๅฎๆถๅฏน่ฏใ่ฟไบๆน่ฟไธไป ๅๅฐไบๅ ๅ็ๆพๅญๅไฝ๏ผ่ฟๅขๅ ไบๆดๅคๆ็คบ๏ผไปฅๅธฎๅฉ็จๆทๆด่ฝปๆพๅฐไฝฟ็จใ** | |
| --- | |
| <details> | |
| <summary>็ฎๅฝ</summary> | |
| <!-- TOC --> | |
| - [ๆฐๅญไบบๆบ่ฝๅฏน่ฏ็ณป็ป - Linly-Talker โ โๆฐๅญไบบไบคไบ๏ผไธ่ๆ็่ชๅทฑไบๅจโ](#ๆฐๅญไบบๆบ่ฝๅฏน่ฏ็ณป็ป---linly-talker--ๆฐๅญไบบไบคไบไธ่ๆ็่ชๅทฑไบๅจ) | |
| - [ไป็ป](#ไป็ป) | |
| - [TO DO LIST](#to-do-list) | |
| - [็คบไพ](#็คบไพ) | |
| - [ๅๅปบ็ฏๅข](#ๅๅปบ็ฏๅข) | |
| - [ASR - Speech Recognition](#asr---speech-recognition) | |
| - [Whisper](#whisper) | |
| - [FunASR](#funasr) | |
| - [Coming Soon](#coming-soon) | |
| - [TTS Text To Speech](#tts-text-to-speech) | |
| - [Edge TTS](#edge-tts) | |
| - [PaddleTTS](#paddletts) | |
| - [Coming Soon](#coming-soon-1) | |
| - [Voice Clone](#voice-clone) | |
| - [GPT-SoVITS๏ผๆจ่๏ผ](#gpt-sovitsๆจ่) | |
| - [XTTS](#xtts) | |
| - [Coming Soon](#coming-soon-2) | |
| - [THG - Avatar](#thg---avatar) | |
| - [SadTalker](#sadtalker) | |
| - [Wav2Lip](#wav2lip) | |
| - [ER-NeRF](#er-nerf) | |
| - [MuseTalk](#musetalk) | |
| - [Coming Soon](#coming-soon-3) | |
| - [LLM - Conversation](#llm---conversation) | |
| - [Linly-AI](#linly-ai) | |
| - [Qwen](#qwen) | |
| - [Gemini-Pro](#gemini-pro) | |
| - [ChatGPT](#chatgpt) | |
| - [ChatGLM](#chatglm) | |
| - [GPT4Free](#gpt4free) | |
| - [LLM ๅคๆจกๅ้ๆฉ](#llm-ๅคๆจกๅ้ๆฉ) | |
| - [Coming Soon](#coming-soon-4) | |
| - [ไผๅ](#ไผๅ) | |
| - [Gradio](#gradio) | |
| - [ๅฏๅจWebUI](#ๅฏๅจwebui) | |
| - [WebUI](#webui) | |
| - [Old Verison](#old-verison) | |
| - [ๆไปถๅคน็ปๆ](#ๆไปถๅคน็ปๆ) | |
| - [่ตๅฉ](#่ตๅฉ) | |
| - [ๅ่](#ๅ่) | |
| - [Star History](#star-history) | |
| <!-- /TOC --> | |
| </details> | |
| ## ไป็ป | |
| Linly-Talkerๆฏไธๆฌพๅๆฐ็ๆฐๅญไบบๅฏน่ฏ็ณป็ป๏ผๅฎ่ๅไบๆๆฐ็ไบบๅทฅๆบ่ฝๆๆฏ๏ผๅ ๆฌๅคงๅ่ฏญ่จๆจกๅ๏ผLLM๏ผ๐คใ่ชๅจ่ฏญ้ณ่ฏๅซ๏ผASR๏ผ๐๏ธใๆๆฌๅฐ่ฏญ้ณ่ฝฌๆข๏ผTTS๏ผ๐ฃ๏ธๅ่ฏญ้ณๅ ้ๆๆฏ๐คใ่ฟไธช็ณป็ป้่ฟGradioๅนณๅฐๆไพไบไธไธชไบคไบๅผ็Web็้ข๏ผๅ ่ฎธ็จๆทไธไผ ๅพ็๐ทไธAI่ฟ่กไธชๆงๅ็ๅฏน่ฏไบคๆต๐ฌใ | |
| ็ณป็ป็ๆ ธๅฟ็น็นๅ ๆฌ๏ผ | |
| 1. **ๅคๆจกๅ้ๆ**๏ผLinly-TalkerๆดๅไบLinlyใGeminiProใQwen็ญๅคงๆจกๅ๏ผไปฅๅWhisperใSadTalker็ญ่ง่งๆจกๅ๏ผๅฎ็ฐไบ้ซ่ดจ้็ๅฏน่ฏๅ่ง่ง็ๆใ | |
| 2. **ๅค่ฝฎๅฏน่ฏ่ฝๅ**๏ผ้่ฟGPTๆจกๅ็ๅค่ฝฎๅฏน่ฏ็ณป็ป๏ผLinly-Talker่ฝๅค็่งฃๅนถ็ปดๆไธไธๆ็ธๅ ณ็่ฟ่ดฏๅฏน่ฏ๏ผๆๅคงๅฐๆๅไบไบคไบ็็ๅฎๆใ | |
| 3. **่ฏญ้ณๅ ้**๏ผๅฉ็จGPT-SoVITS็ญๆๆฏ๏ผ็จๆทๅฏไปฅไธไผ ไธๅ้็่ฏญ้ณๆ ทๆฌ่ฟ่กๅพฎ่ฐ๏ผ็ณป็ปๅฐๅ ้็จๆท็ๅฃฐ้ณ๏ผไฝฟๅพๆฐๅญไบบ่ฝๅคไปฅ็จๆท็ๅฃฐ้ณ่ฟ่กๅฏน่ฏใ | |
| 4. **ๅฎๆถไบๅจ**๏ผ็ณป็ปๆฏๆๅฎๆถ่ฏญ้ณ่ฏๅซๅ่ง้ขๅญๅน๏ผไฝฟๅพ็จๆทๅฏไปฅ้่ฟ่ฏญ้ณไธๆฐๅญไบบ่ฟ่ก่ช็ถ็ไบคๆตใ | |
| 5. **่ง่งๅขๅผบ**๏ผ้่ฟๆฐๅญไบบ็ๆ็ญๆๆฏ๏ผLinly-Talker่ฝๅค็ๆ้ผ็็ๆฐๅญไบบๅฝข่ฑก๏ผๆไพๆดๅ ๆฒๆตธๅผ็ไฝ้ชใ | |
| Linly-Talker็่ฎพ่ฎก็ๅฟตๆฏๅ้ ไธ็งๅ จๆฐ็ไบบๆบไบคไบๆนๅผ๏ผไธไป ไป ๆฏ็ฎๅ็้ฎ็ญ๏ผ่ๆฏ้่ฟ้ซๅบฆ้ๆ็ๆๆฏ๏ผๆไพไธไธช่ฝๅค็่งฃใๅๅบๅนถๆจกๆไบบ็ฑปไบคๆต็ๆบ่ฝๆฐๅญไบบใ | |
|  | |
| > ๆฅ็ๆไปฌ็ไป็ป่ง้ข [demo video](https://www.bilibili.com/video/BV1rN4y1a76x/) | |
| > | |
| > ๅจB็ซไธๆๅฝไบไธ็ณปๅ่ง้ข๏ผไนไปฃ่กจๆๆดๆฐ็ๆฏไธๆญฅไธไฝฟ็จๆนๆณ๏ผ่ฏฆ็ปๆฅ็[ๆฐๅญไบบๆบ่ฝๅฏน่ฏ็ณป็ป - Linly-Talkerๅ้](https://space.bilibili.com/241286257/channel/collectiondetail?sid=2065753) | |
| > | |
| > - [๐ฅ๐ฅ๐ฅๆฐๅญไบบๅฏน่ฏ็ณป็ป Linly-Talker๐ฅ๐ฅ๐ฅ](https://www.bilibili.com/video/BV1rN4y1a76x/) | |
| > - [๐ๆฐๅญไบบ็ๆชๆฅ๏ผLinly-Talker+GPT-SoVIT่ฏญ้ณๅ ้ๆๆฏ็่ต่ฝไน้](https://www.bilibili.com/video/BV1S4421A7gh/) | |
| > - [AutoDLๅนณๅฐ้จ็ฝฒLinly-Talker (0ๅบ็กๅฐ็ฝ่ถ ่ฏฆ็ปๆ็จ)](https://www.bilibili.com/video/BV1uT421m74z/) | |
| > - [Linly-Talker ๆดๆฐ็ฆป็บฟTTS้ๆๅๅฎๅถๆฐๅญไบบๆนๆก](https://www.bilibili.com/video/BV1Mr421u7NN/) | |
| ## TO DO LIST | |
| - [x] ๅบๆฌๅฎๆๅฏน่ฏ็ณป็ปๆต็จ๏ผ่ฝๅค`่ฏญ้ณๅฏน่ฏ` | |
| - [x] ๅ ๅ ฅไบLLMๅคงๆจกๅ๏ผๅ ๆฌ`Linly`๏ผ`Qwen`ๅ`GeminiPro`็ไฝฟ็จ | |
| - [x] ๅฏไธไผ `ไปปๆๆฐๅญไบบ็ ง็`่ฟ่กๅฏน่ฏ | |
| - [x] Linlyๅ ๅ ฅ`FastAPI`่ฐ็จๆนๅผ | |
| - [x] ๅฉ็จๅพฎ่ฝฏ`TTS`ๅ ๅ ฅ้ซ็บง้้กน๏ผๅฏ่ฎพ็ฝฎๅฏนๅบไบบๅฃฐไปฅๅ้ณ่ฐ็ญๅๆฐ๏ผๅขๅ ๅฃฐ้ณ็ๅคๆ ทๆง | |
| - [x] ่ง้ข็ๆๅ ๅ ฅ`ๅญๅน`๏ผ่ฝๅคๆดๅฅฝ็่ฟ่กๅฏ่งๅ | |
| - [x] GPT`ๅค่ฝฎๅฏน่ฏ`็ณป็ป๏ผๆ้ซๆฐๅญไบบ็ไบคไบๆงๅ็ๅฎๆ๏ผๅขๅผบๆฐๅญไบบ็ๆบ่ฝ๏ผ | |
| - [x] ไผๅGradio็้ข๏ผๅ ๅ ฅๆดๅคๆจกๅ๏ผๅฆWav2Lip๏ผFunASR็ญ | |
| - [x] `่ฏญ้ณๅ ้`ๆๆฏ๏ผๅ ๅ ฅGPT-SoVITS๏ผๅช้่ฆไธๅ้็่ฏญ้ณ็ฎๅๅพฎ่ฐๅณๅฏ๏ผ่ฏญ้ณๅ ้ๅๆ่ชๅทฑๅฃฐ้ณ๏ผๆ้ซๆฐๅญไบบๅ่บซ็็ๅฎๆๅไบๅจไฝ้ช๏ผ | |
| - [x] ๅ ๅ ฅ็ฆป็บฟTTSไปฅๅNeRF-based็ๆนๆณๅๆจกๅ | |
| - [x] Linly-Talker WebUIๆฏๆๅคๆจกๅใๅคๆจกๅๅๅค้้กน | |
| - [x] ไธบLinly-Talkerๆทปๅ MuseTalkๅ่ฝ๏ผๅบๆฌ่พพๅฐๅฎๆถ็้ๅบฆ๏ผไบคๆต้ๅบฆๅพๅฟซ | |
| - [x] ้ๆMuseTalk่ฟๅ ฅLinly-Talker WebUI | |
| - [ ] `ๅฎๆถ`่ฏญ้ณ่ฏๅซ๏ผไบบไธๆฐๅญไบบไน้ดๅฐฑๅฏไปฅ้่ฟ่ฏญ้ณ่ฟ่กๅฏน่ฏไบคๆต) | |
| ๐ ่ฏฅ้กน็ฎ Linly-Talker ๆญฃๅจ่ฟ่กไธญ - ๆฌข่ฟๆๅบPR่ฏทๆฑ๏ผๅฆๆๆจๆไปปไฝๅ ณไบๆฐ็ๆจกๅๆนๆณใ็ ็ฉถใๆๆฏๆๅ็ฐ่ฟ่ก้่ฏฏ็ๅปบ่ฎฎ๏ผ่ฏท้ๆถ็ผ่พๅนถๆไบค PRใๆจไนๅฏไปฅๆๅผไธไธช้ฎ้ขๆ้่ฟ็ตๅญ้ฎไปถ็ดๆฅ่็ณปๆใ๐ฉโญ ๅฆๆๆจๅ็ฐ่ฟไธชGithub Projectๆ็จ๏ผ่ฏท็ปๅฎ็นไธชๆ๏ผ๐คฉ | |
| > ๅฆๆๅจ้จ็ฝฒ็ๆถๅๆไปปไฝ็้ฎ้ข๏ผๅฏไปฅๅ ณๆณจ[ๅธธ่ง้ฎ้ขๆฑๆป.md](https://github.com/Kedreamix/Linly-Talker/blob/main/ๅธธ่ง้ฎ้ขๆฑๆป.md)้จๅ๏ผๆๅทฒ็ปๆด็ไบๅฏ่ฝๅบ็ฐ็ๆๆ้ฎ้ข๏ผๅฆๅคไบคๆต็พคไนๅจ่ฟ้๏ผๆไผๅฎๆถๆดๆฐ๏ผๆ่ฐขๅคงๅฎถ็ๅ ณๆณจไธไฝฟ็จ๏ผ๏ผ๏ผ | |
| ## ็คบไพ | |
| | ๆๅญ/่ฏญ้ณๅฏน่ฏ | ๆฐๅญไบบๅ็ญ | | |
| | :----------------------------------------------------------: | :----------------------------------------------------------: | | |
| | ๅบๅฏนๅๅๆๆๆ็ๆนๆณๆฏไปไน๏ผ | <video src="https://github.com/Kedreamix/Linly-Talker/assets/61195303/f1deb189-b682-4175-9dea-7eeb0fb392ca"></video> | | |
| | ๅฆไฝ่ฟ่กๆถ้ด็ฎก็๏ผ | <video src="https://github.com/Kedreamix/Linly-Talker/assets/61195303/968b5c43-4dce-484b-b6c6-0fd4d621ac03"></video> | | |
| | ๆฐๅไธ็ฏไบคๅไน้ณไนไผ่ฏ่ฎบ๏ผ่ฎจ่ฎบไนๅข็่กจๆผๅ่งไผ็ๆดไฝไฝ้ชใ | <video src="https://github.com/Kedreamix/Linly-Talker/assets/61195303/f052820f-6511-4cf0-a383-daf8402630db"></video> | | |
| | ็ฟป่ฏๆไธญๆ๏ผLuck is a dividend of sweat. The more you sweat, the luckier you get. | <video src="https://github.com/Kedreamix/Linly-Talker/assets/61195303/118eec13-a9f7-4c38-b4ad-044d36ba9776"></video> | | |
| ## ๅๅปบ็ฏๅข | |
| AutoDLๅทฒๅๅธ้ๅ๏ผๅฏไปฅ็ดๆฅไฝฟ็จ๏ผ[https://www.codewithgpu.com/i/Kedreamix/Linly-Talker/Kedreamix-Linly-Talker](https://www.codewithgpu.com/i/Kedreamix/Linly-Talker/Kedreamix-Linly-Talker)๏ผไนๅฏไปฅไฝฟ็จdockerๆฅ็ดๆฅๅๅปบ็ฏๅข๏ผๆไนไผๆ็ปญไธๆญ็ๆดๆฐ้ๅ | |
| ```bash | |
| docker pull registry.cn-beijing.aliyuncs.com/codewithgpu2/kedreamix-linly-talker:cMDvNE4RYl | |
| ``` | |
| Windowsๆๅ ๅ ฅไบไธไธชpythonไธ้ฎๆดๅๅ ๏ผๅฏไปฅๆ้กบๅบ่ฟ่ก่ฟ่ก๏ผๆ็ ง้ๆฑๆ็ ง็ธๅบ็ไพ่ต๏ผๅนถไธไธ่ฝฝๅฏนๅบ็ๆจกๅ๏ผๅณๅฏ่ฟ่ก๏ผไธป่ฆๆ็ งcondaไปฅๅไป02ๅผๅงๅฎ่ฃ pytorch่ฟ่ก่ฟ่ก๏ผๅฆๆๆ้ฎ้ข๏ผ่ฏท้ๆถไธๆๆฒ้ | |
| [Windowsไธ้ฎๆดๅๅ ](https://pan.quark.cn/s/cc8f19c45a15) | |
| ไธ่ฝฝไปฃ็ | |
| ```bash | |
| git clone https://github.com/Kedreamix/Linly-Talker.git --depth 1 | |
| ``` | |
| ่ฅไฝฟ็จLinly-Talker๏ผๅฏไปฅ็ดๆฅ็จanaconda่ฟ่กๅฎ่ฃ ็ฏๅข๏ผๅ ไนๅ ๆฌๆๆ็ๆจกๅๆ้่ฆ็ไพ่ต๏ผๅ ทไฝๆไฝๅฆไธ๏ผ | |
| ```bash | |
| conda create -n linly python=3.10 | |
| conda activate linly | |
| # pytorchๅฎ่ฃ ๆนๅผ1๏ผcondaๅฎ่ฃ | |
| # CUDA 11.7 | |
| # conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.7 -c pytorch -c nvidia | |
| # CUDA 11.8 | |
| # conda install pytorch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 pytorch-cuda=11.8 -c pytorch -c nvidia | |
| # pytorchๅฎ่ฃ ๆนๅผ2๏ผpip ๅฎ่ฃ | |
| # CUDA 11.7 | |
| # pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 | |
| # CUDA 11.8 | |
| pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cu118 | |
| conda install -q ffmpeg # ffmpeg==4.2.2 | |
| # ๅ็บงpip | |
| python -m pip install --upgrade pip | |
| # ๆดๆข pypi ๆบๅ ้ๅบ็ๅฎ่ฃ | |
| pip config set global.index-url https://pypi.tuna.tsinghua.edu.cn/simple | |
| pip install tb-nightly -i https://mirrors.aliyun.com/pypi/simple | |
| pip install -r requirements_webui.txt | |
| # ๅฎ่ฃ ๆๅ ณmusetalkไพ่ต | |
| pip install --no-cache-dir -U openmim | |
| mim install mmengine | |
| mim install "mmcv>=2.0.1" | |
| mim install "mmdet>=3.1.0" | |
| mim install "mmpose>=1.1.0" | |
| # ๅฎ่ฃ NeRF-basedไพ่ต๏ผๅฏ่ฝ้ฎ้ข่พๅค๏ผๅฏไปฅๅ ๆพๅผ | |
| pip install "git+https://github.com/facebookresearch/pytorch3d.git" | |
| pip install -r TFG/requirements_nerf.txt | |
| # ่ฅpyaudioๅบ็ฐ้ฎ้ข๏ผๅฏๅฎ่ฃ ๅฏนๅบไพ่ต | |
| # sudo apt-get install libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0 | |
| # ๆณจๆไปฅไธๅ ไธชๆจกๅ๏ผ่ฅๅฎ่ฃ ไธๆๅ๏ผๅฏไปฅ่ฟๅ ฅ่ทฏๅพๅฉ็จpip install . ๆ่ python setup.py install็ผ่ฏๅฎ่ฃ | |
| # NeRF/freqencoder | |
| # NeRF/gridencoder | |
| # NeRF/raymarching | |
| # NeRF/shencoder | |
| ``` | |
| ไปฅไธๆฏๆง็ๆฌ็ไธไบๅฎ่ฃ ๆนๆณ๏ผๅฏ่ฝๅญๅจไผไธไบไพ่ตๅฒ็ช็้ฎ้ข๏ผไฝๆฏไนไธไผๅบ็ฐๅคชๅคbug๏ผไฝๆฏไธบไบๆดๅฅฝๆดๆนไพฟ็ๅฎ่ฃ ๏ผๆๅฐฑๆดๆฐไบไธ่ฟฐ็ๆฌ๏ผไปฅไธ็ๆฌๅฏไปฅๅฟฝ็ฅ๏ผๆ่ ้ๅฐ้ฎ้ขๅฏไปฅๅ่ไธไธ | |
| > ้ฆๅ ไฝฟ็จanacondaๅฎ่ฃ ็ฏๅข๏ผๅฎ่ฃ pytorch็ฏๅข๏ผๅ ทไฝๆไฝๅฆไธ๏ผ | |
| > | |
| > ```bash | |
| > conda create -n linly python=3.10 | |
| > conda activate linly | |
| > | |
| > # pytorchๅฎ่ฃ ๆนๅผ1๏ผcondaๅฎ่ฃ ๏ผๆจ่๏ผ | |
| > conda install pytorch==1.12.1 torchvision==0.13.1 torchaudio==0.12.1 cudatoolkit=11.3 -c pytorch | |
| > | |
| > # pytorchๅฎ่ฃ ๆนๅผ2๏ผpip ๅฎ่ฃ | |
| > pip install torch==1.12.1+cu113 torchvision==0.13.1+cu113 torchaudio==0.12.1 --extra-index-url https://download.pytorch.org/whl/cu113 | |
| > | |
| > conda install -q ffmpeg # ffmpeg==4.2.2 | |
| > | |
| > pip install -r requirements_app.txt | |
| > ``` | |
| > | |
| > ่ฅไฝฟ็จ่ฏญ้ณๅ ้็ญๆจกๅ๏ผ้่ฆๆด้ซ็ๆฌ็Pytorch๏ผไฝๆฏๅ่ฝไนไผๆดๅ ไธฐๅฏ๏ผไธ่ฟ้่ฆ็้ฉฑๅจ็ๆฌๅฏ่ฝ่ฆๅฐcuda11.8๏ผๅฏ้ๆฉ | |
| > | |
| > ```bash | |
| > conda create -n linly python=3.10 | |
| > conda activate linly | |
| > | |
| > pip install torch==2.0.1 torchvision==0.15.2 torchaudio==2.0.2 --index-url https://download.pytorch.org/whl/cu118 | |
| > | |
| > conda install -q ffmpeg # ffmpeg==4.2.2 | |
| > | |
| > pip install -r requirements_app.txt | |
| > | |
| > # ๅฎ่ฃ ่ฏญ้ณๅ ้ๅฏนๅบ็ไพ่ต | |
| > pip install -r VITS/requirements_gptsovits.txt | |
| > ``` | |
| > | |
| > ่ฅๅธๆไฝฟ็จNeRF-based็ญๆจกๅ็ญ่ฏ๏ผๅฏ่ฝ้่ฆๅฎ่ฃ ไธไธๅฏนๅบ็็ฏๅข | |
| > | |
| > ```bash | |
| > # ๅฎ่ฃ NeRFๅฏนๅบ็ไพ่ต | |
| > pip install "git+https://github.com/facebookresearch/pytorch3d.git" | |
| > pip install -r TFG/requirements_nerf.txt | |
| > | |
| > # ่ฅpyaudioๅบ็ฐ้ฎ้ข๏ผๅฏๅฎ่ฃ ๅฏนๅบไพ่ต | |
| > # sudo apt-get update | |
| > # sudo apt-get install libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0 | |
| > | |
| > # ๆณจๆไปฅไธๅ ไธชๆจกๅ๏ผ่ฅๅฎ่ฃ ไธๆๅ๏ผๅฏไปฅ่ฟๅ ฅ่ทฏๅพๅฉ็จpip install . ๆ่ python setup.py install็ผ่ฏๅฎ่ฃ | |
| > # NeRF/freqencoder | |
| > # NeRF/gridencoder | |
| > # NeRF/raymarching | |
| > # NeRF/shencoder | |
| > ``` | |
| > | |
| > ่ฅไฝฟ็จPaddleTTS๏ผๅฏๅฎ่ฃ ๅฏนๅบ็็ฏๅข | |
| > | |
| > ```bash | |
| > pip install -r TTS/requirements_paddle.txt | |
| > ``` | |
| > | |
| > ่ฅไฝฟ็จFunASR่ฏญ้ณ่ฏๅซๆจกๅ๏ผๅฏๅฎ่ฃ ็ฏๅข | |
| > | |
| > ``` | |
| > pip install -r ASR/requirements_funasr.txt | |
| > ``` | |
| > | |
| > ่ฅไฝฟ็จMuesTalkๆจกๅ๏ผๅฏๅฎ่ฃ ็ฏๅข | |
| > | |
| > ```bash | |
| > pip install --no-cache-dir -U openmim | |
| > mim install mmengine | |
| > mim install "mmcv>=2.0.1" | |
| > mim install "mmdet>=3.1.0" | |
| > mim install "mmpose>=1.1.0" | |
| > pip install -r TFG/requirements_musetalk.txt | |
| > ``` | |
| > | |
| ๆฅไธๆฅ่ฟ้่ฆๅฎ่ฃ ๅฏนๅบ็ๆจกๅ๏ผๆไปฅไธไธ่ฝฝๆนๅผ๏ผไธ่ฝฝๅๅฎ่ฃ ๆไปถๆถ็ปๆๆพ็ฝฎ๏ผๆไปถๅคน็ปๆๅจๆฌๆๆๅๆ่ฏดๆ๏ผๅปบ่ฎฎไปๅคธๅ ็ฝ็ไธ่ฝฝ๏ผไผ็ฌฌไธๆถ้ดๆดๆฐ | |
| - [Baidu (็พๅบฆไบ็)](https://pan.baidu.com/s/1eF13O-8wyw4B3MtesctQyg?pwd=linl) (Password: `linl`) | |
| - [huggingface](https://huggingface.co/Kedreamix/Linly-Talker) | |
| - [modelscope](https://www.modelscope.cn/models/Kedreamix/Linly-Talker/summary) | |
| - [Quark(ๅคธๅ ็ฝ็)](https://pan.quark.cn/s/f48f5e35796b) | |
| ๆๅถไฝไธไธช่ๆฌๅฏไปฅๅฎๆไธ่ฟฐๆๆๆจกๅ็ไธ่ฝฝ๏ผๆ ้็จๆท่ฟๅคๆไฝใ่ฟ็งๆนๅผ้ๅ็ฝ็ป็จณๅฎ็ๆ ๅต๏ผๅนถไธ็นๅซ้ๅ Linux ็จๆทใๅฏนไบ Windows ็จๆท๏ผไนๅฏไปฅไฝฟ็จ Git ๆฅไธ่ฝฝๆจกๅใๅฆๆ็ฝ็ป็ฏๅขไธ็จณๅฎ๏ผ็จๆทๅฏไปฅ้ๆฉไฝฟ็จๆๅจไธ่ฝฝๆนๆณ๏ผๆ่ ๅฐ่ฏ่ฟ่ก Shell ่ๆฌๆฅๅฎๆไธ่ฝฝใ่ๆฌๅ ทๆไปฅไธๅ่ฝใ | |
| 1. **้ๆฉไธ่ฝฝๆนๅผ**: ็จๆทๅฏไปฅ้ๆฉไปไธ็งไธๅ็ๆบไธ่ฝฝๆจกๅ๏ผModelScopeใHuggingface ๆ Huggingface ้ๅ็ซ็นใ | |
| 2. **ไธ่ฝฝๆจกๅ**: ๆ นๆฎ็จๆท็้ๆฉ๏ผๆง่ก็ธๅบ็ไธ่ฝฝๅฝไปคใ | |
| 3. **็งปๅจๆจกๅๆไปถ**: ไธ่ฝฝๅฎๆๅ๏ผๅฐๆจกๅๆไปถ็งปๅจๅฐๆๅฎ็็ฎๅฝใ | |
| 4. **้่ฏฏๅค็**: ๅจๆฏไธๆญฅๆไฝไธญๅ ๅ ฅไบ้่ฏฏๆฃๆฅ๏ผๅฆๆๆไฝๅคฑ่ดฅ๏ผ่ๆฌไผ่พๅบ้่ฏฏไฟกๆฏๅนถๅๆญขๆง่กใ | |
| ```bash | |
| sh scripts/download_models.sh | |
| ``` | |
| **HuggingFaceไธ่ฝฝ** | |
| ๅฆๆ้ๅบฆๅคชๆ ขๅฏไปฅ่่้ๅ๏ผๅ่ [็ฎไพฟๅฟซๆท่ทๅ Hugging Face ๆจกๅ๏ผไฝฟ็จ้ๅ็ซ็น๏ผ](https://kedreamix.github.io/2024/01/05/Note/HuggingFace/?highlight=้ๅ) | |
| ```bash | |
| # ไปhuggingfaceไธ่ฝฝ้ข่ฎญ็ปๆจกๅ | |
| git lfs install | |
| git clone https://huggingface.co/Kedreamix/Linly-Talker --depth 1 | |
| # git lfs clone https://huggingface.co/Kedreamix/Linly-Talker | |
| # pip install -U huggingface_hub | |
| # export HF_ENDPOINT=https://hf-mirror.com # ไฝฟ็จ้ๅ็ฝ็ซ | |
| huggingface-cli download --resume-download --local-dir-use-symlinks False Kedreamix/Linly-Talker --local-dir Linly-Talker | |
| ``` | |
| **ModelScopeไธ่ฝฝ** | |
| ```bash | |
| # ไปmodelscopeไธ่ฝฝ้ข่ฎญ็ปๆจกๅ | |
| # 1. git ๆนๆณ | |
| git lfs install | |
| git clone https://www.modelscope.cn/Kedreamix/Linly-Talker.git --depth 1 | |
| # git lfs clone https://www.modelscope.cn/Kedreamix/Linly-Talker.git --depth 1 | |
| # 2. Python ไปฃ็ ไธ่ฝฝ | |
| pip install modelscope | |
| from modelscope import snapshot_download | |
| model_dir = snapshot_download('Kedreamix/Linly-Talker', resume_download=True, cache_dir='./', revision='master') | |
| ``` | |
| **็งปๅจๆๆๆจกๅๅฐๅฝๅ็ฎๅฝ** | |
| ๅฆๆ็พๅบฆ็ฝ็ไธ่ฝฝๅ๏ผๅฏไปฅๅ่ๆๆกฃๆๅ็ฎๅฝ็ปๆๆฅ็งปๅจ็ฎๅฝ | |
| ```bash | |
| # ็งปๅจๆๆๆจกๅๅฐๅฝๅ็ฎๅฝ | |
| # checkpointไธญๅซๆSadTalkerๅWav2Lip็ญๆ้ | |
| mv Linly-Talker/checkpoints/* ./checkpoints | |
| # ่ฅไฝฟ็จGFPGANๅขๅผบ๏ผๅฎ่ฃ ๅฏนๅบ็ๅบ | |
| # pip install gfpgan | |
| # mv Linly-Talker/gfpan ./ | |
| # ่ฏญ้ณๅ ้ๆจกๅ | |
| mv Linly-Talker/GPT_SoVITS/pretrained_models/* ./GPT_SoVITS/pretrained_models/ | |
| # Qwenๅคงๆจกๅ | |
| mv Linly-Talker/Qwen ./ | |
| # MuseTalkๆจกๅ | |
| mkdir -p ./Musetalk/models | |
| mv Linly-Talker/MuseTalk/* ./Musetalk/models | |
| ``` | |
| ไธบไบๅคงๅฎถ็้จ็ฝฒไฝฟ็จๆนไพฟ๏ผๆดๆฐไบไธไธช`configs.py`ๆไปถ๏ผๅฏไปฅๅฏนๅ ถ่ฟ่กไธไบ่ถ ๅๆฐไฟฎๆนๅณๅฏ | |
| ```bash | |
| # ่ฎพๅค่ฟ่ก็ซฏๅฃ (Device running port) | |
| port = 6006 | |
| # api่ฟ่ก็ซฏๅฃๅIP (API running port and IP) | |
| mode = 'api' # api ้่ฆๅ ่ฟ่กLinly-api-fast.py๏ผๆๆถไป ไป ้็จไบLinly | |
| # ๆฌๅฐ็ซฏๅฃlocalhost:127.0.0.1 ๅ จๅฑ็ซฏๅฃ่ฝฌๅ:"0.0.0.0" | |
| ip = '127.0.0.1' | |
| api_port = 7871 | |
| # LLMๆจกๅ่ทฏๅพ (Linly model path) | |
| mode = 'offline' | |
| model_path = 'Qwen/Qwen-1_8B-Chat' | |
| # ssl่ฏไนฆ (SSL certificate) ้บฆๅ ้ฃๅฏน่ฏ้่ฆๆญคๅๆฐ | |
| # ๆๅฅฝ่ฐๆดไธบ็ปๅฏน่ทฏๅพ | |
| ssl_certfile = "./https_cert/cert.pem" | |
| ssl_keyfile = "./https_cert/key.pem" | |
| ``` | |
| ## ASR - Speech Recognition | |
| ่ฏฆ็ปๆๅ ณไบ่ฏญ้ณ่ฏๅซ็**ไฝฟ็จไป็ป**ไธ**ไปฃ็ ๅฎ็ฐ**ๅฏ่ง [ASR - ๅๆฐๅญไบบๆฒ้็ๆกฅๆข](./ASR/README.md) | |
| ### Whisper | |
| ๅ้ดOpenAI็Whisperๅฎ็ฐไบASR็่ฏญ้ณ่ฏๅซ๏ผๅ ทไฝไฝฟ็จๆนๆณๅ่ [https://github.com/openai/whisper](https://github.com/openai/whisper) | |
| ### FunASR | |
| ้ฟ้็`FunASR`็่ฏญ้ณ่ฏๅซๆๆไนๆฏ็ธๅฝไธ้๏ผ่ไธๆถ้ดไนๆฏๆฏwhisperๆดๅฟซ็๏ผๅฏนไธญๆๅฎ้ ไธๆฏๆดๅฅฝ็ใ | |
| ๅๆถfunasrๆด่ฝ่พพๅฐๅฎๆถ็ๆๆ๏ผๆไปฅไนๅฐFunASRๆทปๅ ่ฟๅปไบ๏ผๅจASRๆไปถๅคนไธ็FunASRๆไปถ้ๅฏไปฅ่ฟ่กไฝ้ช๏ผๅ่ [https://github.com/alibaba-damo-academy/FunASR](https://github.com/alibaba-damo-academy/FunASR)ใ | |
| ### Coming Soon | |
| ๆฌข่ฟๅคงๅฎถๆๅบๅปบ่ฎฎ๏ผๆฟๅฑๆไธๆญๆดๆฐๆจกๅ๏ผไธฐๅฏLinly-Talker็ๅ่ฝใ | |
| ## TTS Text To Speech | |
| ่ฏฆ็ปๆๅ ณไบ่ฏญ้ณ่ฏๅซ็**ไฝฟ็จไป็ป**ไธ**ไปฃ็ ๅฎ็ฐ**ๅฏ่ง [TTS - ่ตไบๆฐๅญไบบ็ๅฎ็่ฏญ้ณไบคไบ่ฝๅ](./TTS/README.md) | |
| ### Edge TTS | |
| ๅ้ดไฝฟ็จๅพฎ่ฝฏ่ฏญ้ณๆๅก๏ผๅ ทไฝไฝฟ็จๆนๆณๅ่[https://github.com/rany2/edge-tts](https://github.com/rany2/edge-tts) | |
| ### PaddleTTS | |
| ๅจๅฎ้ ไฝฟ็จ่ฟ็จไธญ๏ผๅฏ่ฝไผ้ๅฐ้่ฆ็ฆป็บฟๆไฝ็ๆ ๅตใ็ฑไบEdge TTS้่ฆๅจ็บฟ็ฏๅขๆ่ฝ็ๆ่ฏญ้ณ๏ผๅ ๆญคๆไปฌ้ๆฉไบๅๆ ทๅผๆบ็PaddleSpeechไฝไธบๆๆฌๅฐ่ฏญ้ณ๏ผTTS๏ผ็ๆฟไปฃๆนๆกใ่ฝ็ถๆๆๅฏ่ฝๆๆไธๅ๏ผไฝPaddleSpeechๆฏๆ็ฆป็บฟๆไฝใๆดๅคไฟกๆฏๅฏๅ่PaddleSpeech็GitHub้กต้ข๏ผ[PaddleSpeech](https://github.com/PaddlePaddle/PaddleSpeech)ใ | |
| ### Coming Soon | |
| ๆฌข่ฟๅคงๅฎถๆๅบๅปบ่ฎฎ๏ผๆฟๅฑๆไธๆญๆดๆฐๆจกๅ๏ผไธฐๅฏLinly-Talker็ๅ่ฝใ | |
| ## Voice Clone | |
| ่ฏฆ็ปๆๅ ณไบ่ฏญ้ณๅ ้็**ไฝฟ็จไป็ป**ไธ**ไปฃ็ ๅฎ็ฐ**ๅฏ่ง [Voice Clone - ๅจๅฏน่ฏๆถๆๆๅท่ตฐไฝ ็ๅฃฐ้ณ](./VITS/README.md) | |
| ### GPT-SoVITS๏ผๆจ่๏ผ | |
| ๆ่ฐขๅคงๅฎถ็ๅผๆบ่ดก็ฎ๏ผๆๅ้ดไบๅฝๅๅผๆบ็่ฏญ้ณๅ ้ๆจกๅ `GPT-SoVITS`๏ผๆ่ฎคไธบๆๆๆฏ็ธๅฝไธ้็๏ผ้กน็ฎๅฐๅๅฏๅ่[https://github.com/RVC-Boss/GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS) | |
| ๆๅฐไธไบ่ฎญ็ปๅฅฝ็ๅ ้ๆ้ๆพๅจไบ[Quark(ๅคธๅ ็ฝ็)](https://pan.quark.cn/s/f48f5e35796b)ไธญ๏ผๅคงๅฎถๅฏไปฅ่ชๅๆ้ๅๅ่้ณ้ขใ | |
| ### XTTS | |
| Coqui XTTSๆฏไธไธช้ขๅ ็ๆทฑๅบฆๅญฆไน ๆๆฌๅฐ่ฏญ้ณไปปๅก๏ผTTS่ฏญ้ณ็ๆๆจกๅ๏ผๅทฅๅ ทๅ ๏ผ้่ฟไฝฟ็จไธๆฎต5็ง้ไปฅไธ็่ฏญ้ณ้ขๅช่พๅฐฑๅฏไปฅๅฎๆๅฃฐ้ณๅ ้*ๅฐ่ฏญ้ณๅ ้ๅฐไธๅ็่ฏญ่จ*ใ | |
| ๐ธTTS ๆฏไธไธช็จไบ้ซ็บงๆๆฌ่ฝฌ่ฏญ้ณ็ๆ็ๅบใ | |
| ๐ ่ถ ่ฟ 1100 ็ง่ฏญ่จ็้ข่ฎญ็ปๆจกๅใ | |
| ๐ ๏ธ ็จไบไปฅไปปไฝ่ฏญ่จ่ฎญ็ปๆฐๆจกๅๅๅพฎ่ฐ็ฐๆๆจกๅ็ๅทฅๅ ทใ | |
| ๐ ็จไบๆฐๆฎ้ๅๆๅ็ฎก็็ๅฎ็จ็จๅบใ | |
| - ๅจ็บฟไฝ้ชXTTS [https://huggingface.co/spaces/coqui/xtts](https://huggingface.co/spaces/coqui/xtts) | |
| - ๅฎๆนGithubๅบ https://github.com/coqui-ai/TTS | |
| ### Coming Soon | |
| ๆฌข่ฟๅคงๅฎถๆๅบๅปบ่ฎฎ๏ผๆฟๅฑๆไธๆญๆดๆฐๆจกๅ๏ผไธฐๅฏLinly-Talker็ๅ่ฝใ | |
| ## THG - Avatar | |
| ่ฏฆ็ปๆๅ ณไบๆฐๅญไบบ็ๆ็**ไฝฟ็จไป็ป**ไธ**ไปฃ็ ๅฎ็ฐ**ๅฏ่ง [THG - ๆๅปบๆบ่ฝๆฐๅญไบบ](./TFG/README.md) | |
| ### SadTalker | |
| ๆฐๅญไบบ็ๆๅฏไฝฟ็จSadTalker๏ผCVPR 2023๏ผ,่ฏฆๆ ไป็ป่ง [https://sadtalker.github.io](https://sadtalker.github.io) | |
| ๅจไฝฟ็จๅๅ ไธ่ฝฝSadTalkerๆจกๅ: | |
| ```bash | |
| bash scripts/sadtalker_download_models.sh | |
| ``` | |
| [Baidu (็พๅบฆไบ็)](https://pan.baidu.com/s/1eF13O-8wyw4B3MtesctQyg?pwd=linl) (Password: `linl`) | |
| [Quark(ๅคธๅ ็ฝ็)](https://pan.quark.cn/s/f48f5e35796b) | |
| > ๅฆๆ็พๅบฆ็ฝ็ไธ่ฝฝ๏ผ่ฎฐไฝๆฏๆพๅจcheckpointsๆไปถๅคนไธ๏ผ็พๅบฆ็ฝ็ไธ่ฝฝ็้ป่ฎคๅฝๅไธบsadtalker๏ผๅฎ้ ๅบ่ฏฅ้ๅฝๅไธบcheckpoints | |
| ### Wav2Lip | |
| ๆฐๅญไบบ็ๆ่ฟๅฏไฝฟ็จWav2Lip๏ผACM 2020๏ผ๏ผ่ฏฆๆ ไป็ป่ง [https://github.com/Rudrabha/Wav2Lip](https://github.com/Rudrabha/Wav2Lip) | |
| ๅจไฝฟ็จๅๅ ไธ่ฝฝWav2Lipๆจกๅ๏ผ | |
| | Model | Description | Link to the model | | |
| | ---------------------------- | ----------------------------------------------------- | ------------------------------------------------------------ | | |
| | Wav2Lip | Highly accurate lip-sync | [Link](https://iiitaphyd-my.sharepoint.com/:u:/g/personal/radrabha_m_research_iiit_ac_in/Eb3LEzbfuKlJiR600lQWRxgBIY27JZg80f7V9jtMfbNDaQ?e=TBFBVW) | | |
| | Wav2Lip + GAN | Slightly inferior lip-sync, but better visual quality | [Link](https://iiitaphyd-my.sharepoint.com/:u:/g/personal/radrabha_m_research_iiit_ac_in/EdjI7bZlgApMqsVoEUUXpLsBxqXbn5z8VTmoxp55YNDcIA?e=n9ljGW) | | |
| | Expert Discriminator | Weights of the expert discriminator | [Link](https://iiitaphyd-my.sharepoint.com/:u:/g/personal/radrabha_m_research_iiit_ac_in/EQRvmiZg-HRAjvI6zqN9eTEBP74KefynCwPWVmF57l-AYA?e=ZRPHKP) | | |
| | Visual Quality Discriminator | Weights of the visual disc trained in a GAN setup | [Link](https://iiitaphyd-my.sharepoint.com/:u:/g/personal/radrabha_m_research_iiit_ac_in/EQVqH88dTm1HjlK11eNba5gBbn15WMS0B0EZbDBttqrqkg?e=ic0ljo) | | |
| ### ER-NeRF | |
| ER-NeRF๏ผICCV2023๏ผๆฏไฝฟ็จๆๆฐ็NeRFๆๆฏๆๅปบ็ๆฐๅญไบบ๏ผๆฅๆๅฎๅถๆฐๅญไบบ็็นๆง๏ผๅช้่ฆไธไธชไบบ็ไบๅ้ๅทฆๅณๅฐ่ง้ขๅณๅฏ้ๅปบๅบๆฅ๏ผๅ ทไฝๅฏๅ่ [https://github.com/Fictionarry/ER-NeRF](https://github.com/Fictionarry/ER-NeRF) | |
| ๅทฒๆดๆฐ๏ผไปฅๅฅฅๅทด้ฉฌๅฝข่ฑกไฝไธบๅ่๏ผ่ฅ่่ๆดๅฅฝ็ๆๆ๏ผๅฏ่ฝ่่ๅ ้ๅฎๅถๆฐๅญไบบ็ๅฃฐ้ณไปฅๅพๅฐๆดๅฅฝ็ๆๆใ | |
| ### MuseTalk | |
| MuseTalk ๆฏไธไธชๅฎๆถ้ซ่ดจ้็้ณ้ข้ฉฑๅจๅๅฝขๅๆญฅๆจกๅ๏ผ่ฝๅคไปฅ30ๅธงๆฏ็งไปฅไธ็้ๅบฆๅจNVIDIA Tesla V100ๆพๅกไธ่ฟ่กใ่ฏฅๆจกๅๅฏไปฅไธ็ฑ MuseV ็ๆ็่พๅ ฅ่ง้ข็ปๅไฝฟ็จ๏ผไฝไธบๅฎๆด็่ๆไบบ่งฃๅณๆนๆก็ไธ้จๅใๅ ทไฝๅฏๅ่ [https://github.com/TMElyralab/MuseTalk](https://github.com/TMElyralab/MuseTalk) | |
| MuseTalk ๆฏไธไธชๅฎๆถ้ซ่ดจ้็้ณ้ข้ฉฑๅจๅๅฝขๅๆญฅๆจกๅ๏ผ็ป่ฟ่ฎญ็ปๅฏไปฅๅจ ft-mse-vae ็ๆฝๅจ็ฉบ้ดไธญ่ฟ่กๅทฅไฝใๅฎๅ ทๆไปฅไธ็นๆง๏ผ | |
| - **ๆช่ง้ขๅญ็ๅๆญฅ**๏ผๆ นๆฎ่พๅ ฅ็้ณ้ขๅฏนๆช่ง่ฟ็้ขๅญ่ฟ่กไฟฎๆน๏ผ้ข้จๅบๅ็ๅคงๅฐไธบ 256 x 256ใ | |
| - **ๅค่ฏญ่จๆฏๆ**๏ผๆฏๆๅค็ง่ฏญ่จ็้ณ้ข่พๅ ฅ๏ผๅ ๆฌไธญๆใ่ฑ่ฏญๅๆฅ่ฏญใ | |
| - **้ซๆง่ฝๅฎๆถๆจ็**๏ผๅจ NVIDIA Tesla V100 ไธๅฏไปฅๅฎ็ฐ 30ๅธงๆฏ็งไปฅไธ็ๅฎๆถๆจ็ใ | |
| - **้ข้จไธญๅฟ็น่ฐๆด**๏ผๆฏๆไฟฎๆน้ข้จๅบๅ็ไธญๅฟ็นไฝ็ฝฎ๏ผ่ฟๅฏน็ๆ็ปๆๆๆพ่ๅฝฑๅใ | |
| - **HDTF ๆฐๆฎ้่ฎญ็ป**๏ผๆไพๅจ HDTF ๆฐๆฎ้ไธ่ฎญ็ป็ๆจกๅๆฃๆฅ็นใ | |
| - **่ฎญ็ปไปฃ็ ๅณๅฐๅๅธ**๏ผ่ฎญ็ปไปฃ็ ๅณๅฐๅๅธ๏ผๆนไพฟ่ฟไธๆญฅ็ๅผๅๅ็ ็ฉถใ | |
| MuseTalk ๆไพไบไธไธช้ซๆไธ็ตๆดป็ๅทฅๅ ท๏ผไฝฟ่ๆไบบ็้ข้จ่กจๆ ่ฝๅค็ฒพ็กฎๅๆญฅไบ้ณ้ข๏ผไธบๅฎ็ฐๅ จๆนไฝไบๅจ็่ๆไบบ่ฟๅบไบ้่ฆไธๆญฅใ | |
| ๅจLinly-Talkerไธญๅทฒ็ปๅ ๅ ฅไบMuseTalk๏ผๅบไบMuseV็่ง้ข่ฟ่กๆจ็๏ผๅพๅฐไบๆฏ่พ็ๆณ็้ๅบฆ่ฟ่กๅฏน่ฏ๏ผๅบๆฌ่พพๅฐๅฎๆถ็ๆๆ๏ผ่ฟๆฏ้ๅธธไธ้็๏ผไนๆฏๅฏไปฅๅบไบๆตๅผ่ฟ่กๆจ็็ใ | |
| ### Coming Soon | |
| ๆฌข่ฟๅคงๅฎถๆๅบๅปบ่ฎฎ๏ผๆฟๅฑๆไธๆญๆดๆฐๆจกๅ๏ผไธฐๅฏLinly-Talker็ๅ่ฝใ | |
| ## LLM - Conversation | |
| ่ฏฆ็ปๆๅ ณไบๅคงๆจกๅ็**ไฝฟ็จไป็ป**ไธ**ไปฃ็ ๅฎ็ฐ**ๅฏ่ง [LLM - ๅคง่ฏญ่จๆจกๅไธบๆฐๅญไบบ่ต่ฝ](./LLM/README.md) | |
| ### Linly-AI | |
| Linlyๆฅ่ชๆทฑๅณๅคงๅญฆๆฐๆฎๅทฅ็จๅฝๅฎถ้็นๅฎ้ชๅฎค๏ผๅ่ [https://github.com/CVI-SZU/Linly](https://github.com/CVI-SZU/Linly) | |
| ### Qwen | |
| ๆฅ่ช้ฟ้ไบ็Qwen๏ผๆฅ็ [https://github.com/QwenLM/Qwen](https://github.com/QwenLM/Qwen) | |
| ๅฆๆๆณ่ฆๅฟซ้ไฝฟ็จ๏ผๅฏไปฅ้1.8B็ๆจกๅ๏ผๅๆฐๆฏ่พๅฐ๏ผๅจ่พๅฐ็ๆพๅญไนๅฏไปฅๆญฃๅธธไฝฟ็จ๏ผๅฝ็ถ่ฟไธ้จๅๅฏไปฅๆฟๆข | |
| ไธ่ฝฝ Qwen1.8B ๆจกๅ: [https://huggingface.co/Qwen/Qwen-1_8B-Chat](https://huggingface.co/Qwen/Qwen-1_8B-Chat) | |
| ### Gemini-Pro | |
| ๆฅ่ช Google ็ Gemini-Pro๏ผไบ่งฃๆดๅค่ฏท่ฎฟ้ฎ [https://deepmind.google/technologies/gemini/](https://deepmind.google/technologies/gemini/) | |
| ่ฏทๆฑ API ๅฏ้ฅ: [https://makersuite.google.com/](https://makersuite.google.com/) | |
| ### ChatGPT | |
| ๆฅ่ชOpenAI็๏ผ้่ฆ็ณ่ฏทAPI๏ผไบ่งฃๆดๅค่ฏท่ฎฟ้ฎ [https://platform.openai.com/docs/introduction](https://platform.openai.com/docs/introduction) | |
| ### ChatGLM | |
| ๆฅ่ชๆธ ๅ็๏ผไบ่งฃๆดๅค่ฏท่ฎฟ้ฎ [https://github.com/THUDM/ChatGLM3](https://github.com/THUDM/ChatGLM3) | |
| ### GPT4Free | |
| ๅฏๅ่[https://github.com/xtekky/gpt4free](https://github.com/xtekky/gpt4free)๏ผๅ ่ดน็ฝๅซไฝฟ็จGPT4็ญๆจกๅ | |
| ### LLM ๅคๆจกๅ้ๆฉ | |
| ๅจ webui.py ๆไปถไธญ๏ผ่ฝปๆพ้ๆฉๆจ้่ฆ็ๆจกๅ๏ผโ ๏ธ็ฌฌไธๆฌก่ฟ่ก่ฆๅ ไธ่ฝฝๆจกๅ๏ผๅ่Qwen1.8B | |
| ### Coming Soon | |
| ๆฌข่ฟๅคงๅฎถๆๅบๅปบ่ฎฎ๏ผๆฟๅฑๆไธๆญๆดๆฐๆจกๅ๏ผไธฐๅฏLinly-Talker็ๅ่ฝใ | |
| ## ไผๅ | |
| ไธไบไผๅ: | |
| - ไฝฟ็จๅบๅฎ็่พๅ ฅไบบ่ธๅพๅ,ๆๅๆๅ็นๅพ,้ฟๅ ๆฏๆฌก่ฏปๅ | |
| - ็งป้คไธๅฟ ่ฆ็ๅบ,็ผฉ็ญๆปๆถ้ด | |
| - ๅชไฟๅญๆ็ป่ง้ข่พๅบ,ไธไฟๅญไธญ้ด็ปๆ,ๆ้ซๆง่ฝ | |
| - ไฝฟ็จOpenCV็ๆๆ็ป่ง้ข,ๆฏmimwriteๆดๅฟซ | |
| ## Gradio | |
| GradioๆฏไธไธชPythonๅบ,ๆไพไบไธ็ง็ฎๅ็ๆนๅผๅฐๆบๅจๅญฆไน ๆจกๅไฝไธบไบคไบๅผWebๅบ็จ็จๅบๆฅ้จ็ฝฒใ | |
| ๅฏนLinly-Talker่่จ,ไฝฟ็จGradioๆไธคไธชไธป่ฆ็ฎ็: | |
| 1. **ๅฏ่งๅไธๆผ็คบ**:Gradioไธบๆจกๅๆไพไธไธช็ฎๅ็Web GUI,ไธไผ ๅพ็ๅๆๆฌๅๅฏไปฅ็ด่งๅฐ็ๅฐ็ปๆใ่ฟๆฏๅฑ็คบ็ณป็ป่ฝๅ็ๆๆๆนๅผใ | |
| 2. **็จๆทไบคไบ**:Gradio็GUIๅฏไปฅไฝไธบๅ็ซฏ,ๅ ่ฎธ็จๆทไธLinly-Talker่ฟ่กไบคไบๅฏน่ฏใ็จๆทๅฏไปฅไธไผ ่ชๅทฑ็ๅพ็ๅนถ่พๅ ฅ้ฎ้ข,ๅฎๆถ่ทๅๅ็ญใ่ฟๆไพไบๆด่ช็ถ็่ฏญ้ณไบคไบๆนๅผใ | |
| ๅ ทไฝๆฅ่ฏด,ๆไปฌๅจapp.pyไธญๅๅปบไบไธไธชGradio็Interface,ๆฅๆถๅพ็ๅๆๆฌ่พๅ ฅ,่ฐ็จๅฝๆฐ็ๆๅๅบ่ง้ข,ๅจGUIไธญๆพ็คบๅบๆฅใ่ฟๆ ทๅฐฑๅฎ็ฐไบๆต่งๅจไบคไบ่ไธ้่ฆ็ผๅๅคๆ็ๅ็ซฏใ | |
| ๆปไน,GradioไธบLinly-Talkerๆไพไบๅฏ่งๅๅ็จๆทไบคไบ็ๆฅๅฃ,ๆฏๅฑ็คบ็ณป็ปๅ่ฝๅ่ฎฉๆ็ป็จๆทไฝฟ็จ็ณป็ป็ๆๆ้ๅพใ | |
| > ่ฅ่่ๅฎๆถๅฏน่ฏ๏ผๅฏ่ฝ้่ฆๆขไธชๆกๆถ๏ผๆ่ ๅฏนGradio่ฟ่ก้ญๆน๏ผๅธๆๅๅคงๅฎถไธ่ตทๅชๅ | |
| ## ๅฏๅจWebUI | |
| ไนๅๆๅฐๅพๅคไธช็ๆฌ้ฝๆฏๅๅผๆฅ็๏ผๅฎ้ ไธ่ฟ่กๅคไธชไผๆฏ่พ้บป็ฆ๏ผๆไปฅๅ็ปญๆๅขๅ ไบๅๆWebUIไธไธช็้ขๅณๅฏไฝ้ช๏ผๅ็ปญไนไผไธๆญๆดๆฐ | |
| ### WebUI | |
| ็ฐๅจๅทฒๅ ๅ ฅWebUI็ๅ่ฝๅฆไธ | |
| - [x] ๆๆฌ/่ฏญ้ณๆฐๅญไบบๅฏน่ฏ๏ผๅบๅฎๆฐๅญไบบ๏ผๅ็ทๅฅณ่ง่ฒ๏ผ | |
| - [x] ไปปๆๅพ็ๆฐๅญไบบๅฏน่ฏ๏ผๅฏไธไผ ไปปๆๅพ็ๆฐๅญไบบ๏ผ | |
| - [x] ๅค่ฝฎGPTๅฏน่ฏ๏ผๅ ๅ ฅๅๅฒๅฏน่ฏๆฐๆฎ๏ผ้พๆฅไธไธๆ๏ผ | |
| - [x] ่ฏญ้ณๅ ้ๅฏน่ฏ๏ผๅบไบGPT-SoVITS่ฎพ็ฝฎ่ฟ่ก่ฏญ้ณๅ ้๏ผไนๅฏๆ นๆฎ่ฏญ้ณๅฏน่ฏ็ๅฃฐ้ณ่ฟ่กๅ ้๏ผ | |
| - [x] ๆฐๅญไบบๆๆฌ/่ฏญ้ณๆญๆฅ๏ผๆ นๆฎ่พๅ ฅ็ๆๅญ/่ฏญ้ณ่ฟ่กๆญๆฅ๏ผ | |
| - [x] ๅคๆจกๅโๅคๆจกๅโๅค้ๆฉ | |
| - [x] ่ง่ฒๅค้ๆฉ๏ผๅฅณๆง่ง่ฒ/็ทๆง่ง่ฒ/่ชๅฎไน่ง่ฒ(ๆฏไธ้จๅ้ฝๅฏไปฅ่ชๅจไธไผ ๅพ็)/Comming Soon | |
| - [x] TTSๆจกๅๅค้ๆฉ๏ผEdgeTTS / PaddleTTS/ GPT-SoVITS/Comming Soon | |
| - [x] LLMๆจกๅๅค้ๆฉ๏ผ Linly/ Qwen / ChatGLM/ GeminiPro/ ChatGPT/Comming Soon | |
| - [x] Talkerๆจกๅๅค้ๆฉ๏ผWav2Lip/ SadTalker/ ERNeRF/ MuseTalk/Comming Soon | |
| - [x] ASRๆจกๅๅค้ๆฉ๏ผWhisper/ FunASR/Comming Soon | |
|  | |
| ๅฏไปฅ็ดๆฅ่ฟ่กwebuiๆฅๅพๅฐ็ปๆ๏ผๅฏไปฅ็ๅฐ็้กต้ขๅฆไธ | |
| ```bash | |
| # WebUI | |
| python webui.py | |
| ``` | |
|  | |
| ่ฟๆฌกๆดๆฐไบไธไธ็้ข๏ผๆไปฌๅฏไปฅ่ช็ฑ้ๆฉGPT-SoVITSๅพฎ่ฐๅ็ๆจกๅๆฅๅฎ็ฐ๏ผไธไผ ๅ่้ณ้ขๅณๅฏๅพๅฅฝ็ๅ ้ๅฃฐ้ณ | |
|  | |
| ### Old Verison | |
| > ่ฟไธ้จๅๆฏไธบไบไฟ่ฏๆฏ้จไปฝไปฃ็ ้ฝๆฏๆญฃ็กฎ็๏ผๆไปฅไผๅ ๅฏนๆฏไธไธชๆจกๅ้ฝ่ฟ่กๆต่ฏๅๆน่ฟ | |
| ๅฏๅจไธๅ ฑๆๅ ็งๆจกๅผ๏ผๅฏไปฅ้ๆฉ็นๅฎ็ๅบๆฏ่ฟ่ก่ฎพ็ฝฎ | |
| ็ฌฌไธ็งๅชๆๅบๅฎไบไบบ็ฉ้ฎ็ญ๏ผ่ฎพ็ฝฎๅฅฝไบไบบ็ฉ๏ผ็ๅปไบ้ขๅค็ๆถ้ด | |
| ```bash | |
| python app.py | |
| ``` | |
|  | |
| ๆ่ฟๆดๆฐไบ็ฌฌไธ็งๆจกๅผ๏ผๅ ๅ ฅไบWav2Lipๆจกๅ่ฟ่กๅฏน่ฏ | |
| ```bash | |
| python appv2.py | |
| ``` | |
| ็ฌฌไบ็งๆฏๅฏไปฅไปปๆไธไผ ๅพ็่ฟ่กๅฏน่ฏ | |
| ```bash | |
| python app_img.py | |
| ``` | |
|  | |
| ็ฌฌไธ็งๆฏๅจ็ฌฌไธ็ง็ๅบ็กไธๅ ๅ ฅไบๅคง่ฏญ่จๆจกๅ๏ผๅ ๅ ฅไบๅค่ฝฎ็GPTๅฏน่ฏ | |
| ```bash | |
| python app_multi.py | |
| ``` | |
|  | |
| ็ฐๅจๅ ๅ ฅไบ่ฏญ้ณๅ ้็้จๅ๏ผๅฏไปฅ่ช็ฑๅๆข่ชๅทฑๅ ้็ๅฃฐ้ณๆจกๅๅๅฏนๅบ็ไบบๅพ็่ฟ่กๅฎ็ฐ๏ผ่ฟ้ๆ้ๆฉไบไธไธช็ๅ้ณๅ็ท็ๅพ็ | |
| ```bash | |
| python app_vits.py | |
| ``` | |
| ๅ ๅ ฅไบ็ฌฌๅ็งๆนๅผ๏ผไธๅบๅฎๅบๆฏ่ฟ่กๅฏน่ฏ๏ผ็ดๆฅ่พๅ ฅ่ฏญ้ณๆ่ ็ๆ่ฏญ้ณ่ฟ่กๆฐๅญไบบ็ๆ๏ผๅ ็ฝฎไบSadtalker๏ผWav2Lip๏ผER-NeRF็ญๆนๅผ | |
| > ER-NeRFๆฏ้ๅฏนๅ็ฌไธไธชไบบ็่ง้ข่ฟ่ก่ฎญ็ป็๏ผๆไปฅ้่ฆๆฟๆข็นๅฎ็ๆจกๅๆ่ฝ่ฟ่กๆธฒๆๅพๅฐๆญฃ็กฎ็็ปๆ๏ผๅ ็ฝฎไบObama็ๆ้๏ผๅฏ็ดๆฅ็จ | |
| ```bash | |
| python app_talk.py | |
| ``` | |
|  | |
| ๅ ๅ ฅไบMuseTalk็ๆนๅผ๏ผ่ฝๅคๅฐMuseV็่ง้ข่ฟ่ก้ขๅค็๏ผ้ขๅค็ๅ่ฟ่กๅฏน่ฏ๏ผ้ๅบฆๅบๆฌ่ฝๅค่พพๅฐๅฎๆถ็่ฆๆฑ๏ผ้ๅบฆ้ๅธธๅฟซ๏ผMuseTalkๅทฒๅ ๅ ฅๅจWebUIไธญใ | |
| ```bash | |
| python app_musetalk.py | |
| ``` | |
|  | |
| ## ๆไปถๅคน็ปๆ | |
| ๆๆ็ๆ้้จๅๅฏไปฅไป่ฟไธ่ฝฝ๏ผ็พๅบฆ็ฝ็ๅฏ่ฝๆๆถๅไผๆดๆฐๆ ขไธ็น๏ผๅปบ่ฎฎไปๅคธๅ ็ฝ็ไธ่ฝฝ๏ผไผ็ฌฌไธๆถ้ดๆดๆฐ | |
| - [Baidu (็พๅบฆไบ็)](https://pan.baidu.com/s/1eF13O-8wyw4B3MtesctQyg?pwd=linl) (Password: `linl`) | |
| - [huggingface](https://huggingface.co/Kedreamix/Linly-Talker) | |
| - [modelscope](https://www.modelscope.cn/models/Kedreamix/Linly-Talker/files) | |
| - [Quark(ๅคธๅ ็ฝ็)](https://pan.quark.cn/s/f48f5e35796b) | |
| ๆ้ๆไปถๅคน็ปๆๅฆไธ | |
| ```bash | |
| Linly-Talker/ | |
| โโโ checkpoints | |
| โ โโโ audio_visual_encoder.pth | |
| โ โโโ hub | |
| โ โ โโโ checkpoints | |
| โ โ โโโ s3fd-619a316812.pth | |
| โ โโโ lipsync_expert.pth | |
| โ โโโ mapping_00109-model.pth.tar | |
| โ โโโ mapping_00229-model.pth.tar | |
| โ โโโ May.json | |
| โ โโโ May.pth | |
| โ โโโ Obama_ave.pth | |
| โ โโโ Obama.json | |
| โ โโโ Obama.pth | |
| โ โโโ ref_eo.npy | |
| โ โโโ ref.npy | |
| โ โโโ ref.wav | |
| โ โโโ SadTalker_V0.0.2_256.safetensors | |
| โ โโโ visual_quality_disc.pth | |
| โ โโโ wav2lip_gan.pth | |
| โ โโโ wav2lip.pth | |
| โโโ gfpgan | |
| โย ย โโโ weights | |
| โย ย โโโ alignment_WFLW_4HG.pth | |
| โย ย โโโ detection_Resnet50_Final.pth | |
| โโโ GPT_SoVITS | |
| โย ย โโโ pretrained_models | |
| โย ย โโโ chinese-hubert-base | |
| โย ย โย ย โโโ config.json | |
| โย ย โย ย โโโ preprocessor_config.json | |
| โย ย โย ย โโโ pytorch_model.bin | |
| โย ย โโโ chinese-roberta-wwm-ext-large | |
| โย ย โย ย โโโ config.json | |
| โย ย โย ย โโโ pytorch_model.bin | |
| โย ย โย ย โโโ tokenizer.json | |
| โย ย โโโ README.md | |
| โย ย โโโ s1bert25hz-2kh-longer-epoch=68e-step=50232.ckpt | |
| โย ย โโโ s2D488k.pth | |
| โย ย โโโ s2G488k.pth | |
| โย ย โโโ speech_paraformer-large_asr_nat-zh-cn-16k-common-vocab8404-pytorch | |
| โโโ MuseTalk | |
| โ โโโ models | |
| โ โ โโโ dwpose | |
| โ โ โ โโโ dw-ll_ucoco_384.pth | |
| โ โ โโโ face-parse-bisent | |
| โ โ โ โโโ 79999_iter.pth | |
| โ โ โ โโโ resnet18-5c106cde.pth | |
| โ โ โโโ musetalk | |
| โ โ โ โโโ musetalk.json | |
| โ โ โ โโโ pytorch_model.bin | |
| โ โ โโโ README.md | |
| โ โ โโโ sd-vae-ft-mse | |
| โ โ โ โโโ config.json | |
| โ โ โ โโโ diffusion_pytorch_model.bin | |
| โ โ โโโ whisper | |
| โ โ โโโ tiny.pt | |
| โโโ Qwen | |
| โย ย โโโ Qwen-1_8B-Chat | |
| โย ย โโโ assets | |
| โย ย โย ย โโโ logo.jpg | |
| โย ย โย ย โโโ qwen_tokenizer.png | |
| โย ย โย ย โโโ react_showcase_001.png | |
| โย ย โย ย โโโ react_showcase_002.png | |
| โย ย โย ย โโโ wechat.png | |
| โย ย โโโ cache_autogptq_cuda_256.cpp | |
| โย ย โโโ cache_autogptq_cuda_kernel_256.cu | |
| โย ย โโโ config.json | |
| โย ย โโโ configuration_qwen.py | |
| โย ย โโโ cpp_kernels.py | |
| โย ย โโโ examples | |
| โย ย โย ย โโโ react_prompt.md | |
| โย ย โโโ generation_config.json | |
| โย ย โโโ LICENSE | |
| โย ย โโโ model-00001-of-00002.safetensors | |
| โย ย โโโ model-00002-of-00002.safetensors | |
| โย ย โโโ modeling_qwen.py | |
| โย ย โโโ model.safetensors.index.json | |
| โย ย โโโ NOTICE | |
| โย ย โโโ qwen_generation_utils.py | |
| โย ย โโโ qwen.tiktoken | |
| โย ย โโโ README.md | |
| โย ย โโโ tokenization_qwen.py | |
| โย ย โโโ tokenizer_config.json | |
| โโโ Whisper | |
| โ โโโ base.pt | |
| โ โโโ tiny.pt | |
| โโโ FunASR | |
| โ โโโ punc_ct-transformer_zh-cn-common-vocab272727-pytorch | |
| โ โ โโโ configuration.json | |
| โ โ โโโ config.yaml | |
| โ โ โโโ example | |
| โ โ โ โโโ punc_example.txt | |
| โ โ โโโ fig | |
| โ โ โ โโโ struct.png | |
| โ โ โโโ model.pt | |
| โ โ โโโ README.md | |
| โ โ โโโ tokens.json | |
| โ โโโ speech_fsmn_vad_zh-cn-16k-common-pytorch | |
| โ โ โโโ am.mvn | |
| โ โ โโโ configuration.json | |
| โ โ โโโ config.yaml | |
| โ โ โโโ example | |
| โ โ โ โโโ vad_example.wav | |
| โ โ โโโ fig | |
| โ โ โ โโโ struct.png | |
| โ โ โโโ model.pt | |
| โ โ โโโ README.md | |
| โ โโโ speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch | |
| โ โโโ am.mvn | |
| โ โโโ asr_example_hotword.wav | |
| โ โโโ configuration.json | |
| โ โโโ config.yaml | |
| โ โโโ example | |
| โ โ โโโ asr_example.wav | |
| โ โ โโโ hotword.txt | |
| โ โโโ fig | |
| โ โ โโโ res.png | |
| โ โ โโโ seaco.png | |
| โ โโโ model.pt | |
| โ โโโ README.md | |
| โ โโโ seg_dict | |
| โ โโโ tokens.json | |
| โโโ README.md | |
| ``` | |
| ## ่ตๅฉ | |
| | ๆฏไปๅฎ | ๅพฎไฟก | | |
| | -------------------- | ----------------------- | | |
| |  |  | | |
| ## ๅ่ | |
| **ASR** | |
| - [https://github.com/openai/whisper](https://github.com/openai/whisper) | |
| - [https://github.com/alibaba-damo-academy/FunASR](https://github.com/alibaba-damo-academy/FunASR) | |
| **TTS** | |
| - [https://github.com/rany2/edge-tts](https://github.com/rany2/edge-tts) | |
| - [https://github.com/PaddlePaddle/PaddleSpeech](https://github.com/PaddlePaddle/PaddleSpeech) | |
| **LLM** | |
| - [https://github.com/CVI-SZU/Linly](https://github.com/CVI-SZU/Linly) | |
| - [https://github.com/QwenLM/Qwen](https://github.com/QwenLM/Qwen) | |
| - [https://deepmind.google/technologies/gemini/](https://deepmind.google/technologies/gemini/) | |
| - [https://github.com/THUDM/ChatGLM3](https://github.com/THUDM/ChatGLM3) | |
| - [https://openai.com](https://openai.com) | |
| **THG** | |
| - [https://github.com/OpenTalker/SadTalker](https://github.com/OpenTalker/SadTalker) | |
| - [https://github.com/Rudrabha/Wav2Lip](https://github.com/Rudrabha/Wav2Lip) | |
| - [https://github.com/Fictionarry/ER-NeRF](https://github.com/Fictionarry/ER-NeRF) | |
| **Voice Clone** | |
| - [https://github.com/RVC-Boss/GPT-SoVITS](https://github.com/RVC-Boss/GPT-SoVITS) | |
| - [https://github.com/coqui-ai/TTS](https://github.com/coqui-ai/TTS) | |
| ## Star History | |
| [](https://star-history.com/#Kedreamix/Linly-Talker&Date) | |