LTTEAM commited on
Commit
f98ef23
·
verified ·
1 Parent(s): e2fb73d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +109 -109
README.md CHANGED
@@ -1,109 +1,109 @@
1
- ---
2
- title: LatentSync - Đồng bộ môi bằng AI
3
- emoji: 🎤
4
- colorFrom: green
5
- colorTo: yellow
6
- sdk: gradio
7
- sdk_version: 3.50.2
8
- app_file: app.py
9
- pinned: true
10
- ---
11
-
12
- # LatentSync - AI Lip Sync Technology
13
-
14
- [![Open in Spaces](https://img.shields.io/badge/🤗-Open%20in%20Spaces-blue.svg)](https://huggingface.co/spaces/LTTEAM/LatentSync)
15
- [![Facebook Community](https://img.shields.io/badge/👥-Facebook%20Group-blue)](https://www.facebook.com/groups/622526090937760)
16
-
17
- ## 🌟 Giới thiệu / Introduction
18
-
19
- **LatentSync** là công nghệ đồng bộ hóa chuyển động môi sử dụng mô hình Diffusion tiên tiến, cho phép tạo chuyển động môi tự nhiên từ âm thanh đầu vào.
20
-
21
- **LatentSync** is an advanced lip-sync technology using Diffusion models to generate natural lip movements from input audio.
22
-
23
- ## 🚀 Công nghệ / Technology
24
-
25
- ```python
26
- # Kiến trúc chính / Core Architecture
27
- pipeline = LipsyncPipeline(
28
- vae=AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse"),
29
- audio_encoder=Audio2Feature(model_path="whisper/small.pt"),
30
- unet=UNet3DConditionModel.from_config(config),
31
- scheduler=DDIMScheduler()
32
- )
33
- ```
34
-
35
- **Công nghệ chính / Key Technologies:**
36
- - 🧠 UNet 3D Condition Model
37
- - 🔊 Whisper Audio Encoder
38
- - 🌀 Latent Diffusion
39
- - ⚡ GPU Acceleration
40
-
41
- ## 📚 Cách sử dụng / How to Use
42
-
43
- 1. Tải lên video chứa khuôn mặt / Upload face video
44
- 2. Tải lên file âm thanh / Upload audio file
45
- 3. Nhấn "Chạy đồng bộ" / Click "Run Sync"
46
- 4. Chờ kết quả / Wait for processing
47
-
48
- ```bash
49
- # Chạy local / Run locally
50
- git clone https://huggingface.co/spaces/LTTEAM/LatentSync
51
- cd LatentSync
52
- pip install -r requirements.txt
53
- python app.py
54
- ```
55
-
56
- ## 🌐 Demo Online
57
-
58
- [![Try on Spaces](https://img.shields.io/badge/🤗-Try%20on%20Spaces-blue.svg)](https://huggingface.co/spaces/LTTEAM/LatentSync)
59
-
60
- ## 👨‍💻 Tác giả & Cộng đồng / Author & Community
61
-
62
- **Tác giả / Author:**
63
- [Lý Trần](https://github.com/lytrann)
64
-
65
- **Cộng đồng / Community:**
66
- [LTTEAM Facebook Group](https://www.facebook.com/groups/622526090937760)
67
-
68
- **Hỗ trợ / Support:**
69
- [![Facebook Community](https://img.shields.io/badge/👥-Join%20Community-blue)](https://www.facebook.com/groups/622526090937760)
70
-
71
- ## 📜 Giấy phép / License
72
-
73
- ```text
74
- Copyright 2023 LTTEAM
75
-
76
- Licensed under the Apache License, Version 2.0 (the "License");
77
- you may not use this file except in compliance with the License.
78
- ```
79
-
80
- ---
81
-
82
- 🔥 **Đóng góp / Contributions welcome!**
83
- 💡 **Báo lỗi / Report issues:** [Issues](https://huggingface.co/spaces/LTTEAM/LatentSync/discussions)
84
- ```
85
-
86
- ## Key Features of this README:
87
-
88
- 1. **Bilingual Presentation**: Vietnamese and English for wider accessibility
89
- 2. **Technical Highlights**:
90
- - Code block showing core architecture
91
- - Badges for easy navigation
92
- - Clear technology stack
93
-
94
- 3. **Community Focus**:
95
- - Author information
96
- - Community links
97
- - Support channels
98
-
99
- 4. **Visual Appeal**:
100
- - Emoji usage
101
- - Colorful badges
102
- - Clear section separation
103
-
104
- 5. **Practical Information**:
105
- - Usage instructions
106
- - Local setup guide
107
- - License information
108
-
109
- This README will display beautifully on your Hugging Face Space while effectively communicating all key information to users.
 
1
+ ---
2
+ title: LatentSync - Đồng bộ môi bằng AI
3
+ emoji: 🎤
4
+ colorFrom: green
5
+ colorTo: yellow
6
+ sdk: gradio
7
+ sdk_version: 5.34.0
8
+ app_file: app.py
9
+ pinned: true
10
+ ---
11
+
12
+ # LatentSync - AI Lip Sync Technology
13
+
14
+ [![Open in Spaces](https://img.shields.io/badge/🤗-Open%20in%20Spaces-blue.svg)](https://huggingface.co/spaces/LTTEAM/LatentSync)
15
+ [![Facebook Community](https://img.shields.io/badge/👥-Facebook%20Group-blue)](https://www.facebook.com/groups/622526090937760)
16
+
17
+ ## 🌟 Giới thiệu / Introduction
18
+
19
+ **LatentSync** là công nghệ đồng bộ hóa chuyển động môi sử dụng mô hình Diffusion tiên tiến, cho phép tạo chuyển động môi tự nhiên từ âm thanh đầu vào.
20
+
21
+ **LatentSync** is an advanced lip-sync technology using Diffusion models to generate natural lip movements from input audio.
22
+
23
+ ## 🚀 Công nghệ / Technology
24
+
25
+ ```python
26
+ # Kiến trúc chính / Core Architecture
27
+ pipeline = LipsyncPipeline(
28
+ vae=AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse"),
29
+ audio_encoder=Audio2Feature(model_path="whisper/small.pt"),
30
+ unet=UNet3DConditionModel.from_config(config),
31
+ scheduler=DDIMScheduler()
32
+ )
33
+ ```
34
+
35
+ **Công nghệ chính / Key Technologies:**
36
+ - 🧠 UNet 3D Condition Model
37
+ - 🔊 Whisper Audio Encoder
38
+ - 🌀 Latent Diffusion
39
+ - ⚡ GPU Acceleration
40
+
41
+ ## 📚 Cách sử dụng / How to Use
42
+
43
+ 1. Tải lên video chứa khuôn mặt / Upload face video
44
+ 2. Tải lên file âm thanh / Upload audio file
45
+ 3. Nhấn "Chạy đồng bộ" / Click "Run Sync"
46
+ 4. Chờ kết quả / Wait for processing
47
+
48
+ ```bash
49
+ # Chạy local / Run locally
50
+ git clone https://huggingface.co/spaces/LTTEAM/LatentSync
51
+ cd LatentSync
52
+ pip install -r requirements.txt
53
+ python app.py
54
+ ```
55
+
56
+ ## 🌐 Demo Online
57
+
58
+ [![Try on Spaces](https://img.shields.io/badge/🤗-Try%20on%20Spaces-blue.svg)](https://huggingface.co/spaces/LTTEAM/LatentSync)
59
+
60
+ ## 👨‍💻 Tác giả & Cộng đồng / Author & Community
61
+
62
+ **Tác giả / Author:**
63
+ [Lý Trần](https://github.com/lytrann)
64
+
65
+ **Cộng đồng / Community:**
66
+ [LTTEAM Facebook Group](https://www.facebook.com/groups/622526090937760)
67
+
68
+ **Hỗ trợ / Support:**
69
+ [![Facebook Community](https://img.shields.io/badge/👥-Join%20Community-blue)](https://www.facebook.com/groups/622526090937760)
70
+
71
+ ## 📜 Giấy phép / License
72
+
73
+ ```text
74
+ Copyright 2023 LTTEAM
75
+
76
+ Licensed under the Apache License, Version 2.0 (the "License");
77
+ you may not use this file except in compliance with the License.
78
+ ```
79
+
80
+ ---
81
+
82
+ 🔥 **Đóng góp / Contributions welcome!**
83
+ 💡 **Báo lỗi / Report issues:** [Issues](https://huggingface.co/spaces/LTTEAM/LatentSync/discussions)
84
+ ```
85
+
86
+ ## Key Features of this README:
87
+
88
+ 1. **Bilingual Presentation**: Vietnamese and English for wider accessibility
89
+ 2. **Technical Highlights**:
90
+ - Code block showing core architecture
91
+ - Badges for easy navigation
92
+ - Clear technology stack
93
+
94
+ 3. **Community Focus**:
95
+ - Author information
96
+ - Community links
97
+ - Support channels
98
+
99
+ 4. **Visual Appeal**:
100
+ - Emoji usage
101
+ - Colorful badges
102
+ - Clear section separation
103
+
104
+ 5. **Practical Information**:
105
+ - Usage instructions
106
+ - Local setup guide
107
+ - License information
108
+
109
+ This README will display beautifully on your Hugging Face Space while effectively communicating all key information to users.