Space: WJBSCUT/CosyVoice


jerrybwang committed 27 days ago

Commit cdf446f (1 parent: 92b99c9)

--other Update code

Files changed (5)

README.md +85 -29
app.py +82 -340
check_deployment.py +134 -0
packages.txt +2 -0
requirements.txt +3 -3

README.md CHANGED

@@ -75,56 +75,112 @@ model = AutoModel.from_pretrained("FunAudioLLM/CosyVoice-300M")
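 ## Deploying to a Hugging Face Space
-### Quick deployment
 1. **Create the Space**
    - Visit [Hugging Face Spaces](https://huggingface.co/spaces)
    - Click "New Space"
    - Select the Gradio SDK
-2. **Upload files**
-   - Upload `app.py`
-   - Upload `requirements.txt`
-   - Upload `config.py` (optional)
-   - Upload `README.md` (optional)
 3. **Wait for the build**
-   - The Space installs dependencies automatically (about 3-5 minutes)
-   - The CosyVoice model is downloaded automatically
    - The app starts
 4. **Verify the deployment**
    - Check the model status indicator (it should show a green ✅)
-   - Test speech recognition and text-to-speech
-### Hardware recommendations
-- **CPU Basic (free)**: suitable for testing and demos
-- **GPU T4 (recommended)**: faster inference and a better user experience
-- **GPU A10G**: suitable for high-concurrency workloads
-### Deployment fixes
-If you encounter the following error:
-```
-ERROR: cosyvoice does not appear to be a Python project
 ```
-**Solution**: Fixed! The current `requirements.txt` drops the problematic dependency and loads the model directly with transformers.
-For a detailed troubleshooting guide, see [HF_SPACE_FIX.md](HF_SPACE_FIX.md)
-### Automatic deployment
-1. Push this repository to Hugging Face
-2. Create a new Space on the Hugging Face website
-3. Select "Gradio" as the SDK
-4. The configuration is detected and deployed automatically
-### Manual deployment
-If you need a custom deployment, you can modify the following files:
-- `app.py`: main application file
-- `requirements.txt`: Python dependencies
-- `README.md`: Space configuration and description
 ## Technical architecture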

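 ## Deploying to a Hugging Face Space
+### Quick deployment steps
 1. **Create the Space**
    - Visit [Hugging Face Spaces](https://huggingface.co/spaces)
    - Click "New Space"
    - Select the Gradio SDK
+   - Select the hardware (CPU Basic or GPU T4 recommended)
+2. **Upload the required files**
+   - `app.py` - main application file
+   - `requirements.txt` - Python dependencies
+   - `packages.txt` - system dependencies (ffmpeg, etc.)
+   - `README.md` - project description (includes the Space configuration)
 3. **Wait for the build**
+   - The Space installs dependencies automatically (about 5-10 minutes)
+   - The CosyVoice-300M model is downloaded automatically from the Hugging Face Hub
    - The app starts
 4. **Verify the deployment**
    - Check the model status indicator (it should show a green ✅)
+   - Test text-to-speech
+   - Test audio processing
+### Hardware recommendations
+| Hardware | Use case | Notes |
+|---------|---------|------|
+| **CPU Basic (free)** | Testing and demos | Slower inference; suitable for light use |
+| **GPU T4 (recommended)** | Production | Faster inference and a better user experience |
+| **GPU A10G** | High concurrency | Suitable for many simultaneous users |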
+### Important configuration notes
+#### README.md header configuration
+Make sure the top of README.md contains the following configuration:
+```yaml
+---
+title: CosyVoice
+emoji: 🌍
+colorFrom: blue
+colorTo: pink
+sdk: gradio
+sdk_version: 6.4.0
+app_file: app.py
+pinned: false
+license: apache-2.0
+---
 ```
+#### Required files
+- ✅ `app.py` - main application
+- ✅ `requirements.txt` - Python dependencies
+- ✅ `packages.txt` - system dependencies (ffmpeg, libsndfile1)
+- ✅ `README.md` - includes the Space configuration
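+### Troubleshooting
+#### Problem 1: The model fails to load
+**Symptom**: The UI shows a "demo mode" (演示模式) warning
+**Solution**:
+- Check that the network connection is working
+- Confirm that `requirements.txt` contains `transformers>=4.35.0`
+- Check the Space logs to follow the model download progress
+#### Problem 2: Audio processing errors
+**Symptom**: An error is raised after uploading audio
+**Solution**:
+- Make sure `packages.txt` exists and contains `ffmpeg`
+- Rebuild the Space
+#### Problem 3: Dependency installation fails
+**Symptom**: Errors appear during the build
+**Solution**:
+- Check that `requirements.txt` is correctly formatted
+- Remove unnecessary dependencies (such as modelscope, edge-tts, gTTS)
+- Pin version numbers to avoid compatibility problems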
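+### Deploying with Git
+```bash
+# Clone or create the repository
+git clone https://huggingface.co/spaces/YOUR_USERNAME/YOUR_SPACE_NAME
+cd YOUR_SPACE_NAME
+# Add the files
+cp /path/to/app.py .
+cp /path/to/requirements.txt .
+cp /path/to/packages.txt .
+cp /path/to/README.md .
+# Commit and push
+git add .
+git commit -m "Initial deployment"
+git push
+```
+### Verification checklist
+Before deploying, confirm that:
+- [ ] README.md contains the correct YAML configuration
+- [ ] app.py is complete and correct
+- [ ] requirements.txt contains all required dependencies
+- [ ] packages.txt contains the system dependencies
+- [ ] Suitable hardware is selected
+- [ ] The Space is set to Public (if public access is needed)
 ## Technical architecture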
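
The README header shown in this diff holds the Space configuration, so a deployment check could read it as structured data instead of searching the whole README for substrings. The following is a minimal sketch and is not part of this commit: `parse_front_matter` and `missing_space_keys` are hypothetical helper names, and the parser only understands flat `key: value` lines, not full YAML.

```python
def parse_front_matter(readme_text):
    """Return the key/value pairs of a leading '---' delimited block, or {} if absent."""
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    config = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return config  # closing delimiter reached
        key, sep, value = line.partition(":")
        if sep and key.strip():
            config[key.strip()] = value.strip()
    return {}  # no closing delimiter: treat the header as missing


def missing_space_keys(readme_text, required=("title", "sdk", "app_file")):
    """List the required keys that are absent or empty in the header."""
    config = parse_front_matter(readme_text)
    return [key for key in required if not config.get(key)]


sample = """---
title: CosyVoice
sdk: gradio
sdk_version: 6.4.0
app_file: app.py
---
# CosyVoice
"""

print(parse_front_matter(sample))
print(missing_space_keys(sample))
print(missing_space_keys("# no header\nsdk: gradio\n"))
```

Unlike a plain substring search, this only accepts `sdk: gradio` when it appears inside the header block, so the same text in the README body does not count.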

app.py CHANGED

@@ -33,228 +33,71 @@ def load_cosyvoice_model():
     print("="*60)
     try:
-        # Method 1: load with transformers (recommended for Hugging Face Spaces)
-        try:
-            print("\n[Method 1] Trying to load with transformers...")
-            from transformers import AutoModel
-            import torch
-            model_name = "FunAudioLLM/CosyVoice-300M"
-            print(f"  Loading from {model_name}...")
-            # CosyVoice needs trust_remote_code=True to load its custom model code
-            model = AutoModel.from_pretrained(
-                model_name,
-                trust_remote_code=True,
-                torch_dtype=torch.float32
-            )
-            # Switch to evaluation mode
-            model.eval()
-            # Inspect the model type and methods in detail
-            print(f"  Model type: {type(model)}")
-            print(f"  Model class name: {model.__class__.__name__}")
-            # List all available public methods
-            all_methods = [m for m in dir(model) if not m.startswith('_') and callable(getattr(model, m, None))]
-            print(f"  Number of available methods: {len(all_methods)}")
-            print(f"  First 20 methods: {all_methods[:20]}")
-            # Check the key inference methods
-            has_inference_sft = hasattr(model, 'inference_sft')
-            has_inference = hasattr(model, 'inference')
-            has_generate = hasattr(model, 'generate')
-            has_forward = hasattr(model, 'forward')
-            print(f"  Inference method check:")
-            print(f"    - inference_sft: {has_inference_sft}")
-            print(f"    - inference: {has_inference}")
-            print(f"    - generate: {has_generate}")
-            print(f"    - forward: {has_forward}")
-            # If the model has any of these methods, loading succeeded
-            has_inference = has_inference_sft or has_inference or has_generate
-            if has_inference:
-                cosyvoice_model = {
-                    'model': model,
-                    'type': 'transformers',
-                    'has_inference': True
-                }
-                model_loaded = True
-                print("  ✓ Loaded the CosyVoice model via transformers (inference methods present)")
-                print("="*60 + "\n")
-                return cosyvoice_model
-            else:
-                print("  ⚠ Model loaded but has no inference method; trying the next method...")
-                # Do not return; fall through to the other methods
-        except Exception as e:
-            print(f"  ✗ transformers loading failed: {e}")
-            import traceback
-            print(f"  Details: {traceback.format_exc()[:300]}")
-        # Method 2: download from the Hugging Face Hub and use the official CosyVoice package
-        try:
-            print("\n[Method 2] Trying to download from the Hugging Face Hub and use the official CosyVoice package...")
-            from huggingface_hub import snapshot_download
-            import torch
-            model_id = "FunAudioLLM/CosyVoice-300M"
-            print(f"  Downloading model: {model_id}")
-            model_dir = snapshot_download(
-                repo_id=model_id,
-                allow_patterns=["*.pt", "*.pth", "*.bin", "*.safetensors", "*.json", "*.txt", "*.yaml", "*.py"],
-            )
-            print(f"  Model downloaded to: {model_dir}")
-            # Try to import the official CosyVoice package
-            try:
-                # First try a direct import (if the package is installed)
-                from cosyvoice.cli.cosyvoice import CosyVoice
-                print("  Using the installed CosyVoice package")
-                cosyvoice_model = CosyVoice(model_dir)
-                model_loaded = True
-                print("  ✓ Loaded the model with the official CosyVoice package")
-                print("="*60 + "\n")
-                return cosyvoice_model
-            except ImportError:
-                print("  CosyVoice package not installed; trying to load from the downloaded code...")
-                # Try to load code from the downloaded model directory
-                if model_dir not in sys.path:
-                    sys.path.insert(0, model_dir)
-                # Find the modeling files
-                import glob
-                py_files = glob.glob(os.path.join(model_dir, "**/*.py"), recursive=True)
-                print(f"  Python files found: {len(py_files)}")
-                # Try to locate the CosyVoice class definition
-                for py_file in py_files:
-                    if 'cosyvoice' in py_file.lower() or 'model' in py_file.lower():
-                        print(f"  Checking file: {os.path.basename(py_file)}")
-                # If the official package is unavailable, try loading the model weights directly
-                try:
-                    # Find the model files
-                    model_files = glob.glob(os.path.join(model_dir, "**/*.pt"), recursive=True)
-                    model_files += glob.glob(os.path.join(model_dir, "**/*.pth"), recursive=True)
-                    model_files += glob.glob(os.path.join(model_dir, "**/*.bin"), recursive=True)
-                    if model_files:
-                        print(f"  Model files found: {len(model_files)}")
-                        for mf in model_files[:5]:
-                            print(f"    - {os.path.basename(mf)}")
-                        # Try loading with transformers' AutoModel
-                        print("  Trying to load from the local directory with AutoModel...")
-                        from transformers import AutoModel
-                        model = AutoModel.from_pretrained(
-                            model_dir,
-                            trust_remote_code=True,
-                            local_files_only=True,
-                            torch_dtype=torch.float32
-                        )
-                        model.eval()
-                        # Check for inference methods
-                        print(f"  Model type: {type(model).__name__}")
-                        has_inference = hasattr(model, 'inference_sft') or hasattr(model, 'inference') or hasattr(model, 'generate')
-                        print(f"  Has inference method: {has_inference}")
-                        if has_inference:
-                            cosyvoice_model = {
-                                'model': model,
-                                'model_dir': model_dir,
-                                'type': 'transformers'
-                            }
-                            model_loaded = True
-                            print("  ✓ Loaded the model from the local directory")
-                            print("="*60 + "\n")
-                            return cosyvoice_model
-                        else:
-                            print("  ⚠ Model loaded but has no inference method")
-                    else:
-                        print("  ✗ No model files found")
-                except Exception as load_err:
-                    print(f"  ✗ Failed to load model files: {load_err}")
-                    import traceback
-                    print(f"  Details: {traceback.format_exc()[:300]}")
-        except Exception as e:
-            print(f"  ✗ Hugging Face Hub download failed: {e}")
-            import traceback
-            print(f"  Details: {traceback.format_exc()[:300]}")
-        # Method 3: load from a local path (for local development)
-        try:
-            print("\n[Method 3] Trying to load from a local path...")
-            from cosyvoice.cli.cosyvoice import CosyVoice
-            possible_paths = [
-                os.environ.get('COSYVOICE_MODEL_DIR', ''),
-                'pretrained_models/CosyVoice-300M',
-                'CosyVoice-300M',
-                './models/CosyVoice-300M',
-            ]
-            for model_dir in possible_paths:
-                if model_dir and os.path.exists(model_dir):
-                    print(f"  Trying to load from path: {model_dir}")
-                    try:
-                        cosyvoice_model = CosyVoice(model_dir)
-                        model_loaded = True
-                        print(f"  ✓ Loaded the CosyVoice model from {model_dir}")
-                        print("="*60 + "\n")
-                        return cosyvoice_model
-                    except Exception as e:
-                        print(f"  ✗ Loading failed: {e}")
-                        continue
-        except ImportError as e:
-            print(f"  ✗ CosyVoice package not installed: {e}")
-        except Exception as e:
-            print(f"  ✗ Method 3 failed: {e}")
-        # Method 4: load with ModelScope (fallback for users in mainland China)
-        try:
-            print("\n[Method 4] Trying to load with ModelScope...")
-            from modelscope import snapshot_download
-            model_dir = snapshot_download('iic/CosyVoice-300M')
-            print(f"  Model downloaded to: {model_dir}")
-            # Try to use the downloaded model
-            try:
-                sys.path.insert(0, model_dir)
-                from cosyvoice.cli.cosyvoice import CosyVoice
-                cosyvoice_model = CosyVoice(model_dir)
-                model_loaded = True
-                print("  ✓ Loaded the CosyVoice model via ModelScope")
-                print("="*60 + "\n")
-                return cosyvoice_model
-            except ImportError:
-                cosyvoice_model = {'model_dir': model_dir, 'type': 'downloaded'}
-                model_loaded = True
-                print("  ✓ Model files downloaded (demo mode)")
-                print("="*60 + "\n")
-                return cosyvoice_model
-        except Exception as e:
-            print(f"  ✗ ModelScope loading failed: {e}")
-        # Method 5: demo mode (all methods failed)
-        print("\n[Method 5] All loading methods failed; using demo mode")
-        print("  ⚠ Demo mode does not include the real CosyVoice model")
-        print("  ⚠ For full functionality, make sure that:")
-        print("     1. The network connection works and Hugging Face is reachable")
-        print("     2. There is enough disk space (about 2GB)")
-        print("     3. The transformers package version is >= 4.35.0")
-        print("\nTo install:")
-        print("  pip install transformers>=4.35.0")
         print("="*60 + "\n")
         cosyvoice_model = None
@@ -497,129 +340,28 @@ def text_to_speech(text, speaker="中文女"):
         return None, f"Speech synthesis failed: {str(e)}"
 def generate_demo_audio(text, speaker, error=None):
-    """Generate real speech with a fallback TTS engine"""
-    try:
-        # Try edge-tts (the TTS engine of the Microsoft Edge browser)
-        import edge_tts
-        import asyncio
-        import tempfile
-        print(f"Generating speech with edge-tts: {text[:50]}...")
-        # Choose the voice for the speaker
-        voice_map = {
-            "中文女": "zh-CN-XiaoxiaoNeural",
-            "中文男": "zh-CN-YunxiNeural",
-            "英文女": "en-US-JennyNeural",
-            "英文男": "en-US-GuyNeural",
-        }
-        voice = voice_map.get(speaker, "zh-CN-XiaoxiaoNeural")
-        # Create a temporary file
-        with tempfile.NamedTemporaryFile(delete=False, suffix='.mp3') as tmp_file:
-            tmp_path = tmp_file.name
-        # Generate the speech asynchronously
-        async def generate():
-            communicate = edge_tts.Communicate(text, voice)
-            await communicate.save(tmp_path)
-        # Run the async task
-        asyncio.run(generate())
-        # Read the generated audio
-        import soundfile as sf
-        from pydub import AudioSegment
-        # Convert MP3 to WAV
-        audio = AudioSegment.from_mp3(tmp_path)
-        wav_path = tmp_path.replace('.mp3', '.wav')
-        audio.export(wav_path, format='wav')
-        # Read the WAV file
-        audio_data, sample_rate = sf.read(wav_path)
-        # Clean up the temporary files
-        os.unlink(tmp_path)
-        os.unlink(wav_path)
-        audio_tuple = (sample_rate, audio_data.astype(np.float32))
-        status_msg = f"✓ Speech synthesis succeeded (using Edge TTS)\nText: {text}\nSpeaker: {speaker}\n"
-        if error:
-            status_msg += f"Note: the CosyVoice model is unavailable; a fallback engine was used\n"
-        print(f"  ✓ Edge TTS generation succeeded")
-        return audio_tuple, status_msg
-    except Exception as e:
-        print(f"  ✗ Edge TTS failed: {e}")
-        # If edge-tts fails, try gTTS
-        try:
-            from gtts import gTTS
-            import tempfile
-            import soundfile as sf
-            from pydub import AudioSegment
-            print(f"Generating speech with gTTS: {text[:50]}...")
-            # Choose the language for the speaker
-            lang = 'zh-CN' if speaker.startswith('中文') else 'en'
-            # Create a temporary file
-            with tempfile.NamedTemporaryFile(delete=False, suffix='.mp3') as tmp_file:
-                tmp_path = tmp_file.name
-            # Generate the speech
-            tts = gTTS(text=text, lang=lang, slow=False)
-            tts.save(tmp_path)
-            # Convert MP3 to WAV
-            audio = AudioSegment.from_mp3(tmp_path)
-            wav_path = tmp_path.replace('.mp3', '.wav')
-            audio.export(wav_path, format='wav')
-            # Read the WAV file
-            audio_data, sample_rate = sf.read(wav_path)
-            # Clean up the temporary files
-            os.unlink(tmp_path)
-            os.unlink(wav_path)
-            audio_tuple = (sample_rate, audio_data.astype(np.float32))
-            status_msg = f"✓ Speech synthesis succeeded (using Google TTS)\nText: {text}\nSpeaker: {speaker}\n"
-            if error:
-                status_msg += f"Note: the CosyVoice model is unavailable; a fallback engine was used\n"
-            print(f"  ✓ gTTS generation succeeded")
-            return audio_tuple, status_msg
-        except Exception as e2:
-            print(f"  ✗ gTTS also failed: {e2}")
-            # Last-resort fallback: generate a demo tone
-            sample_rate = 22050
-            duration = min(len(text) * 0.2, 5.0)
-            t = np.linspace(0, duration, int(sample_rate * duration), False)
-            frequency = 440
-            audio_data = 0.3 * np.sin(2 * np.pi * frequency * t)
-            audio_data += 0.2 * np.sin(2 * np.pi * frequency * 1.5 * t)
-            fade_samples = int(sample_rate * 0.1)
-            audio_data[:fade_samples] *= np.linspace(0, 1, fade_samples)
-            audio_data[-fade_samples:] *= np.linspace(1, 0, fade_samples)
-            audio_tuple = (sample_rate, audio_data.astype(np.float32))
-            status_msg = f"⚠ Demo mode\nText: {text}\nSpeaker: {speaker}\n"
-            if error:
-                status_msg += f"Note: no TTS engine is available\n"
-            status_msg += "Hint: this is demo audio, not a real speech synthesis result"
-            return audio_tuple, status_msg
 # Load the model at startup
 load_cosyvoice_model()

     print("="*60)
     try:
+        # Load with transformers (recommended for Hugging Face Spaces)
+        print("\nTrying to load with transformers...")
+        from transformers import AutoModel
+        import torch
+        model_name = "FunAudioLLM/CosyVoice-300M"
+        print(f"Loading from {model_name}...")
+        # CosyVoice needs trust_remote_code=True to load its custom model code
+        model = AutoModel.from_pretrained(
+            model_name,
+            trust_remote_code=True,
+            torch_dtype=torch.float32,
+            low_cpu_mem_usage=True
+        )
+        # Switch to evaluation mode
+        model.eval()
+        # Inspect the model type and methods
+        print(f"Model type: {type(model).__name__}")
+        # Check the key inference methods
+        has_inference_sft = hasattr(model, 'inference_sft')
+        has_inference = hasattr(model, 'inference')
+        has_generate = hasattr(model, 'generate')
+        print(f"Inference method check:")
+        print(f"  - inference_sft: {has_inference_sft}")
+        print(f"  - inference: {has_inference}")
+        print(f"  - generate: {has_generate}")
+        # If the model has any of these methods, loading succeeded
+        if has_inference_sft or has_inference or has_generate:
+            cosyvoice_model = {
+                'model': model,
+                'type': 'transformers',
+                'has_inference': True
+            }
+            model_loaded = True
+            print("✓ Loaded the CosyVoice model")
+            print("="*60 + "\n")
+            return cosyvoice_model
+        else:
+            print("⚠ Model loaded but has no inference method")
+            cosyvoice_model = {
+                'model': model,
+                'type': 'transformers',
+                'has_inference': False
+            }
+            model_loaded = True
+            print("="*60 + "\n")
+            return cosyvoice_model
+    except Exception as e:
+        print(f"✗ Model loading failed: {e}")
+        import traceback
+        print(f"Details:\n{traceback.format_exc()}")
+        # Demo mode (loading failed)
+        print("\n⚠ Using demo mode")
+        print("Hint: for full functionality, make sure that:")
+        print("  1. The network connection works and Hugging Face is reachable")
+        print("  2. There is enough disk space (about 2GB)")
+        print("  3. The transformers package version is >= 4.35.0")
         print("="*60 + "\n")
         cosyvoice_model = None
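
The loader in this hunk records whether any of `inference_sft`, `inference`, or `generate` exists, but not which one, so callers would have to repeat the `hasattr` checks. A minimal sketch of resolving the callable once is shown here. `pick_inference_method` and `FakeModel` are hypothetical names used only for illustration, and nothing in this sketch asserts which of these methods the real model object exposes.

```python
# Candidate method names, in the order the loader checks them.
INFERENCE_METHODS = ("inference_sft", "inference", "generate")


def pick_inference_method(model, candidates=INFERENCE_METHODS):
    """Return (name, bound_method) for the first callable candidate, else (None, None)."""
    for name in candidates:
        method = getattr(model, name, None)
        if callable(method):
            return name, method
    return None, None


class FakeModel:
    """Stand-in object for the example; it is not the real model class."""

    def generate(self, text):
        return f"audio for {text}"


name, method = pick_inference_method(FakeModel())
print(name, method("hello"))
print(pick_inference_method(object()))
```

Storing the resolved name next to `'has_inference'` in the returned dictionary would let the synthesis path call the method directly, or fall back to demo audio when the name is `None`.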
         return None, f"Speech synthesis failed: {str(e)}"
 def generate_demo_audio(text, speaker, error=None):
+    """Generate demo audio (used when the model is unavailable)"""
+    # Generate a simple demo tone
+    sample_rate = 22050
+    duration = min(len(text) * 0.2, 5.0)
+    t = np.linspace(0, duration, int(sample_rate * duration), False)
+    frequency = 440
+    audio_data = 0.3 * np.sin(2 * np.pi * frequency * t)
+    audio_data += 0.2 * np.sin(2 * np.pi * frequency * 1.5 * t)
+    fade_samples = int(sample_rate * 0.1)
+    audio_data[:fade_samples] *= np.linspace(0, 1, fade_samples)
+    audio_data[-fade_samples:] *= np.linspace(1, 0, fade_samples)
+    audio_tuple = (sample_rate, audio_data.astype(np.float32))
+    status_msg = f"⚠ Demo mode\nText: {text}\nSpeaker: {speaker}\n"
+    if error:
+        status_msg += f"Error: {error}\n"
+    status_msg += "Hint: this is demo audio, not a real speech synthesis result. Make sure the model is loaded correctly."
+    return audio_tuple, status_msg
 # Load the model at startup
 load_cosyvoice_model()
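
The fallback tone in `generate_demo_audio` lasts 0.2 s per character, capped at 5 s, with a fixed 0.1 s fade at each end. If the function were ever called with an empty string, the buffer would have zero samples while the fade ramps still have 2205, and the in-place multiply would fail on the shape mismatch. The sketch below reproduces the arithmetic with only the standard library and clamps the fade to the buffer length. `demo_tone` is a hypothetical name, and the ramp is an approximation of the original `np.linspace` fade rather than a sample-exact copy.

```python
import math


def demo_tone(text, sample_rate=22050, frequency=440.0, max_seconds=5.0):
    """Return (sample_rate, samples) for a two-partial tone sized by text length."""
    duration = min(len(text) * 0.2, max_seconds)
    n = int(sample_rate * duration)
    samples = [
        0.3 * math.sin(2 * math.pi * frequency * i / sample_rate)
        + 0.2 * math.sin(2 * math.pi * frequency * 1.5 * i / sample_rate)
        for i in range(n)
    ]
    # Clamp the fade so fade-in and fade-out never exceed or overlap the buffer.
    fade = min(int(sample_rate * 0.1), n // 2)
    for i in range(fade):
        gain = i / fade
        samples[i] *= gain          # fade in
        samples[n - 1 - i] *= gain  # fade out
    return sample_rate, samples


rate, tone = demo_tone("hello")
print(rate, len(tone))
print(len(demo_tone("x" * 30)[1]))
print(len(demo_tone("")[1]))
```

Whether the empty-text case can occur depends on validation in the caller, which this hunk does not show.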

check_deployment.py ADDED

@@ -0,0 +1,134 @@
+#!/usr/bin/env python3
+"""
+Hugging Face Space deployment check script
+Checks that all required files and configuration are in place
+"""
+import os
+import sys
+from pathlib import Path
+def check_file_exists(filepath, required=True):
+    """Check whether a file exists"""
+    exists = os.path.exists(filepath)
+    status = "✅" if exists else ("❌" if required else "⚠️")
+    req_text = "required" if required else "optional"
+    print(f"{status} {filepath} ({req_text}): {'present' if exists else 'missing'}")
+    return exists
+def check_file_content(filepath, required_content):
+    """Check the file content; stops at the first missing item"""
+    try:
+        with open(filepath, 'r', encoding='utf-8') as f:
+            content = f.read()
+            for item in required_content:
+                if item in content:
+                    print(f"  ✅ Contains: {item}")
+                else:
+                    print(f"  ❌ Missing: {item}")
+                    return False
+        return True
+    except Exception as e:
+        print(f"  ❌ Failed to read file: {e}")
+        return False
+def main():
+    print("="*60)
+    print("Hugging Face Space deployment check")
+    print("="*60)
+    print()
+    # Check required files
+    print("📋 Checking required files:")
+    print("-"*60)
+    files_ok = True
+    files_ok &= check_file_exists("app.py", required=True)
+    files_ok &= check_file_exists("requirements.txt", required=True)
+    files_ok &= check_file_exists("packages.txt", required=True)
+    files_ok &= check_file_exists("README.md", required=True)
+    print()
+    # Check optional files
+    print("📋 Checking optional files:")
+    print("-"*60)
+    check_file_exists("config.py", required=False)
+    check_file_exists(".gitignore", required=False)
+    print()
+    # Check the README.md configuration
+    print("📋 Checking README.md configuration:")
+    print("-"*60)
+    if os.path.exists("README.md"):
+        readme_items = [
+            "title:",
+            "sdk: gradio",
+            "app_file: app.py",
+        ]
+        check_file_content("README.md", readme_items)
+    print()
+    # Check requirements.txt
+    print("📋 Checking requirements.txt:")
+    print("-"*60)
+    if os.path.exists("requirements.txt"):
+        req_items = [
+            "gradio",
+            "torch",
+            "transformers",
+            "huggingface_hub",
+        ]
+        check_file_content("requirements.txt", req_items)
+    print()
+    # Check packages.txt
+    print("📋 Checking packages.txt:")
+    print("-"*60)
+    if os.path.exists("packages.txt"):
+        pkg_items = [
+            "ffmpeg",
+        ]
+        check_file_content("packages.txt", pkg_items)
+    print()
+    # Check app.py
+    print("📋 Checking app.py:")
+    print("-"*60)
+    if os.path.exists("app.py"):
+        app_items = [
+            "import gradio",
+            "AutoModel.from_pretrained",
+            "FunAudioLLM/CosyVoice-300M",
+            "demo.launch()",
+        ]
+        check_file_content("app.py", app_items)
+    print()
+    print("="*60)
+    if files_ok:
+        print("✅ All required files are present!")
+        print()
+        print("📦 Next steps:")
+        print("1. Visit https://huggingface.co/spaces")
+        print("2. Create a new Space and select the Gradio SDK")
+        print("3. Upload the following files:")
+        print("   - app.py")
+        print("   - requirements.txt")
+        print("   - packages.txt")
+        print("   - README.md")
+        print("4. Wait for the build to finish (about 5-10 minutes)")
+        print("5. Test the app")
+    else:
+        print("❌ Check failed! Fix the problems above and try again.")
+        sys.exit(1)
+    print("="*60)
+if __name__ == "__main__":
+    main()
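
In the script above, `check_file_content` returns at the first missing item, and `main()` discards its return value, so the final verdict reflects only whether the four files exist. A possible variant, sketched under the assumption that a full report is preferable, collects every missing item per file and lets the caller decide the exit status. `missing_items` and `run_checks` are hypothetical names and are not part of this commit.

```python
import os
import tempfile


def missing_items(filepath, required_content):
    """Return every required substring absent from the file, not just the first."""
    with open(filepath, "r", encoding="utf-8") as f:
        content = f.read()
    return [item for item in required_content if item not in content]


def run_checks(checks):
    """checks maps filepath -> required substrings; returns {filepath: [problems]}."""
    report = {}
    for filepath, required in checks.items():
        if not os.path.exists(filepath):
            report[filepath] = ["file missing"]
            continue
        missing = missing_items(filepath, required)
        if missing:
            report[filepath] = missing
    return report


# Usage: a requirements file that lacks two of the four expected packages.
with tempfile.TemporaryDirectory() as tmp:
    req = os.path.join(tmp, "requirements.txt")
    with open(req, "w", encoding="utf-8") as f:
        f.write("gradio\ntorch\n")
    report = run_checks({req: ["gradio", "torch", "transformers", "huggingface_hub"]})
    print(report[req])
```

An empty report means every check passed, so a caller could use `sys.exit(1 if report else 0)` to make content failures affect the exit code as well.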

packages.txt ADDED

@@ -0,0 +1,2 @@
+ffmpeg
+libsndfile1
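
The two entries in `packages.txt` are system packages, so their presence can only be confirmed at runtime. The following sketch probes for them from Python using the standard library. `check_system_dependencies` is a hypothetical helper, and the assumption that `libsndfile1` is discoverable under the library name `sndfile` may not hold on every platform.

```python
import ctypes.util
import shutil


def check_system_dependencies(tools=("ffmpeg",), libraries=("sndfile",)):
    """Map each dependency name to True if it can be located on this system."""
    report = {}
    for tool in tools:
        # Executables are looked up on PATH.
        report[tool] = shutil.which(tool) is not None
    for lib in libraries:
        # Shared libraries are looked up through the platform's library search.
        report["lib" + lib] = ctypes.util.find_library(lib) is not None
    return report


status = check_system_dependencies()
for name, found in status.items():
    print(f"{name}: {'found' if found else 'not found'}")
```

Logging this once at startup would distinguish a missing system package from a problem in the Python audio libraries when audio uploads fail.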

requirements.txt CHANGED

@@ -7,7 +7,7 @@ librosa>=0.10.0
 soundfile>=0.12.0
 scipy>=1.10.0
 huggingface_hub>=0.19.0
-modelscope
 pydub
-gTTS
-edge-tts

 soundfile>=0.12.0
 scipy>=1.10.0
 huggingface_hub>=0.19.0
 pydub
+accelerate
+sentencepiece
+protobuf
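
The README in this commit advises pinning versions, while `pydub` and the three packages added here carry no version specifier. A small sketch for listing such entries follows. `unconstrained_requirements` is a hypothetical helper, and its parsing is deliberately simplified: it treats any text after the package name, including `>=` bounds and environment markers, as a constraint.

```python
import re

# Package name, optional [extras], then whatever follows (version specifier, markers).
_SPEC = re.compile(r"^\s*([A-Za-z0-9][A-Za-z0-9._-]*)\s*(\[[^\]]*\])?\s*(.*)$")


def unconstrained_requirements(requirements_text):
    """Return package names that appear with no version specifier at all."""
    unconstrained = []
    for raw in requirements_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if not line or line.startswith("-"):  # skip blanks and pip options such as -r
            continue
        match = _SPEC.match(line)
        if match and not match.group(3).strip():
            unconstrained.append(match.group(1))
    return unconstrained


sample_requirements = """soundfile>=0.12.0
scipy>=1.10.0
huggingface_hub>=0.19.0
pydub
accelerate
sentencepiece
protobuf
"""

print(unconstrained_requirements(sample_requirements))
```

Note that a `>=` bound, as used for `soundfile` and `scipy`, is a lower limit and not an exact pin, so this check is weaker than the README's advice.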