Spaces:

Tom1986
/

GENIE

Sleeping

App Files Files Community

Tom1986 commited on Sep 16, 2025

Commit

a2e2aa0

1 Parent(s): e1d82ba

Deploy Genie TTS

Browse files

Files changed (7) hide show

.gitignore +196 -0
DEPLOYMENT.md +151 -0
PROJECT_SUMMARY.md +118 -0
README.md.space +18 -0
app.py +539 -0
guidance.md +67 -0
requirements.txt +40 -0

.gitignore ADDED Viewed

	@@ -0,0 +1,196 @@

+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+# C extensions
+*.so
+# Distribution / packaging
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+# PyInstaller
+#  Usually these files are written by a python script from a template
+#  before PyInstaller builds the exe, so as to inject date/other infos into it.
+*.manifest
+*.spec
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+cover/
+# Translations
+*.mo
+*.pot
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+db.sqlite3-journal
+# Flask stuff:
+instance/
+.webassets-cache
+# Scrapy stuff:
+.scrapy
+# Sphinx documentation
+docs/_build/
+# PyBuilder
+.pybuilder/
+target/
+# Jupyter Notebook
+.ipynb_checkpoints
+# IPython
+profile_default/
+ipython_config.py
+# pyenv
+#   For a library or package, you might want to ignore these files since the code is
+#   intended to run in multiple environments; otherwise, check them in:
+# .python-version
+# pipenv
+#   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
+#   However, in case of collaboration, if having platform-specific dependencies or dependencies
+#   having no cross-platform support, pipenv may install dependencies that don't work, or not
+#   install all needed dependencies.
+#Pipfile.lock
+# poetry
+#   Similar to Pipfile.lock, it is generally recommended to include poetry.lock in version control.
+#   This is especially recommended for binary packages to ensure reproducibility, and is more
+#   commonly ignored for libraries.
+#   https://python-poetry.org/docs/basic-usage/#commit-your-poetrylock-file-to-version-control
+#poetry.lock
+# pdm
+#   Similar to Pipfile.lock, it is generally recommended to include pdm.lock in version control.
+#pdm.lock
+#   pdm stores project-wide configurations in .pdm.toml, but it is recommended to not include it
+#   in version control.
+#   https://pdm.fming.dev/#use-with-ide
+.pdm.toml
+# PEP 582; used by e.g. github.com/David-OConnor/pyflow and github.com/pdm-project/pdm
+__pypackages__/
+# Celery stuff
+celerybeat-schedule
+celerybeat.pid
+# SageMath parsed files
+*.sage.py
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+# Spyder project settings
+.spyderproject
+.spyproject
+# Rope project settings
+.ropeproject
+# mkdocs documentation
+/site
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+# Pyre type checker
+.pyre/
+# pytype static type analyzer
+.pytype/
+# Cython debug symbols
+cython_debug/
+# Hugging Face specific
+.cache/
+huggingface_hub/
+models/
+checkpoints/
+# Model and audio cache files
+*.wav
+*.mp3
+*.onnx
+*.bin
+*.pth
+*.ckpt
+# Temporary files
+tmp/
+temp/
+*.tmp
+# OS specific
+.DS_Store
+.DS_Store?
+._*
+.Spotlight-V100
+.Trashes
+ehthumbs.db
+Thumbs.db
+# IDE
+.vscode/
+.idea/
+*.swp
+*.swo
+*~
+# Log files
+*.log
+logs/
+# Gradio temporary files
+gradio_cached_examples/
+flagged/

DEPLOYMENT.md ADDED Viewed

	@@ -0,0 +1,151 @@

+# Hugging Face Space 部署指南
+## 📋 准备工作
+确保您已经有以下文件：
+- `app.py` - 主要的 Gradio 应用
+- `requirements.txt` - Python 依赖
+- `README.md` - 项目说明文档
+- `.gitignore` - Git 忽略文件
+- `README.md.space` - Hugging Face Space 配置（需要重命名为 README.md）
+## 🚀 部署步骤
+### 1. 创建 Hugging Face Space
+1. 登录 [Hugging Face](https://huggingface.co/)
+2. 点击您的头像 → "New Space"
+3. 填写 Space 信息：
+   - **Space name**: `GENIE` （或其他您喜欢的名称）
+   - **License**: MIT
+   - **SDK**: Gradio
+   - **Hardware**: CPU (免费) 或 GPU (付费)
+   - **Visibility**: Public
+### 2. 上传文件
+有两种方式上传文件：
+#### 方式 A: Git 上传（推荐）
+```bash
+# 1. 克隆您创建的 Space
+git clone https://huggingface.co/spaces/YOUR_USERNAME/GENIE
+cd GENIE
+# 2. 复制所有文件到此目录
+# 将 README.md.space 重命名为 README.md（会替换默认的）
+# 3. 提交并推送
+git add .
+git commit -m "Initial Genie TTS deployment"
+git push
+```
+#### 方式 B: Web 界面上传
+1. 在 Space 页面点击 "Files" 标签
+2. 点击 "Add file" → "Upload file"
+3. 上传所有必要文件
+4. 将 `README.md.space` 的内容复制到默认的 `README.md` 中
+### 3. 配置 Space
+确保 README.md 文件开头包含正确的 YAML 元数据：
+```yaml
+---
+title: 🔮 Genie TTS - AI语音合成
+emoji: 🎵
+colorFrom: purple
+colorTo: pink
+sdk: gradio
+sdk_version: 4.44.0
+app_file: app.py
+pinned: false
+license: mit
+short_description: 基于Genie的日语文本转语音系统
+tags:
+  - text-to-speech
+  - japanese
+  - gpt-sovits
+  - audio
+  - ai
+  - tts
+  - voice-synthesis
+---
+```
+### 4. 等待构建
+- Space 将自动开始构建
+- 首次构建可能需要 5-10 分钟
+- 您可以在 "Logs" 标签中查看构建进度
+### 5. 测试应用
+1. 构建完成后，访问您的 Space
+2. 测试基本功能：
+   - 选择角色 "misono_mika"
+   - 输入日语文本，例如："おはようございます"
+   - 点击 "开始合成"
+   - 等待音频生成
+## ⚠️ 注意事项
+### 性能考虑
+- **首次运行**: 需要下载约 200MB 的模型文件，可能需要 30-60 秒
+- **后续使用**: 模型会被缓存，合成速度较快（5-15 秒）
+- **并发限制**: 免费 CPU 硬件建议限制并发用户数
+### 内存管理
+- Genie TTS 需要约 500MB-1GB RAM
+- 如果遇到内存不足错误，考虑升级到付费硬件
+### 错误排查
+1. **依赖安装错误**
+   - 检查 `requirements.txt` 中的版本冲突
+   - 查看构建日志中的错误信息
+2. **模型下载失败**
+   - 通常是网络问题，等待几分钟后重试
+   - 检查 Hugging Face Hub 的连接状态
+3. **音频生成失败**
+   - 检查输入文本是否为日语
+   - 验证文本长度不超过 500 字符
+## 🔧 高级配置
+### 自定义硬件
+如果需要更好的性能，可以升级硬件：
+- **CPU Upgrade**: 更快的处理速度
+- **GPU T4**: 显著提升推理速度（需要付费）
+### 环境变量
+在 Space 设置中添加环境变量：
+```
+HF_HUB_ENABLE_PROGRESS_BAR=1
+TOKENIZERS_PARALLELISM=false
+```
+### 域名和访问
+- Space URL: `https://huggingface.co/spaces/YOUR_USERNAME/GENIE`
+- 可以申请自定义域名（Pro 功能）
+## 📊 监控和维护
+- 查看 Space 使用统计
+- 监控错误日志
+- 定期更新依赖包
+- 根据用户反馈优化功能
+## 🤝 社区分享
+部署成功后，您可以：
+- 在社交媒体分享您的 Space
+- 在相关社区发布
+- 收集用户反馈并持续改进
+---
+祝您部署成功！如有问题，可以参考 [Hugging Face Spaces 官方文档](https://huggingface.co/docs/hub/spaces)。

PROJECT_SUMMARY.md ADDED Viewed

	@@ -0,0 +1,118 @@

+# 🎯 项目完成总结
+## ✅ 已完成的工作
+我已经为您成功创建了一个完整的 Hugging Face Spaces 部署方案，将 Genie TTS 模型部署为 Web 应用。
+### 📁 文件结构
+```
+genie/
+├── app.py                 # 主要的 Gradio 应用文件
+├── requirements.txt       # Python 依赖包列表
+├── README.md             # 项目说明文档
+├── README.md.space       # Hugging Face Space 配置文件
+├── .gitignore           # Git 忽略文件配置
+├── DEPLOYMENT.md        # 详细部署指南
+└── PROJECT_SUMMARY.md   # 项目总结（本文件）
+```
+### 🔧 核心功能实现
+#### 1. Gradio Web 界面 (`app.py`)
+- **多标签页设计**: 语音合成、示例教程、项目信息
+- **智能角色管理**: 自动加载预训练角色模型
+- **进度显示**: 实时显示模型加载和合成进度
+- **错误处理**: 友好的错误提示和重试机制
+- **音频输出**: 支持在线播放和下载
+- **示例库**: 内置多个日语示例文本
+#### 2. 依赖管理 (`requirements.txt`)
+- **核心包**: genie-tts, gradio, torch
+- **音频处理**: librosa, soundfile, scipy
+- **模型推理**: onnxruntime
+- **Hugging Face 集成**: huggingface-hub, transformers
+- **系统监控**: psutil, rich
+#### 3. 模型管理系统
+- **自动下载**: 首次使用自动从 Hugging Face 下载模型
+- **缓存机制**: 智能缓存管理，避免重复下载
+- **内存优化**: LRU 缓存和资源清理
+- **错误恢复**: 网络错误重试和失败处理
+#### 4. 文本处理优化
+- **预处理**: 自动文本清理和标点符号规范化
+- **长度限制**: 防止过长文本导致的问题
+- **编码处理**: 正确处理日语字符编码
+- **分句支持**: 自动分句处理长文本
+### 🌟 应用特色
+1. **用户友好界面**
+   - 现代化设计，响应式布局
+   - 多标签页组织，信息层次清晰
+   - 实时进度反馈和状态显示
+   - 丰富的示例和使用指南
+2. **性能优化**
+   - CPU 优化推理，无需 GPU
+   - 智能缓存管理
+   - 内存使用监控
+   - 异常处理和资源清理
+3. **部署友好**
+   - 完整的依赖声明
+   - 环境变量配置
+   - 详细的部署指南
+   - Git 版本控制支持
+## 🚀 部署步骤
+### 快速部署
+1. 访问 [Hugging Face Spaces](https://huggingface.co/spaces)
+2. 创建新的 Space，选择 Gradio SDK
+3. 上传所有文件（将 `README.md.space` 重命名为 `README.md`）
+4. 等待自动构建完成
+### 详细步骤
+请参考 `DEPLOYMENT.md` 文件中的详细指南。
+## 📊 预期性能
+- **首次启动**: 30-60秒（下载模型）
+- **后续合成**: 5-15秒每段文本
+- **内存需求**: ~500MB RAM
+- **存储需求**: ~200MB（模型文件）
+## 🎯 支持功能
+- ✅ 日语文本转语音
+- ✅ 预训练角色 (misono_mika)
+- ✅ 实时音频播放
+- ✅ 音频文件下载
+- ✅ 示例文本库
+- ✅ 错误处理和重试
+- ✅ 响应式 Web 界面
+## 🔮 未来扩展
+可以考虑的功能扩展：
+- 添加更多预训练角色
+- 支持中文和英文TTS
+- 批量文本处理
+- 语音风格调节
+- API 接口支持
+## ⚠️ 注意事项
+1. **首次使用**: 需要下载模型文件，请确保网络连接稳定
+2. **文本限制**: 目前主要支持日语，建议文本长度控制在500字符以内
+3. **并发限制**: 免费版 Hugging Face Spaces 有并发限制
+4. **模型版本**: 基于 GPT-SoVITS V2，支持高质量语音合成
+## 🎉 部署成功！
+您现在可以按照 `DEPLOYMENT.md` 中的指南将此应用部署到 Hugging Face Spaces 上。部署成功后，用户可以通过 Web 界面轻松使用 Genie TTS 进行日语语音合成。
+---
+**祝您部署顺利！如有任何问题，请参考相关文档或联系开发者。** 🚀

README.md.space ADDED Viewed

	@@ -0,0 +1,18 @@

+title: 🔮 Genie TTS - AI语音合成
+emoji: 🎵
+colorFrom: purple
+colorTo: pink
+sdk: gradio
+sdk_version: 4.44.0
+app_file: app.py
+pinned: false
+license: mit
+short_description: 基于Genie的日语文本转语音系统
+tags:
+  - text-to-speech
+  - japanese
+  - gpt-sovits
+  - audio
+  - ai
+  - tts
+  - voice-synthesis

app.py ADDED Viewed

	@@ -0,0 +1,539 @@

+import gradio as gr
+import os
+import tempfile
+import logging
+import warnings
+import subprocess
+import sys
+from pathlib import Path
+# 设置日志
+logging.basicConfig(level=logging.INFO)
+logger = logging.getLogger(__name__)
+# 禁用一些警告
+warnings.filterwarnings("ignore", category=FutureWarning)
+warnings.filterwarnings("ignore", category=UserWarning)
+def install_genie_tts():
+    """安装genie-tts包"""
+    try:
+        import genie_tts
+        logger.info("genie-tts已安装")
+        return True
+    except ImportError:
+        logger.info("正在安装genie-tts...")
+        try:
+            subprocess.check_call([sys.executable, "-m", "pip", "install", "genie-tts"])
+            import genie_tts
+            logger.info("genie-tts安装成功")
+            return True
+        except Exception as e:
+            logger.error(f"安装genie-tts失败: {e}")
+            return False
+# 安装Genie TTS
+install_success = install_genie_tts()
+if install_success:
+    try:
+        import genie_tts as genie
+        logger.info("Genie TTS导入成功")
+    except ImportError as e:
+        logger.error(f"导入Genie TTS失败: {e}")
+        genie = None
+else:
+    genie = None
+class GenieTTSInterface:
+    def __init__(self):
+        self.available_characters = ['misono_mika']  # 预定义角色
+        self.current_character = None
+        self.model_cache_dir = self.setup_cache_directory()
+        self.is_initialized = False
+    def setup_cache_directory(self):
+        """设置模型缓存目录"""
+        cache_dir = os.path.join(tempfile.gettempdir(), "genie_tts_cache")
+        os.makedirs(cache_dir, exist_ok=True)
+        return cache_dir
+    def check_model_availability(self, character_name):
+        """检查模型是否已缓存"""
+        model_files = [
+            'prompt.wav', 'prompt_wav.json',
+            't2s_encoder_fp32.onnx', 't2s_first_stage_decoder_fp32.onnx',
+            't2s_stage_decoder_fp32.onnx', 'vits_fp32.onnx'
+        ]
+        character_cache_dir = os.path.join(self.model_cache_dir, character_name)
+        if not os.path.exists(character_cache_dir):
+            return False
+        for file_name in model_files:
+            if not os.path.exists(os.path.join(character_cache_dir, file_name)):
+                return False
+        return True
+    def initialize_genie(self):
+        """初始化Genie TTS环境"""
+        if self.is_initialized:
+            return True
+        try:
+            # 设置环境变量以优化下载
+            os.environ["HF_HUB_ENABLE_PROGRESS_BAR"] = "1"
+            os.environ["TOKENIZERS_PARALLELISM"] = "false"  # 避免警告
+            # 设置缓存目录
+            if hasattr(genie, '_internal'):
+                logger.info("Genie TTS环境初始化成功")
+            self.is_initialized = True
+            return True
+        except Exception as e:
+            logger.error(f"初始化Genie TTS失败: {e}")
+            return False
+    def load_character(self, character_name):
+        """加载角色模型"""
+        if not genie:
+            return None, "Genie TTS未正确安装"
+        if not self.initialize_genie():
+            return None, "Genie TTS初始化失败"
+        try:
+            logger.info(f"正在加载角色: {character_name}")
+            # 检查模型是否已缓存
+            if self.check_model_availability(character_name):
+                logger.info(f"使用缓存的模型: {character_name}")
+            else:
+                logger.info(f"首次下载模型: {character_name}，请稍候...")
+            # 加载预定义角色（这会自动处理下载）
+            genie.load_predefined_character(character_name)
+            self.current_character = character_name
+            return f"角色 {character_name} 加载成功！", ""
+        except Exception as e:
+            error_msg = str(e)
+            logger.error(f"加载角色失败: {error_msg}")
+            # 提供更友好的错误信息
+            if "network" in error_msg.lower() or "connection" in error_msg.lower():
+                return None, "网络连接错误，请检查网络连接后重试"
+            elif "disk space" in error_msg.lower():
+                return None, "磁盘空间不足，请清理空间后重试"
+            elif "timeout" in error_msg.lower():
+                return None, "下载超时，请重试"
+            else:
+                return None, f"加载角色失败: {error_msg}"
+    def estimate_download_size(self, character_name):
+        """估算下载大小"""
+        # 基于Genie模型的实际大小
+        model_sizes = {
+            'misono_mika': 180  # MB
+        }
+        return model_sizes.get(character_name, 200)
+    def cleanup_cache(self):
+        """清理缓存"""
+        try:
+            import shutil
+            if os.path.exists(self.model_cache_dir):
+                shutil.rmtree(self.model_cache_dir)
+                self.setup_cache_directory()
+                logger.info("缓存清理完成")
+                return True
+        except Exception as e:
+            logger.error(f"清理缓存失败: {e}")
+            return False
+    def synthesize_speech(self, text, character_name, play_audio=False):
+        """文本转语音 - 增强版"""
+        if not genie:
+            return None, "Genie TTS未正确安装"
+        if not text.strip():
+            return None, "请输入要合成的文本"
+        # 文本长度检查
+        if len(text) > 500:
+            return None, "文本过长（超过500字符），请缩短文本长度"
+        if character_name != self.current_character:
+            status, error = self.load_character(character_name)
+            if error:
+                return None, error
+        try:
+            # 文本预处理
+            processed_text = self.preprocess_text(text)
+            # 创建临时文件保存音频
+            with tempfile.NamedTemporaryFile(suffix=".wav", delete=False) as tmp_file:
+                output_path = tmp_file.name
+            logger.info(f"正在合成语音: {processed_text[:50]}...")
+            # 设置内存限制环境变量
+            original_env = os.environ.get('PYTORCH_JIT_USE_NNC_NOT_NVFUSER', None)
+            os.environ['PYTORCH_JIT_USE_NNC_NOT_NVFUSER'] = '1'
+            try:
+                # 执行TTS
+                genie.tts(
+                    character_name=character_name,
+                    text=processed_text,
+                    play=False,  # 在服务器环境不播放
+                    split_sentence=True,
+                    save_path=output_path
+                )
+            finally:
+                # 恢复环境变量
+                if original_env is None and 'PYTORCH_JIT_USE_NNC_NOT_NVFUSER' in os.environ:
+                    del os.environ['PYTORCH_JIT_USE_NNC_NOT_NVFUSER']
+                elif original_env is not None:
+                    os.environ['PYTORCH_JIT_USE_NNC_NOT_NVFUSER'] = original_env
+            # 验证输出文件
+            if not os.path.exists(output_path):
+                return None, "语音合成失败：输出文件未生成"
+            file_size = os.path.getsize(output_path)
+            if file_size == 0:
+                return None, "语音合成失败：输出文件为空"
+            elif file_size < 1000:  # 小于1KB可能是错误
+                return None, "语音合成失败：输出文件异常小"
+            logger.info(f"语音合成成功，文件大小: {file_size/1024:.1f}KB")
+            return output_path, ""
+        except Exception as e:
+            error_msg = str(e)
+            logger.error(f"语音合成失败: {error_msg}")
+            # 提供更详细的错误信息
+            if "out of memory" in error_msg.lower() or "memory" in error_msg.lower():
+                return None, "内存不足，请尝试缩短文本或重启应用"
+            elif "cuda" in error_msg.lower():
+                return None, "GPU相关错误，正在使用CPU模式重试"
+            elif "model" in error_msg.lower():
+                return None, "模型加载错误，请重新选择角色"
+            elif "timeout" in error_msg.lower():
+                return None, "处理超时，请尝试缩短文本"
+            else:
+                return None, f"语音合成失败: {error_msg}"
+    def preprocess_text(self, text):
+        """文本预处理"""
+        # 基本清理
+        text = text.strip()
+        # 替换常见的问题字符
+        replacements = {
+            '"': '"',
+            '"': '"',
+            ''': "'",
+            ''': "'",
+            '—': '一',
+            '–': '-',
+        }
+        for old, new in replacements.items():
+            text = text.replace(old, new)
+        # 确保句子有适当的标点
+        if text and not text.endswith(('。', '！', '？', '.', '!', '?')):
+            text += '。'
+        return text
+    def get_system_info(self):
+        """获取系统信息用于调试"""
+        try:
+            import psutil
+            memory = psutil.virtual_memory()
+            disk = psutil.disk_usage('/')
+            return {
+                'memory_total': f"{memory.total / (1024**3):.1f}GB",
+                'memory_available': f"{memory.available / (1024**3):.1f}GB",
+                'memory_percent': f"{memory.percent}%",
+                'disk_free': f"{disk.free / (1024**3):.1f}GB"
+            }
+        except:
+            return {"status": "无法获取系统信息"}
+# 创建接口实例
+tts_interface = GenieTTSInterface()
+def create_interface():
+    """创建Gradio界面"""
+    def tts_wrapper(text, character, progress=gr.Progress()):
+        """TTS包装函数"""
+        if not text.strip():
+            return None, "❌ 请输入要合成的文本"
+        progress(0.1, desc="准备模型...")
+        # 加载字符模型
+        if character != tts_interface.current_character:
+            progress(0.3, desc=f"加载角色模型: {character}")
+            status, error = tts_interface.load_character(character)
+            if error:
+                return None, f"❌ {error}"
+        progress(0.5, desc="正在合成语音...")
+        audio_path, error = tts_interface.synthesize_speech(text, character)
+        progress(0.9, desc="完成处理...")
+        if error:
+            return None, f"❌ {error}"
+        progress(1.0, desc="✅ 合成成功！")
+        return audio_path, f"✅ 合成成功！音频长度: {get_audio_duration(audio_path):.1f}秒"
+    def get_audio_duration(audio_path):
+        """获取音频时长"""
+        try:
+            import librosa
+            y, sr = librosa.load(audio_path, sr=None)
+            return len(y) / sr
+        except:
+            return 0
+    def clear_all():
+        """清空所有输入和输出"""
+        return "", None, "🔄 已清空所有内容"
+    def load_example(text, character):
+        """加载示例"""
+        return text, character, f"📝 已加载示例: {text[:20]}..."
+    # 定义界面
+    with gr.Blocks(
+        title="🔮 Genie TTS - 语音合成",
+        theme=gr.themes.Soft(),
+        css="""
+        .gradio-container {
+            max-width: 1200px !important;
+        }
+        .status-success {
+            color: #28a745 !important;
+        }
+        .status-error {
+            color: #dc3545 !important;
+        }
+        """
+    ) as demo:
+        gr.Markdown("""
+        # 🔮 Genie TTS - AI 语音合成系统
+        基于 [High-Logic/Genie](https://github.com/High-Logic/Genie) 的轻量级 TTS 推理引擎，支持高质量日语语音合成。
+        <div style="background: linear-gradient(90deg, #667eea 0%, #764ba2 100%); padding: 1rem; border-radius: 10px; color: white; margin: 1rem 0;">
+        <strong>🌟 功能特点</strong><br>
+        ✅ CPU 优化推理，无需 GPU<br>
+        ✅ 基于 GPT-SoVITS V2 技术<br>
+        ✅ 支持长文本自动分句<br>
+        ✅ 实时音频流输出
+        </div>
+        **📖 使用说明:** 选择角色模型 → 输入日语文本 → 点击合成按钮 → 获得高质量语音
+        """)
+        with gr.Tab("🎵 语音合成") as tts_tab:
+            with gr.Row():
+                with gr.Column(scale=1):
+                    # 角色选择
+                    with gr.Group():
+                        gr.Markdown("### 👤 角色设置")
+                        character_dropdown = gr.Dropdown(
+                            choices=tts_interface.available_characters,
+                            value="misono_mika",
+                            label="🎭 选择角色",
+                            info="当前可用的预训练角色模型",
+                            interactive=True
+                        )
+                    # 文本输入
+                    with gr.Group():
+                        gr.Markdown("### 📝 文本输入")
+                        text_input = gr.Textbox(
+                            lines=5,
+                            label="📄 输入文本",
+                            placeholder="请输入要合成的日语文本...\n例如：どうしようかな……やっぱりやりたいかも……！",
+                            info="💡 支持日语文本，建议输入完整的句子以获得更好的效果",
+                            show_copy_button=True
+                        )
+                        # 控制按钮
+                        with gr.Row():
+                            submit_btn = gr.Button(
+                                "🎵 开始合成",
+                                variant="primary",
+                                size="lg",
+                                scale=2
+                            )
+                            clear_btn = gr.Button(
+                                "🔄 清空",
+                                variant="secondary",
+                                scale=1
+                            )
+                with gr.Column(scale=1):
+                    # 音频输出
+                    with gr.Group():
+                        gr.Markdown("### 🔊 音频输出")
+                        audio_output = gr.Audio(
+                            label="🎶 生成的音频",
+                            type="filepath",
+                            interactive=False,
+                            show_download_button=True
+                        )
+                        # 状态显示
+                        status_output = gr.Textbox(
+                            label="📊 合成状态",
+                            interactive=False,
+                            show_copy_button=False
+                        )
+        # 示例和教程标签页
+        with gr.Tab("📚 示例与教程") as examples_tab:
+            gr.Markdown("### 🎯 快速示例")
+            gr.Markdown("点击下面的示例可以快速体验不同类型的文本合成效果：")
+            # 示例网格
+            with gr.Row():
+                with gr.Column():
+                    gr.Markdown("**🌅 问候语**")
+                    gr.Examples(
+                        examples=[
+                            ["おはようございます！", "misono_mika"],
+                            ["こんにちは、元気ですか？", "misono_mika"],
+                            ["お疲れさまでした", "misono_mika"]
+                        ],
+                        inputs=[text_input, character_dropdown],
+                        outputs=[text_input, character_dropdown, status_output],
+                        fn=load_example,
+                        run_on_click=True
+                    )
+                with gr.Column():
+                    gr.Markdown("**💭 情感表达**")
+                    gr.Examples(
+                        examples=[
+                            ["どうしようかな……やっぱりやりたいかも……！", "misono_mika"],
+                            ["うーん、これは難しいですね", "misono_mika"],
+                            ["わあ、すごいですね！", "misono_mika"]
+                        ],
+                        inputs=[text_input, character_dropdown],
+                        outputs=[text_input, character_dropdown, status_output],
+                        fn=load_example,
+                        run_on_click=True
+                    )
+                with gr.Column():
+                    gr.Markdown("**🎭 日常对话**")
+                    gr.Examples(
+                        examples=[
+                            ["ありがとうございます", "misono_mika"],
+                            ["さようなら、また明日", "misono_mika"],
+                            ["お先に失礼します", "misono_mika"]
+                        ],
+                        inputs=[text_input, character_dropdown],
+                        outputs=[text_input, character_dropdown, status_output],
+                        fn=load_example,
+                        run_on_click=True
+                    )
+            gr.Markdown("""
+            ### 📋 使用技巧
+            1. **文本长度**: 建议单次输入文本长度在 100 字以内，过长的文本会自动分句处理
+            2. **标点符号**: 适当使用标点符号（。！？）可以改善语音的自然度
+            3. **特殊符号**: 支持省略号（……）和感叹号（！）等情感表达
+            4. **处理时间**: 首次加载角色需要下载模型（约30秒），后续合成较快（5-10秒）
+            ### 🔧 技术说明
+            - **模型架构**: 基于 Transformer 的端到端语音合成
+            - **采样率**: 32kHz，支持高质量音频输出
+            - **推理方式**: CPU 优化的 ONNX 模型，适合云端部署
+            - **内存占用**: 约 500MB RAM，支持并发处理
+            """)
+        # 关于标签页
+        with gr.Tab("ℹ️ 关于项目") as about_tab:
+            gr.Markdown("""
+            ### 🔍 项目信息
+            **Genie TTS** 是基于 GPT-SoVITS V2 架构的轻量级语音合成引擎，专门为 CPU 推理优化。
+            #### 📊 技术规格
+            | 项目 | 规格 |
+            |------|------|
+            | **基础模型** | GPT-SoVITS V2 |
+            | **推理框架** | ONNX Runtime |
+            | **支持语言** | 日语 (Japanese) |
+            | **音频格式** | WAV, 32kHz |
+            | **推理设备** | CPU (无需 GPU) |
+            | **模型大小** | ~200MB |
+            | **内存需求** | ~500MB RAM |
+            #### � 相关链接
+            - 🏠 [项目主页](https://github.com/High-Logic/Genie)
+            - 🤗 [Hugging Face 模型](https://huggingface.co/High-Logic/Genie)
+            - 📖 [GPT-SoVITS 官方](https://github.com/RVC-Boss/GPT-SoVITS)
+            - 💬 [问题反馈](https://github.com/High-Logic/Genie/issues)
+            #### 🙏 致谢
+            感谢以下项目和开发者：
+            - [High-Logic](https://github.com/High-Logic) 团队开发的 Genie TTS
+            - [RVC-Boss](https://github.com/RVC-Boss) 团队的 GPT-SoVITS 项目
+            - Hugging Face 提供的模型托管和 Spaces 平台
+            #### ⚖️ 免责声明
+            本应用仅用于演示和研究目的。请合理使用，生成的语音内容责任由使用者承担。
+            """)
+        # 绑定事件
+        submit_btn.click(
+            fn=tts_wrapper,
+            inputs=[text_input, character_dropdown],
+            outputs=[audio_output, status_output],
+            show_progress="full",
+            queue=True
+        )
+        clear_btn.click(
+            fn=clear_all,
+            outputs=[text_input, audio_output, status_output]
+        )
+    return demo
+# 启动应用
+if __name__ == "__main__":
+    demo = create_interface()
+    demo.launch(
+        server_name="0.0.0.0",
+        server_port=7860,
+        show_api=False,
+        show_error=True,
+        quiet=False
+    )

guidance.md ADDED Viewed

	@@ -0,0 +1,67 @@

+Spaces
+Hugging Face's logo
+Tom1986
+/
+GENIE
+like
+0
+Logs
+App
+Files
+Community
+Settings
+🚀 Get started with your gradio Space!
+Your new space has been created, follow these steps to get started (or read the full documentation)
+Start by cloning this repo by using:
+Use an access token as git password/credential
+# When prompted for a password, use an access token with write permissions.
+# Generate one from your settings: https://huggingface.co/settings/tokens
+git clone https://huggingface.co/spaces/Tom1986/GENIE
+# Make sure hf CLI is installed: pip install -U "huggingface_hub[cli]"
+hf download Tom1986/GENIE --repo-type=space
+Create your gradio app.py file:
+import gradio as gr
+def greet(name):
+    return "Hello " + name + "!!"
+demo = gr.Interface(fn=greet, inputs="text", outputs="text")
+demo.launch()
+Then commit and push:
+git add app.py
+git commit -m "Add application file"
+git push
+Hint Alternatively, you can create the app.py file directly in your browser.
+Finally, your Space should be running on this page after a few moments!
+Dependencies
+You can add a requirements.txt file at the root of the repository to specify Python dependencies
+If needed, you can also add a packages.txt file at the root of the repository to specify Debian dependencies.
+The gradio package is pre-installed and its version is set in the sdk_version field in the README.md file.
+Personalize your Space
+Make your Space stand out by customizing its emoji, colors, and description by editing metadata in its README.md file.
+Documentation
+Read the full documentation for gradio Spaces here.

requirements.txt ADDED Viewed

	@@ -0,0 +1,40 @@

+# Hugging Face Spaces requirements for Genie TTS
+# Core dependencies
+gradio>=4.0.0
+torch>=2.0.0
+torchaudio>=2.0.0
+# Genie TTS package
+genie-tts
+# Audio processing
+librosa>=0.10.0
+soundfile>=0.12.0
+scipy>=1.9.0
+# ONNX Runtime for model inference
+onnxruntime>=1.16.0
+# Additional dependencies
+numpy>=1.21.0
+pandas>=1.5.0
+Pillow>=9.0.0
+# Hugging Face integrations
+huggingface-hub>=0.17.0
+transformers>=4.25.0
+# Japanese text processing (for pyopenjtalk if needed)
+# Note: pyopenjtalk might need system dependencies on some platforms
+# pyopenjtalk  # Uncomment if needed
+# System utilities
+psutil>=5.8.0
+requests>=2.25.0
+# Logging and monitoring
+rich>=12.0.0
+# File handling
+pathlib2>=2.3.0