Spaces:

jcudit
/

voice-tools

Paused

jcudit HF Staff commited on Dec 28, 2025

Commit

03cad88

1 Parent(s): 09108e1

refactor: rename Voice Profiler to Voice Tools throughout codebase

Update all references from 'Voice Profiler' to 'Voice Tools':
- CLI command headers and descriptions
- Web interface title and description
- Benchmark script output
- Package docstrings
- remove .space and .spacesrc (not needed for HF Spaces deployment)
- update benchmark.py docstring to Voice Tools

This aligns with the project rename completed in previous commits.

Files changed (13) hide show

.space/README.md +0 -45
.spacesrc +0 -2
README.md +6 -6
app.py +2 -2
pyproject.toml +1 -1
scripts/benchmark.py +3 -3
src/cli/__init__.py +1 -1
src/cli/denoise.py +1 -1
src/cli/extract_speaker.py +1 -1
src/cli/main.py +7 -7
src/cli/separate.py +1 -1
src/web/__init__.py +1 -1
src/web/app.py +5 -5

.space/README.md DELETED Viewed

@@ -1,45 +0,0 @@
----
-title: Voice Profiler
-emoji: 🎤
-colorFrom: blue
-colorTo: purple
-sdk: gradio
-sdk_version: 5.49.1
-app_file: app.py
-pinned: false
-license: mit
-hardware: zero-gpu
----
-# Voice Profiler
-AI-powered voice separation, extraction, and denoising tool.
-## Features
-- **Speaker Separation**: Automatically separate multiple speakers from mixed audio
-- **Speaker Extraction**: Extract a specific speaker using a reference clip
-- **Voice Denoising**: Remove background noise and silence from audio
-## Technology
-Powered by:
-- PyAnnote Audio for speaker diarization and embeddings
-- Silero VAD for voice activity detection
-- HuggingFace ZeroGPU for fast GPU-accelerated processing
-## Usage
-1. Select a workflow from the tabs
-2. Upload your audio file
-3. Configure settings (optional)
-4. Click "Process" and wait for results
-## Requirements
-- Audio files in M4A, WAV, or MP3 format
-- For speaker extraction, provide a clean reference clip (minimum 3 seconds)
-## License
-MIT License - See LICENSE file for details

.spacesrc DELETED Viewed

	@@ -1,2 +0,0 @@
1	- #!/bin/bash
2	- pip install -e .

README.md CHANGED Viewed

@@ -1,5 +1,5 @@
 ---
-title: Voice Profiler
 emoji: 🎤
 colorFrom: blue
 colorTo: purple
@@ -11,11 +11,11 @@ license: mit
 hardware: zero-gpu
 ---
-# Voice Profiler
 **Extract target voice from mixed audio files for video generation**
-Voice Profiler is a tool that extracts a specific person's voice (speech and nonverbal sounds) from audio files containing background noise, music, and other speakers. It uses open-source AI models running locally on CPU to identify and isolate your target voice.
 ## Features
@@ -72,7 +72,7 @@ sudo apt-get install ffmpeg
 **Windows**:
 Download from [ffmpeg.org](https://ffmpeg.org/download.html)
-### 2. Install Voice Profiler
 ```bash
 # Clone the repository
@@ -107,7 +107,7 @@ huggingface-cli login
 ### Web Interface (Recommended for Beginners)
-The easiest way to use Voice Profiler is through the web interface:
 ```bash
 voice-tools web
@@ -235,7 +235,7 @@ voice-tools scan input.m4a
 ## HuggingFace Spaces Deployment
-Voice Profiler supports deployment to HuggingFace Spaces with GPU acceleration using ZeroGPU. This provides 10-20x faster processing compared to CPU-only execution.
 ### Prerequisites

 ---
+title: Voice Tools
 emoji: 🎤
 colorFrom: blue
 colorTo: purple
 hardware: zero-gpu
 ---
+# Voice Tools
 **Extract target voice from mixed audio files for video generation**
+Voice Tools is a tool that extracts a specific person's voice (speech and nonverbal sounds) from audio files containing background noise, music, and other speakers. It uses open-source AI models running locally on CPU to identify and isolate your target voice.
 ## Features
 **Windows**:
 Download from [ffmpeg.org](https://ffmpeg.org/download.html)
+### 2. Install Voice Tools
 ```bash
 # Clone the repository
 ### Web Interface (Recommended for Beginners)
+The easiest way to use Voice Tools is through the web interface:
 ```bash
 voice-tools web
 ## HuggingFace Spaces Deployment
+Voice Tools supports deployment to HuggingFace Spaces with GPU acceleration using ZeroGPU. This provides 10-20x faster processing compared to CPU-only execution.
 ### Prerequisites

app.py CHANGED Viewed

@@ -1,6 +1,6 @@
 #!/usr/bin/env python3
 """
-HuggingFace Spaces entry point for Voice Profiler.
 This file serves as the main entry point when deploying to HuggingFace Spaces
 with ZeroGPU support.
@@ -36,7 +36,7 @@ logger = logging.getLogger(__name__)
 # Log environment information
 from src.config.gpu_config import GPUConfig
-logger.info("Voice Profiler starting on HuggingFace Spaces")
 logger.info(f"Environment: {GPUConfig.get_environment_type()}")
 logger.info(f"GPU Available: {GPUConfig.GPU_AVAILABLE}")
 logger.info(f"ZeroGPU Mode: {GPUConfig.IS_ZEROGPU}")

 #!/usr/bin/env python3
 """
+HuggingFace Spaces entry point for Voice Tools.
 This file serves as the main entry point when deploying to HuggingFace Spaces
 with ZeroGPU support.
 # Log environment information
 from src.config.gpu_config import GPUConfig
+logger.info("Voice Tools starting on HuggingFace Spaces")
 logger.info(f"Environment: {GPUConfig.get_environment_type()}")
 logger.info(f"GPU Available: {GPUConfig.GPU_AVAILABLE}")
 logger.info(f"ZeroGPU Mode: {GPUConfig.IS_ZEROGPU}")

pyproject.toml CHANGED Viewed

@@ -6,7 +6,7 @@ readme = "README.md"
 requires-python = ">=3.10"
 license = {text = "MIT"}
 authors = [
-    {name = "Voice Profiler Contributors"}
 ]
 keywords = ["audio", "voice-extraction", "speaker-diarization", "ml", "huggingface"]
 classifiers = [

 requires-python = ">=3.10"
 license = {text = "MIT"}
 authors = [
+    {name = "Voice Tools Contributors"}
 ]
 keywords = ["audio", "voice-extraction", "speaker-diarization", "ml", "huggingface"]
 classifiers = [

scripts/benchmark.py CHANGED Viewed

@@ -1,6 +1,6 @@
 #!/usr/bin/env python3
 """
-Performance benchmarking script for Voice Profiler.
 Validates all success criteria (SC-001 through SC-008) from the specification.
 """
@@ -324,7 +324,7 @@ def benchmark_quality_preservation(results: BenchmarkResults):
 def main():
-    parser = argparse.ArgumentParser(description="Benchmark Voice Profiler performance")
     parser.add_argument(
         "--audio-dir",
         type=Path,
@@ -343,7 +343,7 @@ def main():
     results = BenchmarkResults()
     print("\n" + "=" * 80)
-    print("VOICE PROFILER PERFORMANCE BENCHMARK")
     print("=" * 80 + "\n")
     # Find test files

 #!/usr/bin/env python3
 """
+Performance benchmarking script for Voice Tools.
 Validates all success criteria (SC-001 through SC-008) from the specification.
 """
 def main():
+    parser = argparse.ArgumentParser(description="Benchmark Voice Tools performance")
     parser.add_argument(
         "--audio-dir",
         type=Path,
     results = BenchmarkResults()
     print("\n" + "=" * 80)
+    print("VOICE TOOLS PERFORMANCE BENCHMARK")
     print("=" * 80 + "\n")
     # Find test files

src/cli/__init__.py CHANGED Viewed

	@@ -1 +1 @@
1	- """CLI package for Voice ~~Profiler~~."""


1	+ """CLI package for Voice Tools."""

src/cli/denoise.py CHANGED Viewed

@@ -102,7 +102,7 @@ def denoise(
         # Keep more audio (less aggressive)
         voice-tools denoise noisy_audio.m4a --vad-threshold 0.3 --silence-threshold 3.0
     """
-    console.print("\n[bold cyan]Voice Profiler - Voice Denoising[/bold cyan]\n")
     # Validate input file
     if not input_file.exists():

         # Keep more audio (less aggressive)
         voice-tools denoise noisy_audio.m4a --vad-threshold 0.3 --silence-threshold 3.0
     """
+    console.print("\n[bold cyan]Voice Tools - Voice Denoising[/bold cyan]\n")
     # Validate input file
     if not input_file.exists():

src/cli/extract_speaker.py CHANGED Viewed

@@ -130,7 +130,7 @@ def extract_speaker(
           --no-concatenate --output ./alice_segments/
     """
     console.print()
-    console.print("[bold]Voice Profiler - Speaker Extraction[/bold]")
     console.print()
     try:

           --no-concatenate --output ./alice_segments/
     """
     console.print()
+    console.print("[bold]Voice Tools - Speaker Extraction[/bold]")
     console.print()
     try:

src/cli/main.py CHANGED Viewed

@@ -1,5 +1,5 @@
 """
-Main CLI entry point for Voice Profiler.
 Provides command-line interface for voice extraction and profiling tasks.
 """
@@ -41,7 +41,7 @@ logger = logging.getLogger(__name__)
 @click.version_option(version="0.1.0", prog_name="voice-tools")
 def cli():
     """
-    Voice Profiler - Extract and profile voices from audio files.
     This tool helps you extract specific voices from audio files using
     speaker diarization and voice matching. It can separate speech from
@@ -161,7 +161,7 @@ def extract(
     if verbose:
         logging.getLogger().setLevel(logging.DEBUG)
-    display_header("Voice Profiler - Extract Voice Segments")
     # Validate reference file
     if not reference_file.exists():
@@ -292,7 +292,7 @@ def scan(audio_file: Path, vad_threshold: float):
     \b
     voice-tools scan input.m4a
     """
-    display_header("Voice Profiler - Voice Activity Scan")
     processor = BatchProcessor(vad_threshold=vad_threshold)
@@ -361,7 +361,7 @@ def web(host: str, port: int, share: bool):
     """
     from ..web.app import launch
-    display_header("Voice Profiler - Web Interface")
     display_info(f"Starting web server on http://{host}:{port}")
     if share:
@@ -381,7 +381,7 @@ def web(host: str, port: int, share: bool):
 @cli.command()
 def info():
     """
-    Display information about Voice Profiler.
     Shows configuration, model information, and system details.
     """
@@ -391,7 +391,7 @@ def info():
     from ..services.model_manager import ModelManager
     from .progress import console
-    display_header("Voice Profiler - System Information")
     # Version info
     info_table = Table(title="Version", show_header=False)

 """
+Main CLI entry point for Voice Tools.
 Provides command-line interface for voice extraction and profiling tasks.
 """
 @click.version_option(version="0.1.0", prog_name="voice-tools")
 def cli():
     """
+    Voice Tools - Extract and profile voices from audio files.
     This tool helps you extract specific voices from audio files using
     speaker diarization and voice matching. It can separate speech from
     if verbose:
         logging.getLogger().setLevel(logging.DEBUG)
+    display_header("Voice Tools - Extract Voice Segments")
     # Validate reference file
     if not reference_file.exists():
     \b
     voice-tools scan input.m4a
     """
+    display_header("Voice Tools - Voice Activity Scan")
     processor = BatchProcessor(vad_threshold=vad_threshold)
     """
     from ..web.app import launch
+    display_header("Voice Tools - Web Interface")
     display_info(f"Starting web server on http://{host}:{port}")
     if share:
 @cli.command()
 def info():
     """
+    Display information about Voice Tools.
     Shows configuration, model information, and system details.
     """
     from ..services.model_manager import ModelManager
     from .progress import console
+    display_header("Voice Tools - System Information")
     # Version info
     info_table = Table(title="Version", show_header=False)

src/cli/separate.py CHANGED Viewed

@@ -126,7 +126,7 @@ def separate(
         # Display header
         if not quiet:
-            console.print("\n[bold cyan]Voice Profiler - Speaker Separation[/bold cyan]\n")
         # Create output directory
         output_dir.mkdir(parents=True, exist_ok=True)

         # Display header
         if not quiet:
+            console.print("\n[bold cyan]Voice Tools - Speaker Separation[/bold cyan]\n")
         # Create output directory
         output_dir.mkdir(parents=True, exist_ok=True)

src/web/__init__.py CHANGED Viewed

	@@ -1 +1 @@
1	- """Web interface package for Voice ~~Profiler~~."""


1	+ """Web interface package for Voice Tools."""

src/web/app.py CHANGED Viewed

@@ -1,5 +1,5 @@
 """
-Gradio web interface for Voice Profiler.
 Provides a user-friendly web UI for uploading audio files, configuring
 extraction parameters, and downloading results.
@@ -49,11 +49,11 @@ def create_app() -> gr.Blocks:
         Configured Gradio Blocks app
     """
-    with gr.Blocks(title="Voice Profiler") as app:
         # Header
         gr.Markdown(
             """
-            # 🎤 Voice Profiler
             Extract and profile specific voices from audio files using AI-powered
             speaker diarization and voice matching.
@@ -238,7 +238,7 @@ def create_app() -> gr.Blocks:
             """
             ---
             <div class="footer">
-            Voice Profiler v0.1.0 | Powered by Gradio, PyAnnote, and Transformers
             </div>
             """,
             elem_classes=["footer"],
@@ -266,7 +266,7 @@ def launch(
     app = create_app()
-    logger.info(f"Launching Voice Profiler web interface on {server_name}:{server_port}")
     app.launch(
         server_name=server_name,

 """
+Gradio web interface for Voice Tools.
 Provides a user-friendly web UI for uploading audio files, configuring
 extraction parameters, and downloading results.
         Configured Gradio Blocks app
     """
+    with gr.Blocks(title="Voice Tools") as app:
         # Header
         gr.Markdown(
             """
+            # 🎤 Voice Tools
             Extract and profile specific voices from audio files using AI-powered
             speaker diarization and voice matching.
             """
             ---
             <div class="footer">
+            Voice Tools v0.1.0 | Powered by Gradio, PyAnnote, and Transformers
             </div>
             """,
             elem_classes=["footer"],
     app = create_app()
+    logger.info(f"Launching Voice Tools web interface on {server_name}:{server_port}")
     app.launch(
         server_name=server_name,