How to use MoYoYoTech/VoiceDialogue with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("text-to-speech", model="MoYoYoTech/VoiceDialogue")
messages = [
{"role": "user", "content": "Who are you?"},
]
pipe(messages)
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("MoYoYoTech/VoiceDialogue", dtype="auto")
How to use MoYoYoTech/VoiceDialogue with llama.cpp:
Install from brew
brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
# Run inference directly in the terminal:
llama-cli -hf MoYoYoTech/VoiceDialogue:Q6_K
Install from WinGet (Windows)
winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
# Run inference directly in the terminal:
llama-cli -hf MoYoYoTech/VoiceDialogue:Q6_K
Use pre-built binary
# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
# Run inference directly in the terminal:
./llama-cli -hf MoYoYoTech/VoiceDialogue:Q6_K
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
# Run inference directly in the terminal:
./build/bin/llama-cli -hf MoYoYoTech/VoiceDialogue:Q6_K
Use Docker
docker model run hf.co/MoYoYoTech/VoiceDialogue:Q6_K
How to use MoYoYoTech/VoiceDialogue with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for MoYoYoTech/VoiceDialogue to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for MoYoYoTech/VoiceDialogue to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for MoYoYoTech/VoiceDialogue to start chatting
How to use MoYoYoTech/VoiceDialogue with Hermes Agent:
Start the llama.cpp server
# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
Configure Hermes
# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default MoYoYoTech/VoiceDialogue:Q6_K
Extend TTS registry functionality and integrate default system configurations: implement prioritization logic, language preference handling, and fallback mechanisms in `TTSConfigRegistry`; refactor service factory and lifespan management to support dynamic TTS selection and initialization.
a28f7e3
liumaolincommited on
Introduce core module for API lifecycle management: add configuration, service factories, service manager, and lifespan handlers to streamline application startup, shutdown, and service orchestration.
a16e0e5
liumaolincommited on
Add system utilities and initialize core modules: implement `get_system_language` and `get_system_info`, update API startup with system defaults, and integrate ASR, LLM, and speech modules for enhanced functionality.
5c0e715
liumaolincommited on
Refactor core queue initialization: move queue definitions to `constants.py` and clean up redundant imports in `main.py` for better modularity.
bfefeb3
liumaolincommited on
Introduce initial API structure for VoiceDialogue: add dependencies, middleware, and routes for ASR, TTS, system, and voice modules.
8f823b0
liumaolincommited on
Update dependencies: add FunASR, FunASR-ONNX, FastAPI, and Uvicorn to requirements.txt
d91a26b
liumaolincommited on
Refactor ASR module: introduce modular structure with ASR interface, implement FunASR and Whisper clients, add registry, and consolidate utility functions for enhanced maintainability and extensibility.
59603db
liumaolincommited on
Refactor TTS module: rename `tts_manager` to `manager` for consistency across imports and structure.
89f7f05
liumaolincommited on
Refactor speech processing: add type hint for `_process_active_voice_frame` and replace `max()` with `np.max()` for consistency.
5284873
liumaolincommited on
Refactor TTS audio generation: rename queues for clarity, update `TTSAudioGenerator` initialization, and enhance docstrings for better maintainability.
bba0d84
liumaolincommited on
Refactor TTS architecture: implement runtime interface, TTS manager, universal registry, and factory pattern to support multiple engines.
ef0d09e
liumaolincommited on
Refactor voice model structure: extract MoYoYo-specific configurations and introduce universal TTS registry.
025ca3f
liumaolincommited on
Update TTS inference to validate audio duration using soundfile.
7b86866
liumaolincommited on
Remove unused `prompt_semantic` and `reference_spec` configuration parameters from voice model definitions.
2a5dcf2
liumaolincommited on
Add thread readiness checks and is_ready property across services
e80f558
liumaolincommited on
Update FunASR punc quantized model.
d691bbc
liumaolincommited on
Using FunASR quantized model.
ac62229
liumaolincommited on
Add multilingual support and optimize LLM pipeline configuration.
2988b10
liumaolincommited on
Update .gitattributes to track GGUF files with git-lfs
ecc005d
liumaolincommited on
Add descriptions for Chinese voice models.
4643bb2
liumaolincommited on
Remove unused configuration parameters and conversation templates.
3d953ae
liumaolincommited on
Integrate FunASR service.
516d7b8
liumaolincommited on
Add FunASR models.
76e7fcd
liumaolincommited on
Update .gitattributes to track jieba dictionary and PNG files with git-lfs.
566a4da
liumaolincommited on
Update README for enhanced installation and usage guidance.