Instructions to use MoYoYoTech/VoiceDialogue with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use MoYoYoTech/VoiceDialogue with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-to-speech", model="MoYoYoTech/VoiceDialogue")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("MoYoYoTech/VoiceDialogue", dtype="auto")

llama-cpp-python

How to use MoYoYoTech/VoiceDialogue with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="MoYoYoTech/VoiceDialogue",
	filename="assets/models/llm/qwen/Qwen3-8B-Q6_K.gguf",
)

llm.create_chat_completion(
	messages = "\"The answer to the universe is 42\""
)

Notebooks
Google Colab
Kaggle
Local Apps Settings

llama.cpp

How to use MoYoYoTech/VoiceDialogue with llama.cpp:

Install from brew

brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
# Run inference directly in the terminal:
llama-cli -hf MoYoYoTech/VoiceDialogue:Q6_K

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
# Run inference directly in the terminal:
llama-cli -hf MoYoYoTech/VoiceDialogue:Q6_K

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
# Run inference directly in the terminal:
./llama-cli -hf MoYoYoTech/VoiceDialogue:Q6_K

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K
# Run inference directly in the terminal:
./build/bin/llama-cli -hf MoYoYoTech/VoiceDialogue:Q6_K

Use Docker

docker model run hf.co/MoYoYoTech/VoiceDialogue:Q6_K

LM Studio
Jan
Ollama
How to use MoYoYoTech/VoiceDialogue with Ollama:
```
ollama run hf.co/MoYoYoTech/VoiceDialogue:Q6_K
```

Unsloth Studio

How to use MoYoYoTech/VoiceDialogue with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for MoYoYoTech/VoiceDialogue to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for MoYoYoTech/VoiceDialogue to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for MoYoYoTech/VoiceDialogue to start chatting

How to use MoYoYoTech/VoiceDialogue with Pi:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "llama-cpp": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "MoYoYoTech/VoiceDialogue:Q6_K"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use MoYoYoTech/VoiceDialogue with Hermes Agent:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf MoYoYoTech/VoiceDialogue:Q6_K

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default MoYoYoTech/VoiceDialogue:Q6_K

Run Hermes

hermes

Docker Model Runner
How to use MoYoYoTech/VoiceDialogue with Docker Model Runner:
```
docker model run hf.co/MoYoYoTech/VoiceDialogue:Q6_K
```

Lemonade

How to use MoYoYoTech/VoiceDialogue with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull MoYoYoTech/VoiceDialogue:Q6_K

Run and chat with the model

lemonade run user.VoiceDialogue-Q6_K

List all available models

lemonade list

VoiceDialogue / src

Commit History

Update static file routing and root endpoint for frontend integration

f7b034a

liumaolin commited on Jun 13, 2025

Add robust lifecycle management for `audio_player` service in system routes

627c3e7

liumaolin commited on Jun 12, 2025

Standardize service lifecycle management by replacing `stop` with `exit` and introducing `is_exited` check

f5226c0

liumaolin commited on Jun 12, 2025

Remove `voice_schemas.py` and refactor schema imports for TTS and ASR modules in `init.py`

4895dc2

liumaolin commited on Jun 12, 2025

Refactor speech recognizer, audio capture, and system routes for improved clarity and functionality

037e5ae

liumaolin commited on Jun 12, 2025

Add pause and resume functionality to voice dialogue system

d701b8a

liumaolin commited on Jun 12, 2025

Refactor project: split `main.py` functionality into modular components under `cli`, `core`, and `config`.

d08a15b

liumaolin commited on Jun 12, 2025

Increase service startup timeouts and set daemon mode for services.

61524a8

liumaolin commited on Jun 12, 2025

Refactor imports in `whisper.py` and `funasr.py` to use absolute paths for `ensure_minimum_audio_duration`.

d673573

liumaolin commited on Jun 11, 2025

Update `moyoyo.py`: add fallback for `utils` to ensure `HParams` availability in runtime.

bd3673b

liumaolin commited on Jun 11, 2025

Refactor imports for consistency in `kokoro.py` and `processor.py`. Use absolute paths for better readability and maintainability.

8630353

liumaolin commited on Jun 11, 2025

Update `paths.py`: improve PROJECT_ROOT resolution with `_MEIPASS` support and enhance third-party path handling.

664d767

liumaolin commited on Jun 11, 2025

Rename 'src/VoiceDialogue' to 'src/voice_dialogue'.

511ff0c

liumaolin commited on Jun 10, 2025

Revamp API core description: expand feature details for ASR, LLMs, TTS, system control, and real-time communication; improve clarity and structure of documentation.

c57de2a

liumaolin commited on Jun 9, 2025

Update project requirements.

ccdd95f

liumaolin commited on Jun 9, 2025

Integrate WebSocket support: add `/api/v1/ws` endpoint, enable real-time message handling via `websocket_message_queue`, and refactor services and models to support WebSocket-based question and answer updates.

2534744

liumaolin commited on Jun 9, 2025

Refactor `SessionIdManager` module: rename `session_id_manager.py` to `session_manager.py` and update imports accordingly.

83ef092

liumaolin commited on Jun 9, 2025

Refactor session ID handling: replace `current_session_id` with `SessionIdManager` for thread-safe management, update related imports and references.

92bb56d

liumaolin commited on Jun 9, 2025

Refactor `init.py` in TTS runtime: streamline `all` handling, improve logging for import failures, and enhance maintainability of module exports.

6f036c6

liumaolin commited on Jun 9, 2025

Serve static frontend assets through FastAPI: mount static files and replace root endpoint response with `index.html`.

ad7bf8d

liumaolin commited on Jun 9, 2025

Handle `UnboundLocalError` in punctuation model lookup: add exception handling to ensure stability during transcription.

933e84c

liumaolin commited on Jun 9, 2025

Remove `VoiceDialogue` API, models, and settings: clean up unused modules and dependencies for the decommissioned service.

3eb6daa

liumaolin commited on Jun 6, 2025

Refactor assets files

1d3b1b4

liumaolin commited on Jun 6, 2025

Refactor directories across services: rename `audio_generator` to `generators`, `asr` to `recognizers`, and update all import paths for consistency and improved module organization.

919ff3f

liumaolin commited on Jun 6, 2025

Refactor imports across services: replace `services.core` module references with `core` for consistency and maintainability; remove unused `Queue` imports.

619c761

liumaolin commited on Jun 6, 2025

Refactor imports in TTS and ASR modules: switch to absolute imports for improved clarity and maintainability.

d7d0d96

liumaolin commited on Jun 6, 2025

Remove trailing whitespace in `audio_generator/manager.py` and `asr/manager.py` for improved code cleanliness and consistency.

f08ef5f

liumaolin commited on Jun 6, 2025

Refactor ASR manager: remove `_get_asr_supported_languages`, replace static language mapping with `supported_langs` attribute, and update dynamic module import to use `importlib.util` for improved maintainability.

8acaad0

liumaolin commited on Jun 6, 2025