liumaolin committed on
Commit
6556ced
·
1 Parent(s): 469433f

Refactor `test_llm_dialogue.py` to enhance multi-dataset testing for LLM dialogue

Browse files

- Introduce multiple test datasets covering diverse topics in both Chinese and English.
- Replace `user_questions` with structured `test_datasets`.
- Add `_get_prompt_by_language` for language-specific prompt handling.
- Integrate `create_langchain_pipeline` for improved pipeline creation and execution logic.

Files changed (1) hide show
  1. tests/test_llm_dialogue.py +109 -28
tests/test_llm_dialogue.py CHANGED
@@ -16,6 +16,7 @@ if lib_path.exists() and lib_path.as_posix() not in sys.path:
16
 
17
  from voice_dialogue.config import paths
18
  from voice_dialogue.config.llm_config import get_llm_model_params
 
19
 
20
  CHINESE_SYSTEM_PROMPT = (
21
  "你是善于模拟真实的思考过程的AI助手。"
@@ -41,7 +42,7 @@ class TestLLMDialogue(unittest.TestCase):
41
  self.history_store = {}
42
 
43
  model_path = paths.LLM_MODELS_PATH / 'qwen' / 'Qwen3-8B-Q6_K.gguf'
44
- langchain_instance = ChatLlamaCpp(model_path=model_path.as_posix(), **model_params)
45
 
46
  system_message = SystemMessage(content=CHINESE_SYSTEM_PROMPT)
47
  human_message = HumanMessagePromptTemplate.from_template("{input}")
@@ -51,27 +52,96 @@ class TestLLMDialogue(unittest.TestCase):
51
  human_message
52
  ])
53
 
54
- chain = prompt | langchain_instance
55
  self.chain_with_history = RunnableWithMessageHistory(chain, self.get_session_history,
56
  history_messages_key='history')
57
  self.warmup()
58
 
59
- # 连续对话测试问题集
60
- self.user_questions = [
61
- # 第1轮:开放性话题引入
62
- "最近人工智能技术发展很快,你觉得AI对我们日常生活带来了哪些改变?",
63
-
64
- # 第2轮:基于前一个回答的深入探讨
65
- "你刚才提到的这些改变中,哪一个你认为是最重要的?为什么?",
66
-
67
- # 第3轮:转向具体场景和个人观点
68
- "如果让你选择一个AI应用来帮助解决教育领域的问题,你会选择什么?具体怎么实现?",
69
-
70
- # 第4轮:挑战性问题,测试逻辑思维
71
- "但是也有人担心AI在教育中会让学生过度依赖技术,失去独立思考能力。你怎么看待这个担忧?",
72
-
73
- # 第5轮:总结性问题,测试整合能力
74
- "综合我们刚才讨论的内容,你认为在AI快速发展的时代,普通人应该如何适应和准备?"
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
75
  ]
76
 
77
  def get_session_history(self, session_id: str) -> InMemoryChatMessageHistory:
@@ -97,14 +167,25 @@ class TestLLMDialogue(unittest.TestCase):
97
  for chunk in self.chain_with_history.stream(input={'input': 'This is a warmup step.'}, config=config):
98
  pass
99
 
 
 
 
 
 
 
 
100
  def test_dialogue(self):
101
- session_id = 'test_dialogue'
102
- for user_question in self.user_questions:
103
- print('User question:', user_question)
104
- config = {"configurable": {"session_id": session_id}}
105
- print(f'LLM answer: ', end='')
106
- for chunk in self.chain_with_history.stream(input={'input': user_question}, config=config):
107
- print(chunk.content, end='')
108
- print()
109
- print('-' * 80)
110
- print()
 
 
 
 
 
16
 
17
  from voice_dialogue.config import paths
18
  from voice_dialogue.config.llm_config import get_llm_model_params
19
+ from voice_dialogue.services.text.processor import create_langchain_pipeline
20
 
21
  CHINESE_SYSTEM_PROMPT = (
22
  "你是善于模拟真实的思考过程的AI助手。"
 
42
  self.history_store = {}
43
 
44
  model_path = paths.LLM_MODELS_PATH / 'qwen' / 'Qwen3-8B-Q6_K.gguf'
45
+ self.langchain_instance = ChatLlamaCpp(model_path=model_path.as_posix(), **model_params)
46
 
47
  system_message = SystemMessage(content=CHINESE_SYSTEM_PROMPT)
48
  human_message = HumanMessagePromptTemplate.from_template("{input}")
 
52
  human_message
53
  ])
54
 
55
+ chain = prompt | self.langchain_instance
56
  self.chain_with_history = RunnableWithMessageHistory(chain, self.get_session_history,
57
  history_messages_key='history')
58
  self.warmup()
59
 
60
+ self.test_datasets = [
61
+ {
62
+ 'session_id': 'test_dataset_1',
63
+ 'language': 'zh',
64
+ 'questions': [
65
+ # 第1轮:开放性话题引入
66
+ "最近人工智能技术发展很快,你觉得AI对我们日常生活带来了哪些改变?",
67
+
68
+ # 第2轮:基于前一个回答的深入探讨
69
+ "你刚才提到的这些改变中,哪一个你认为是最重要的?为什么?",
70
+
71
+ # 第3轮:转向具体场景和个人观点
72
+ "如果让你选择一个AI应用来帮助解决教育领域的问题,你会选择什么?具体怎么实现?",
73
+
74
+ # 第4轮:挑战性问题,测试逻辑思维
75
+ "但是也有人担心AI在教育中会让学生过度依赖技术,失去独立思考能力。你怎么看待这个担忧?",
76
+
77
+ # 第5轮:总结性问题,测试整合能力
78
+ "综合我们刚才讨论的内容,你认为在AI快速发展的时代,普通人应该如何适应和准备?"
79
+ ]
80
+ },
81
+ {
82
+ 'session_id': 'test_dataset_2',
83
+ 'language': 'zh',
84
+ 'questions': [
85
+ # 第1轮:开放性话题引入
86
+ "近年来环境问题越来越受到关注,你认为我们个人在日常生活中可以为环保做些什么?",
87
+ # 第2轮:基于前一个回答的深入探讨
88
+ "在这些环保行为中,你觉得哪一种最容易被大家接受和实践?原因是什么?",
89
+ # 第3轮:转向具体场景和个人观点
90
+ "如果让你设计一个推广垃圾分类的社区活动,你会怎么做?",
91
+ # 第4轮:挑战性问题,测试逻辑思维
92
+ "有些人认为,个人的环保努力相比工业污染只是杯水车薪,这种看法你怎么看?",
93
+ # 第5轮:总结性问题,测试整合能力
94
+ "总的来说,为了实现可持续发展,你认为政府、企业和个人应该分别扮演什么样的角色?"
95
+ ]
96
+ },
97
+ {
98
+ 'session_id': 'test_dataset_3',
99
+ 'language': 'zh',
100
+ 'questions': [
101
+ # 第1轮:开放性话题引入
102
+ "随着科技的发展,未来的工作模式可能会发生很大变化,你想象中未来的工作是什么样的?",
103
+ # 第2轮:基于前一个回答的深入探讨
104
+ "你提到的远程办公和灵活工作时间,对员工和公司来说,各自最大的好处和挑战是什么?",
105
+ # 第3轮:转向具体场景和个人观点
106
+ "假设你是一名公司经理,你会如何利用技术工具来提高远程团队的协作效率?",
107
+ # 第4轮:挑战性问题,测试逻辑思维
108
+ "自动化和人工智能可能会取代一部分人的工作,这引起了很多人的焦虑。你认为我们应该如何应对这种“失业恐慌”?",
109
+ # 第5轮:总结性问题,测试整合能力
110
+ "面对未来工作的种种不确定性,你认为现在的年轻人最需要培养哪些核心能力?"
111
+ ]
112
+ },
113
+ {
114
+ 'session_id': 'test_dataset_4',
115
+ 'language': 'en',
116
+ 'questions': [
117
+ # Round 1: Open-ended topic introduction
118
+ "Mental health has become a more prominent topic recently. What are some common stressors you think people face in modern society?",
119
+ # Round 2: In-depth discussion based on the previous answer
120
+ "Of the stressors you mentioned, which one do you believe has the most significant impact on people's well-being, and why?",
121
+ # Round 3: Shift to specific scenarios and personal opinions
122
+ "If you were to design a mobile app to help people manage stress, what key features would you include?",
123
+ # Round 4: Challenging question, testing logical thinking
124
+ "Some argue that the increased focus on mental health can sometimes lead to over-diagnosis or the medicalization of normal emotions. What are your thoughts on this concern?",
125
+ # Round 5: Summarizing question, testing integration ability
126
+ "To sum up, what kind of societal changes do you think would be most effective in promoting better mental health for everyone?"
127
+ ]
128
+ },
129
+ {
130
+ 'session_id': 'test_dataset_5',
131
+ 'language': 'en',
132
+ 'questions': [
133
+ # Round 1: Open-ended topic introduction
134
+ "Humanity has always been fascinated by space. What do you see as the most exciting developments in space exploration right now?",
135
+ # Round 2: In-depth discussion based on the previous answer
136
+ "You mentioned the push towards colonizing Mars. What do you think are the biggest scientific and ethical challenges we need to overcome for that to become a reality?",
137
+ # Round 3: Shift to specific scenarios and personal opinions
138
+ "If you were given the chance to send a single message to an extraterrestrial civilization, what would it say?",
139
+ # Round 4: Challenging question, testing logical thinking
140
+ "There's a debate about whether the vast amounts of money spent on space exploration could be better used to solve problems here on Earth. How would you justify the continued investment in space programs?",
141
+ # Round 5: Summarizing question, testing integration ability
142
+ "Considering everything we've discussed, what long-term benefits do you believe humanity will gain from its ventures into space?"
143
+ ]
144
+ }
145
  ]
146
 
147
  def get_session_history(self, session_id: str) -> InMemoryChatMessageHistory:
 
167
  for chunk in self.chain_with_history.stream(input={'input': 'This is a warmup step.'}, config=config):
168
  pass
169
 
170
+ def _get_prompt_by_language(self, language: str) -> str:
171
+ """根据语言获取对应的 prompt"""
172
+ if language == "zh":
173
+ return CHINESE_SYSTEM_PROMPT
174
+ else:
175
+ return ENGLISH_SYSTEM_PROMPT
176
+
177
  def test_dialogue(self):
178
+ for test_dataset in self.test_datasets:
179
+ session_id = test_dataset.get('session_id')
180
+ print(f'Test dataset: {session_id}')
181
+ print('=' * 80)
182
+ for question in test_dataset.get('questions'):
183
+ print('Test question:', question)
184
+ config = {"configurable": {"session_id": session_id}}
185
+ prompt = self._get_prompt_by_language(test_dataset.get('language'))
186
+ pipeline = create_langchain_pipeline(self.langchain_instance, prompt, self.get_session_history)
187
+ print(f'LLM answer: ', end='')
188
+ for chunk in pipeline.stream(input={'input': question}, config=config):
189
+ print(chunk.content, end='')
190
+ print()
191
+ print('-' * 80)