daihui.zhang committed on Apr 16, 2025

Commit ce0e589

1 Parent(s): 3ec4a4f

add text length threshold

Files changed (4)

config.py +2 -0
tests/test_whisper_cpp.py +2 -2
transcribe/pipelines/pipe_translate.py +7 -3
transcribe/strategy.py +4 -4

config.py

@@ -21,6 +21,8 @@ console_formatter = logging.Formatter("%(asctime)s - %(levelname)s - %(message)s
 console_handler.setFormatter(console_formatter)
 logging.getLogger().addHandler(console_handler)
+# Text output length threshold
+TEXT_THREHOLD = 16
 BASE_DIR = pathlib.Path(__file__).parent
 MODEL_DIR = BASE_DIR / "moyoyo_asr_models"

tests/test_whisper_cpp.py

@@ -3,7 +3,7 @@ import config
 import soundfile
 from pywhispercpp.utils import to_timestamp
-mel, _, = soundfile.read("/Users/david/Samples/Audio/en/sample-10.wav")
+mel, _, = soundfile.read("test/6_before_cut_56640.wav")
 # mel, _, = soundfile.read(f"{config.ASSERT_DIR}/jfk.flac")
 models_dir = config.MODEL_DIR.as_posix()
@@ -19,7 +19,7 @@ model = Model(
               no_context=True
               )
 print(mel.shape, mel.dtype) # (160000,) float64
-segments = model.transcribe(mel[:, 0],
+segments = model.transcribe(mel,
                             # initial_prompt="",# 'The following is an English sentence.', # "以下是简体中文句子。"
                             language='en',
                             # initial_prompt="以下是简体中文句子。",

transcribe/pipelines/pipe_translate.py

@@ -2,7 +2,7 @@
 from .base import MetaItem, BasePipe, Segment
 from llama_cpp import Llama
 from ..helpers.translator import QwenTranslator
-from config import LLM_MODEL_PATH, LLM_SYS_PROMPT_EN, LLM_SYS_PROMPT_ZH, LLM_LARGE_MODEL_PATH
+from config import LLM_MODEL_PATH, LLM_SYS_PROMPT_EN, LLM_SYS_PROMPT_ZH, LLM_LARGE_MODEL_PATH, ALL_MARKERS
 class TranslatePipe(BasePipe):
@@ -16,8 +16,12 @@ class TranslatePipe(BasePipe):
     def process(self, in_data: MetaItem) -> MetaItem:
         context = in_data.transcribe_content
-        result = self.translator.translate(
-            context, src_lang=in_data.source_language, dst_lang=in_data.destination_language)
+        all_punctuatioin = all([ch in ALL_MARKERS for ch in context])
+        if all_punctuatioin:
+            result = ""
+        else:
+            result = self.translator.translate(
+                context, src_lang=in_data.source_language, dst_lang=in_data.destination_language)
         in_data.translate_content = result
         return in_data
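
Review note: the new branch in `process` skips the translator when the transcribed text contains nothing but marker characters. The sketch below isolates that check so it can be tried on its own. The contents of `ALL_MARKERS` are not shown in this diff, so the set used here is a hypothetical stand-in, and `is_only_markers` and `translate_or_skip` are illustrative names, not functions from the repository.

```python
# Hypothetical stand-in for config.ALL_MARKERS; the real value is not shown in this diff.
ALL_MARKERS = set(",.!?;:，。！？；：、")


def is_only_markers(text, markers=ALL_MARKERS):
    """Return True when every character of text is a marker.

    all() over an empty iterable is True, so an empty string also counts.
    """
    return all(ch in markers for ch in text)


def translate_or_skip(text, translate):
    """Call translate(text) unless text holds nothing but markers."""
    if is_only_markers(text):
        return ""
    return translate(text)
```

Because `all()` returns `True` for an empty sequence, an empty `transcribe_content` also takes the skip path and yields an empty translation, which appears to be the desired behavior here.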

transcribe/strategy.py

@@ -8,7 +8,7 @@ from typing import List, Tuple, Optional, Deque, Any, Iterator,Literal
 from config import SENTENCE_END_MARKERS, ALL_MARKERS,SENTENCE_END_PATTERN,REGEX_MARKERS, PAUSEE_END_PATTERN,SAMPLE_RATE
 from enum import Enum
 import wordninja
+import config
 import re
 logger = logging.getLogger("TranscriptionStrategy")
@@ -199,7 +199,7 @@ class TranscriptBuffer:
         count = 0
         current_sentences = []
-        while len(self._sentences) and count < 20:
+        while len(self._sentences): # and count < 20:
             item = self._sentences.popleft()
             current_sentences.append(item)
             if self._separator:
@@ -265,10 +265,10 @@ class TranscriptBuffer:
                 self.update_pending_text(stable_str)
                 self.commit_line()
-            current_text_len =  len(self.current_not_commit_text.split(self._separator)) if self._separator else len(self.current_not_commit_text)
+            current_text_len = len(self.current_not_commit_text.split(self._separator)) if self._separator else len(self.current_not_commit_text)
             # current_text_len = len(self.current_not_commit_text.split(self._separator))
             self.update_pending_text(remaining_string)
-            if current_text_len >= 20:
+            if current_text_len >= config.TEXT_THREHOLD:
                 self.commit_paragraph()
                 self._current_seg_id += 1
                 return True
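
Review note: the paragraph commit now fires at `config.TEXT_THREHOLD` (16) instead of the hard-coded 20. The length is counted in separator-delimited tokens when a separator is set and in characters otherwise. A minimal sketch of that rule follows; `text_length` and `should_commit_paragraph` are hypothetical helper names used only for illustration, and the assumption that the separator is a space for English and empty for Chinese is not confirmed by this diff.

```python
TEXT_THREHOLD = 16  # value added to config.py in this commit (identifier spelled as in the source)


def text_length(text, separator):
    """Count separator-delimited tokens when a separator is set, else characters."""
    return len(text.split(separator)) if separator else len(text)


def should_commit_paragraph(text, separator, threshold=TEXT_THREHOLD):
    """Return True once the uncommitted text reaches the threshold."""
    return text_length(text, separator) >= threshold
```

Note that `"".split(" ")` returns `[""]`, so with a non-empty separator an empty buffer is counted as length 1 rather than 0. That is far below the threshold, so it does not affect the commit decision.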