Instructions to use niobures/MOSS-TTS with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- llama-cpp-python
How to use niobures/MOSS-TTS with llama-cpp-python:
# !pip install llama-cpp-python from llama_cpp import Llama llm = Llama.from_pretrained( repo_id="niobures/MOSS-TTS", filename="models/MOSS-TTS-GGUF/MOSS_TTS_F16.gguf", )
output = llm( "Once upon a time,", max_tokens=512, echo=True ) print(output)
- Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- llama.cpp
How to use niobures/MOSS-TTS with llama.cpp:
Install from brew
brew install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf niobures/MOSS-TTS:Q4_K_M # Run inference directly in the terminal: llama-cli -hf niobures/MOSS-TTS:Q4_K_M
Install from WinGet (Windows)
winget install llama.cpp # Start a local OpenAI-compatible server with a web UI: llama-server -hf niobures/MOSS-TTS:Q4_K_M # Run inference directly in the terminal: llama-cli -hf niobures/MOSS-TTS:Q4_K_M
Use pre-built binary
# Download pre-built binary from: # https://github.com/ggerganov/llama.cpp/releases # Start a local OpenAI-compatible server with a web UI: ./llama-server -hf niobures/MOSS-TTS:Q4_K_M # Run inference directly in the terminal: ./llama-cli -hf niobures/MOSS-TTS:Q4_K_M
Build from source code
git clone https://github.com/ggerganov/llama.cpp.git cd llama.cpp cmake -B build cmake --build build -j --target llama-server llama-cli # Start a local OpenAI-compatible server with a web UI: ./build/bin/llama-server -hf niobures/MOSS-TTS:Q4_K_M # Run inference directly in the terminal: ./build/bin/llama-cli -hf niobures/MOSS-TTS:Q4_K_M
Use Docker
docker model run hf.co/niobures/MOSS-TTS:Q4_K_M
- LM Studio
- Jan
- Ollama
How to use niobures/MOSS-TTS with Ollama:
ollama run hf.co/niobures/MOSS-TTS:Q4_K_M
- Unsloth Studio
How to use niobures/MOSS-TTS with Unsloth Studio:
Install Unsloth Studio (macOS, Linux, WSL)
curl -fsSL https://unsloth.ai/install.sh | sh # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for niobures/MOSS-TTS to start chatting
Install Unsloth Studio (Windows)
irm https://unsloth.ai/install.ps1 | iex # Run unsloth studio unsloth studio -H 0.0.0.0 -p 8888 # Then open http://localhost:8888 in your browser # Search for niobures/MOSS-TTS to start chatting
Using HuggingFace Spaces for Unsloth
# No setup required # Open https://huggingface.co/spaces/unsloth/studio in your browser # Search for niobures/MOSS-TTS to start chatting
- Docker Model Runner
How to use niobures/MOSS-TTS with Docker Model Runner:
docker model run hf.co/niobures/MOSS-TTS:Q4_K_M
- Lemonade
How to use niobures/MOSS-TTS with Lemonade:
Pull the model
# Download Lemonade from https://lemonade-server.ai/ lemonade pull niobures/MOSS-TTS:Q4_K_M
Run and chat with the model
lemonade run user.MOSS-TTS-Q4_K_M
List all available models
lemonade list
| *.7z filter=lfs diff=lfs merge=lfs -text | |
| *.arrow filter=lfs diff=lfs merge=lfs -text | |
| *.bin filter=lfs diff=lfs merge=lfs -text | |
| *.bz2 filter=lfs diff=lfs merge=lfs -text | |
| *.ckpt filter=lfs diff=lfs merge=lfs -text | |
| *.ftz filter=lfs diff=lfs merge=lfs -text | |
| *.gz filter=lfs diff=lfs merge=lfs -text | |
| *.h5 filter=lfs diff=lfs merge=lfs -text | |
| *.joblib filter=lfs diff=lfs merge=lfs -text | |
| *.lfs.* filter=lfs diff=lfs merge=lfs -text | |
| *.mlmodel filter=lfs diff=lfs merge=lfs -text | |
| *.model filter=lfs diff=lfs merge=lfs -text | |
| *.msgpack filter=lfs diff=lfs merge=lfs -text | |
| *.npy filter=lfs diff=lfs merge=lfs -text | |
| *.npz filter=lfs diff=lfs merge=lfs -text | |
| *.onnx filter=lfs diff=lfs merge=lfs -text | |
| *.ot filter=lfs diff=lfs merge=lfs -text | |
| *.parquet filter=lfs diff=lfs merge=lfs -text | |
| *.pb filter=lfs diff=lfs merge=lfs -text | |
| *.pickle filter=lfs diff=lfs merge=lfs -text | |
| *.pkl filter=lfs diff=lfs merge=lfs -text | |
| *.pt filter=lfs diff=lfs merge=lfs -text | |
| *.pth filter=lfs diff=lfs merge=lfs -text | |
| *.rar filter=lfs diff=lfs merge=lfs -text | |
| *.safetensors filter=lfs diff=lfs merge=lfs -text | |
| saved_model/**/* filter=lfs diff=lfs merge=lfs -text | |
| *.tar.* filter=lfs diff=lfs merge=lfs -text | |
| *.tar filter=lfs diff=lfs merge=lfs -text | |
| *.tflite filter=lfs diff=lfs merge=lfs -text | |
| *.tgz filter=lfs diff=lfs merge=lfs -text | |
| *.wasm filter=lfs diff=lfs merge=lfs -text | |
| *.xz filter=lfs diff=lfs merge=lfs -text | |
| *.zip filter=lfs diff=lfs merge=lfs -text | |
| *.zst filter=lfs diff=lfs merge=lfs -text | |
| *tfevents* filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-GGUF/first_class/MOSS_TTS_FIRST_CLASS_F16.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-GGUF/first_class/MOSS_TTS_FIRST_CLASS_Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-GGUF/MOSS_TTS_F16.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-GGUF/MOSS_TTS_Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-GGUF/MOSS_TTS_Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-GGUF/MOSS_TTS_Q6_K.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-GGUF/MOSS_TTS_Q8_0.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-GGUF/tokenizer/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_ref/david-attenborough.mp3 filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_ref/female_shadowheart.flac filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_ref/male_old_movie.flac filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_ref/male_petergriffin.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_ref/male_stewie.mp3 filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_ref/rick-sanchez.mp3 filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_10_decoderint8.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_11_allfp32.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_12_allfp32.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_13_allfp32.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_14_allfp32.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_15_allfp32.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_16_decoderint8.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_3_allfp32.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_4_decoderint8.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_5_allfp32.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_6_decoderint8.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_7_decoderint8.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_8_decoderint8.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/audio_synth/test_basic_streaming-onnx_9_decoderint8.wav filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/onnx_models_quantized/codec_decoder_int8/codec_decoder_int8.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/onnx_models/backbone_f32/backbone_f32.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/onnx_models/codec_decoder/codec_decoder.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/onnx_models/codec_encoder/codec_encoder.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/onnx_models/local_transformer_f32/local_transformer_f32.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-ONNX/tokenizers/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| MOSS-TTS[[:space:]]Technical[[:space:]]Report.pdf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-100M-ONNX/moss_tts_global_shared.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-100M-ONNX/moss_tts_local_shared.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Local-Transformer/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-100M/assets/images/arch_moss_audio_tokenizer_nano.png filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-100M/assets/images/concept.png filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Local-1.7B-ONNX/moss_tts_local17b_decode_step_int8.onnx.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Local-1.7B-ONNX/moss_tts_local17b_local_fixed_sampled_frame_int8.onnx.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Local-1.7B-ONNX/moss_tts_local17b_prefill_int8.onnx.data filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Local-1.7B-ONNX/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-GGUF/codec-f16.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-GGUF/codec-f32.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-GGUF/codec-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-GGUF/codec-q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Nano-GGUF/codec-q8_0.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-GGUF/codec-f16.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-GGUF/codec-f32.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-GGUF/codec-q4_k_m.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-GGUF/codec-q5_k_m.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-Realtime-GGUF/codec-q8_0.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-v1.5-GGUF/moss-tts-v1.5-q8_0.extras.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-v1.5-GGUF/moss-tts-v1.5-q8_0.gguf filter=lfs diff=lfs merge=lfs -text | |
| models/MOSS-TTS-v1.5/tokenizer.json filter=lfs diff=lfs merge=lfs -text | |