Celune
Collection
All models and datasets related to Celune. • 11 items • Updated
“It’s not just a TTS, it’s a character.”
Celune is a single-speaker neural TTS voice built on top of Qwen3-TTS. This model defines one identity, one voice, and one consistent speaking style.
This is part of the Celune voice lineup, where each model represents a distinct interpretation of the same character.
The default voice mode. Designed for straightforward text-to-speech with clear, natural delivery and no added stylistic bias.
Characteristics
import numpy as np
import soundfile as sf
from faster_qwen3_tts import FasterQwen3TTS
model = FasterQwen3TTS.from_pretrained("lunahr/Celune-1.7B-Neutral")
buffer = []
for audio_chunk, sr, timing in model.generate_custom_voice_streaming(
text="Hello, this is Celune.",
language="Auto",
speaker="celune"
):
buffer.append(audio_chunk)
wav = np.concatenate(buffer)
sf.write("output.wav", wav, 24000)
You can influence delivery style using text or instruct prompts:
Examples:
The model will stay within the defined voice but expand expression.
Celune is a local-first TTS system focused on speed, control, and identity.
Each model in the Celune lineup is:
MIT and Qwen's Apache 2.0 license
Base model
Qwen/Qwen3-TTS-12Hz-1.7B-Base