pltobing/Qwen3-TTS-Streaming-ONNX
Tags: Text-to-Speech, ONNX, 10 languages, TTS, qwen3-tts, voice-clone, streaming, qwen3, vq, rvq, ecapa-tdnn, multilingual
arxiv: 2601.15621
License: apache-2.0
Commit History: Qwen3-TTS-Streaming-ONNX/src (main)
4e24bde  feat: enhance continuous stream. and preproc. (pltobing committed 22 days ago)
58a6a37  feat: add standalone text processors of Qwen3-TTS (pltobing committed 24 days ago)
6c8db5a  feat: improve the last chunk with dynamic codec (pltobing committed 28 days ago)
4d53432  feat: further improve the handling of cont. stream (pltobing committed 29 days ago)
08c67e2  feat: revise handling of generated wav for spk-emb (pltobing committed 29 days ago)
b5ac9cc  feat: revised the continuous streaming to work (pltobing committed about 1 month ago)
baad676  fix: remove prefill append and use reset_turn (pltobing committed Apr 27)
5773b24  feat: continuous streaming w/ list of string chunks (pltobing committed Apr 27)
5cc054f  fix: bug missing text_eos_id when text ends (pltobing committed Apr 27)
1a38298  docs: enhance README and docstrings (pltobing committed Apr 26)
f8a6b67  fix: codec decoder input structure and bugs (pltobing committed Apr 25)
d4e8429  fix: local talker & talker input structure (pltobing committed Apr 25)
5f75b85  fix: bind all inputs for codec decoder (pltobing committed Apr 24)
9fab525  fix: bind all inputs for all talker & local (pltobing committed Apr 23)
e320535  fix: dtype of `inv_freq` to float32 in the init. (pltobing committed Apr 22)
6797f9f  fix: make cos and sin for rope as input for talker (pltobing committed Apr 22)
622b7a5  fix: make attention_mask as input for talker (pltobing committed Apr 21)
f18f2db  refactor: use one .onnx model for local talker (pltobing committed Apr 21)
724c4f8  refactor: use one .onnx model for talker (pltobing committed Apr 21)
4ddf9d1  feat: modify codec decoder for cuda graph compat. (pltobing committed Apr 21)
1741312  refactor: CUDA graph compatibility with IOBinding (pltobing committed Apr 19)
4915823  fix: mel-spec compute bugs causes wrong identity (pltobing committed Apr 17)
8370970  feat: add n_iter, codec_decoder, latency log warm-up (pltobing committed Apr 16)
46dd791  feat: add warmup and latency stats (pltobing committed Apr 16)
3717103  Add files, models, and assets (pltobing committed Apr 12)