Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

scrappylabs
/
narrator-tts

Text-to-Speech
Transformers
Safetensors
Qwen3-TTS
English
text-generation
tts
finetune
voice-cloning
narrator
qwen
speech
audio
Model card Files Files and versions
xet
Community

Instructions to use scrappylabs/narrator-tts with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • Transformers

    How to use scrappylabs/narrator-tts with Transformers:

    # Use a pipeline as a high-level helper
    from transformers import pipeline
    
    pipe = pipeline("text-to-speech", model="scrappylabs/narrator-tts")
    # Load model directly
    from transformers import AutoModelForSeq2SeqLM
    model = AutoModelForSeq2SeqLM.from_pretrained("scrappylabs/narrator-tts", dtype="auto")
  • Qwen3-TTS

    How to use scrappylabs/narrator-tts with Qwen3-TTS:

    # pip install qwen-tts
    import torch
    import soundfile as sf
    from qwen_tts import Qwen3TTSModel
    
    model = Qwen3TTSModel.from_pretrained(
        "scrappylabs/narrator-tts",
        device_map="cuda:0",
        dtype=torch.bfloat16,
        attn_implementation="flash_attention_2",
    )
    
    wavs, sr = model.generate_custom_voice(
        text="Your text here.",
        language="English",
        speaker="Ryan",
        instruct="Speak in a natural tone.",
    )
    
    sf.write("output.wav", wavs[0], sr)
  • Notebooks
  • Google Colab
  • Kaggle
narrator-tts
4.52 GB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 8 commits
loserbcc's picture
loserbcc
Add audio samples to model card
82b3a6f verified 4 months ago
  • samples
    Add audio sample: samples/conversational.mp3 4 months ago
  • speech_tokenizer
    Initial upload: Narrator TTS finetune (Qwen3-TTS 1.7B based) 4 months ago
  • .gitattributes
    1.52 kB
    initial commit 4 months ago
  • README.md
    5.86 kB
    Add audio samples to model card 4 months ago
  • config.json
    4.53 kB
    Initial upload: Narrator TTS finetune (Qwen3-TTS 1.7B based) 4 months ago
  • generation_config.json
    245 Bytes
    Initial upload: Narrator TTS finetune (Qwen3-TTS 1.7B based) 4 months ago
  • merges.txt
    1.67 MB
    Initial upload: Narrator TTS finetune (Qwen3-TTS 1.7B based) 4 months ago
  • model.safetensors
    3.83 GB
    xet
    Initial upload: Narrator TTS finetune (Qwen3-TTS 1.7B based) 4 months ago
  • preprocessor_config.json
    127 Bytes
    Initial upload: Narrator TTS finetune (Qwen3-TTS 1.7B based) 4 months ago
  • tokenizer_config.json
    7.34 kB
    Initial upload: Narrator TTS finetune (Qwen3-TTS 1.7B based) 4 months ago
  • vocab.json
    2.78 MB
    Initial upload: Narrator TTS finetune (Qwen3-TTS 1.7B based) 4 months ago