ResembleAI/Chatterbox-Multilingual-pt-pt


	---
	license: mit
	language:
	- pt
	tags:
	- chatterbox
	- text-to-speech
	- tts
	- multilingual
	- single-language-tts
	- voice-cloning
	- chatterbox-v3
	pipeline_tag: text-to-speech
	base_model: ResembleAI/chatterbox
	base_model_relation: finetune
	---

	<!-- chatterbox-space-link -->
	> 🎙️ Live demo: Try this model in the [`ResembleAI/Chatterbox-Multilingual-TTS-pt-pt`](https://huggingface.co/spaces/ResembleAI/Chatterbox-Multilingual-TTS-pt-pt) Space.
	<!-- chatterbox-space-link -->

	# Chatterbox Multilingual: Portuguese (Portugal)

	Chatterbox Multilingual: Portuguese (Portugal) is a dedicated single-language finetune in the Chatterbox Multilingual V3 Single Language Pack. It is optimized for Portuguese as spoken in Portugal, with language- and region-specific behavior for expressive text-to-speech and voice cloning.

	Use this model when you need tighter quality control for Portuguese (Portugal) than the broad multilingual checkpoint offers. For a single model that covers all supported languages, use [`ResembleAI/chatterbox`](https://huggingface.co/ResembleAI/chatterbox).

	## Demo

	Try the hosted demo Space: [`ResembleAI/Chatterbox-Multilingual-TTS-pt-pt`](https://huggingface.co/spaces/ResembleAI/Chatterbox-Multilingual-TTS-pt-pt).

	## Files

	- `t3_pt_pt.safetensors`: T3 state dict in safetensors format.
	- `s3gen_v3.pt` / `s3gen_v3.safetensors`: V3 S3Gen speech decoder checkpoint.
	- `grapheme_mtl_merged_expanded_v1.json`: multilingual tokenizer config.
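The files listed above can be fetched individually from the Hub. The following is a minimal sketch, assuming the `huggingface_hub` package is installed and that the repository ID is `ResembleAI/Chatterbox-Multilingual-pt-pt`; the helper name `download_assets` is hypothetical, and the safetensors variant of the S3Gen checkpoint is picked arbitrarily over the `.pt` one.

```python
REPO_ID = "ResembleAI/Chatterbox-Multilingual-pt-pt"

# File names as listed in the Files section; the safetensors S3Gen variant is chosen here.
FILES = (
    "t3_pt_pt.safetensors",
    "s3gen_v3.safetensors",
    "grapheme_mtl_merged_expanded_v1.json",
)


def download_assets(repo_id: str = REPO_ID, filenames=FILES) -> dict:
    """Download each file into the local Hugging Face cache.

    Returns a dict mapping file name to the local path of the cached copy.
    """
    # Imported lazily so this module can be imported without huggingface_hub installed.
    from huggingface_hub import hf_hub_download

    return {
        name: hf_hub_download(repo_id=repo_id, filename=name)
        for name in filenames
    }
```

Calling `download_assets()` triggers the actual downloads (the T3 checkpoint alone is about 2.1 GB) and returns the cached paths, which can then be handed to whatever loader you use.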

	## Language

	- Locale: `pt-PT`
	- Chatterbox language ID: `pt`

	## Checkpoint Metadata

	- Source step: `137700`
	- Source checkpoint: `t3_137700.pth.tar`
	- Tensor count: `292`
	- Dtype: `float32`
	- Text embedding shape: `(2454, 1024)`
	- Speech embedding shape: `(8194, 1024)`
	- Size: `2143990296` bytes
	- SHA256: `547c6e734908621badc806f5b56d773f05271b4ae653ae448c5d068e94de12db`
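The size and SHA256 values above can be used to confirm that a download is intact. The sketch below uses only the Python standard library; it assumes the listed size and digest refer to `t3_pt_pt.safetensors`, which the card does not state explicitly, and the function names are illustrative.

```python
import hashlib
from pathlib import Path

# Values copied from the Checkpoint Metadata list.
# Assumption: they describe t3_pt_pt.safetensors.
EXPECTED_SIZE = 2143990296
EXPECTED_SHA256 = "547c6e734908621badc806f5b56d773f05271b4ae653ae448c5d068e94de12db"


def sha256_of(path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in 1 MiB chunks so a multi-gigabyte checkpoint never sits fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_checkpoint(path, expected_size: int = EXPECTED_SIZE,
                      expected_sha256: str = EXPECTED_SHA256) -> bool:
    """Return True only if both the byte size and the SHA256 digest match."""
    p = Path(path)
    # Cheap size check first; skip hashing when the size is already wrong.
    if p.stat().st_size != expected_size:
        return False
    return sha256_of(p) == expected_sha256
```

For example, `verify_checkpoint("t3_pt_pt.safetensors")` should return `True` for an uncorrupted copy if the assumption about which file the metadata describes holds.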

	## Loader Notes

	This repository contains Chatterbox Multilingual V3 single-language assets used by the linked demo Space. The T3 checkpoint is loaded with a multilingual text vocabulary of size `2454` and an S3 speech vocabulary of size `8194`, matching the first dimensions of the embedding shapes listed under Checkpoint Metadata.

	The demo combines these model-specific assets with the shared Chatterbox inference code and companion assets needed for end-to-end speech generation.
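The vocabulary sizes in the loader notes can be checked without loading any weights, because a safetensors file begins with an 8-byte little-endian header length followed by a JSON header describing every tensor. The sketch below reads that header with the standard library only. The tensor names inside the checkpoint are not documented here, so the check matches embeddings by shape rather than by name; `check_t3_header` and its result keys are hypothetical.

```python
import json
import struct


def read_safetensors_header(path) -> dict:
    """Return the per-tensor JSON header of a safetensors file without reading tensor data."""
    with open(path, "rb") as f:
        # First 8 bytes: unsigned little-endian 64-bit length of the JSON header.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    # Optional free-form metadata entry; it is not a tensor.
    header.pop("__metadata__", None)
    return header


def check_t3_header(header: dict, text_vocab: int = 2454, speech_vocab: int = 8194,
                    dim: int = 1024, tensor_count: int = 292) -> dict:
    """Compare a header against the values stated in this model card."""
    shapes = [tuple(entry["shape"]) for entry in header.values()]
    dtypes = {entry["dtype"] for entry in header.values()}
    return {
        "tensor_count_ok": len(header) == tensor_count,
        "all_float32": dtypes == {"F32"},
        # Matched by shape only, since tensor names are an unknown here.
        "has_text_embedding": (text_vocab, dim) in shapes,
        "has_speech_embedding": (speech_vocab, dim) in shapes,
    }
```

Running `check_t3_header(read_safetensors_header("t3_pt_pt.safetensors"))` gives a quick sanity check before handing the file to the shared Chatterbox inference code; a `False` entry suggests the file is not the checkpoint described above.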