WitoldG
/

polish_piper_models

Model card Files Files and versions

polish_piper_models / README.md

WitoldG's picture

Update README.md

ed4cda3 verified over 1 year ago

|

history blame contribute delete

1.33 kB

	List of several Polish voice models for piper
	===

	All models were trained on the RTX4090 graphics card. Datasets for the indicated models can be found in another repository. 1600-2000 samples were used to generate the models. Generated sample texts read by the included models are also included.


	How to use models?
	---

	`pip install piper-tts`

	`echo 'Witamy w świecie syntezy mowy!' \| piper --model ./pl_PL-jarvis_wg_glos-medium.onnx --config ./pl_PL-jarvis_wg_glos-medium.onnx.json --output_file witaj.wav`


	How to use models in MacOS:
	---

	```
	pip install piper-phonemize-cross
	pip install piper-tts --no-deps
	pip install onnxruntime
	```

	`echo 'Witamy w świecie syntezy mowy!' \| piper --model ./pl_PL-meski_wg_glos-medium.onnx --config ./pl_PL-meski_wg_glos-medium.onnx.json --output_file witaj.wav`

	Info
	---

	All models was tuning from file `epoch=2164-step=1355540.ckpt` (https://huggingface.co/datasets/rhasspy/piper-checkpoints/resolve/main/en/en_US/lessac/medium/epoch%3D2164-step%3D1355540.ckpt) and tuning ware taken around 10h per voice.

	pl_PL-jarvis_wg_glos-medium: `epoch=2499-step=1395740.ckpt`

	pl_PL-justyna_wg_glos-medium: `epoch=2499-step=1387030.ckpt`

	pl_PL-meski_wg_glos-medium: `epoch=4449-step=1593180.ckpt`

	pl_PL-zenski_wg_glos-medium: `epoch=4949-step=1645180.ckpt`


	---
	license: mit
	---