	---
	language:
	- zh
	tags:
	- hisilicon
	- hispark
	- npu
	- openharmony
	- modelzoo
	- pytorch
	---

	# FastSpeech2

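	FastSpeech2 is an efficient end-to-end speech synthesis model. Compared with FastSpeech, it introduces a multi-scale duration predictor and energy / pitch prediction branches, refines the duration prediction module, and adds prosody feature modeling, which substantially improves both synthesis speed and speech naturalness.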

	## Mirror Metadata

	- Hugging Face repo: shadow-cann/hispark-modelzoo-fastspeech2
	- Portal model id: ie2sc9g1qk00
	- Created at: 2026-01-08 16:18:57
	- Updated at: 2026-01-09 15:08:50
	- Category: Audio

	## Framework

	- PyTorch

	## Supported OS

	- OpenHarmony
	- Linux

	## Computing Power

	- Hi3403V100 SVP_NNN

	## Tags

	- Text-to-speech

	## Detail Parameters

	- Input: 1x40
	- Parameters: 35.266M
	- Compute: 29.162 GFLOPs
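
The `1x40` input shape suggests a fixed-length sequence with a batch size of 1. The following is a minimal sketch of fitting a variable-length ID sequence to that shape. It assumes the input is a sequence of integer token (phoneme) IDs and that `0` is the padding ID; the portal card confirms neither, and `fit_to_input` is a hypothetical helper, not part of the upstream sample.

```python
from typing import List, Sequence

INPUT_LEN = 40  # from the card: input shape 1x40
PAD_ID = 0      # assumption: the padding ID is not documented on the card


def fit_to_input(
    ids: Sequence[int], length: int = INPUT_LEN, pad_id: int = PAD_ID
) -> List[List[int]]:
    """Truncate or right-pad `ids` to `length`, then add a batch dimension of 1."""
    clipped = list(ids[:length])
    padded = clipped + [pad_id] * (length - len(clipped))
    return [padded]


batch = fit_to_input([12, 7, 33])
print(len(batch), len(batch[0]))
```

Check the upstream repository's README for the actual tokenization, data type, and padding convention before feeding data to the model.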

	## Files In This Repo

	- fastspeech_hifigan_en.onnx (source model / source model download; source model / source model metadata; compiled model / OM metadata / a16w8)
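
As a rough size check, the parameter count listed under Detail Parameters can be turned into an approximate weight footprint. The sketch below assumes that `a16w8` follows the common quantization naming of 16-bit activations and 8-bit weights, which the card does not spell out. `weight_megabytes` is a hypothetical helper, and the estimate ignores file metadata and any layers left unquantized.

```python
PARAMS = 35.266e6  # from the card: 35.266M parameters


def weight_megabytes(num_params: float, bits_per_weight: int) -> float:
    """Approximate weight storage in MB (10**6 bytes) at a given bit width."""
    return num_params * bits_per_weight / 8 / 1e6


for bits in (32, 16, 8):
    print(f"{bits}-bit weights: ~{weight_megabytes(PARAMS, bits):.1f} MB")
```

Under these assumptions, 8-bit weights take roughly a quarter of the space of 32-bit floating-point weights.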

	## Upstream Links

	- Portal card: https://gitbubble.github.io/hisilicon-developer-portal-mirror/model-detail.html?id=ie2sc9g1qk00
	- Upstream repository: https://gitee.com/HiSpark/modelzoo/blob/master/samples/built-in/audio/FastSpeech2/README.md
	- License reference: https://github.com/ming024/FastSpeech2/blob/master/LICENSE

	## Notes

	- This repository was mirrored from the HiSilicon Developer Portal model card and local downloads captured on 2026-03-27.
	- File ownership follows the portal card mapping, not just filename similarity.
	- Cover image: 1722265270026243_fastspeech2.jpg