shadow-cann's picture
Add files using upload-large-folder tool
100694b verified
metadata
language:
  - zh
tags:
  - hisilicon
  - hispark
  - npu
  - openharmony
  - modelzoo
  - pytorch

FastSpeech2

FastSpeech2 是一种高效的端到端语音合成模型。相比 FastSpeech,FastSpeech2 引入了多尺度时长预测器和能量 / 基频预测分支,优化了时长预测模块并新增韵律特征建模,在合成速度和语音自然度上均有大幅提升。

Mirror Metadata

  • Hugging Face repo: shadow-cann/hispark-modelzoo-fastspeech2
  • Portal model id: ie2sc9g1qk00
  • Created at: 2026-01-08 16:18:57
  • Updated at: 2026-01-09 15:08:50
  • Category: 音频

Framework

  • PyTorch

Supported OS

  • OpenHarmony
  • Linux

Computing Power

  • Hi3403V100 SVP_NNN

Tags

  • 文本转语音

Detail Parameters

  • 输入: 1x40
  • 参数量: 35.266M
  • 计算量: 29.162GFLOPs

Files In This Repo

  • fastspeech_hifigan_en.onnx (源模型 / 源模型下载; 源模型 / 源模型元数据; 编译模型 / OM 元数据 / a16w8)

Upstream Links

Notes

  • This repository was mirrored from the HiSilicon Developer Portal model card and local downloads captured on 2026-03-27.
  • File ownership follows the portal card mapping, not just filename similarity.
  • Cover image: 1722265270026243_fastspeech2.jpg