Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
taeyoun811
/
whisfusion
like
2
Automatic Speech Recognition
English
whisfusion
audio
asr
whisper
diffusion
arxiv:
2508.07048
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
whisfusion
1.22 GB
1 contributor
History:
2 commits
taeyoun811
Upload Whisfusion model
032cb52
verified
5 months ago
.gitattributes
1.52 kB
initial commit
5 months ago
README.md
1.25 kB
Upload Whisfusion model
5 months ago
config.json
459 Bytes
Upload Whisfusion model
5 months ago
whisfusion_stage1_adapter.pt
170 MB
xet
Upload Whisfusion model
5 months ago
whisfusion_stage2_decoder.pt
1.05 GB
xet
Upload Whisfusion model
5 months ago