Update metadata with huggingface_hub

010be78 verified 1 day ago

1.72 kB

license: mit
tags:
  - whisper
  - speech-recognition
  - transcription
  - comfyui
  - ffmpega

OpenAI Whisper Models

Mirror of OpenAI Whisper model weights in safetensors format, hosted by AEmotionStudio for use with ComfyUI-FFMPEGA.

Available Models

File	Size	Parameters
`tiny.safetensors`	~72 MB	39 M
`base.safetensors`	~139 MB	74 M
`small.safetensors`	~461 MB	244 M
`medium.safetensors`	~1.5 GB	769 M
`large-v3.safetensors`	~2.9 GB	1550 M

Original source: OpenAI Whisper (Azure CDN)

Format

Models are stored in .safetensors format for security. The dims configuration metadata is preserved in each file's safetensors header. ComfyUI-FFMPEGA automatically handles the conversion to .pt format locally when downloading from this mirror.

Usage

These models are automatically downloaded by ComfyUI-FFMPEGA when audio transcription or subtitle generation is used. No manual setup is required if allow_model_downloads is enabled on the FFMPEG Agent node.

Manual Installation

Download the desired .safetensors file from this repo
Convert to .pt or place in ComfyUI/models/whisper/
FFMPEGA will detect the local file and skip downloading

Why This Mirror?

This is a first-party mirror maintained by AEmotionStudio to ensure supply chain resilience. If OpenAI's CDN becomes unavailable, FFMPEGA will still be able to download models from this mirror.

License

OpenAI Whisper is released under the MIT license.