Upload README.md with huggingface_hub
Browse files
README.md
ADDED
|
@@ -0,0 +1,34 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
# AI Upscale Models for FFMPEGA
|
| 2 |
+
|
| 3 |
+
Pre-trained super-resolution models for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA)'s AI Upscale feature.
|
| 4 |
+
|
| 5 |
+
Models are automatically downloaded on first use — no manual setup required.
|
| 6 |
+
|
| 7 |
+
## Models
|
| 8 |
+
|
| 9 |
+
| File | Architecture | Scale | Size | VRAM | Best For |
|
| 10 |
+
|---|---|---|---|---|---|
|
| 11 |
+
| `RealESRGAN_x4plus.pth` | RRDBNet (GAN) | 4× | 67 MB | ~2 GB | General real-world photos |
|
| 12 |
+
| `RealESRGAN_x4plus_anime_6B.pth` | RRDBNet (compact) | 4× | 18 MB | ~1 GB | Anime, cartoon, illustration |
|
| 13 |
+
| `Real_HAT_GAN_SRx4.pth` | HAT (GAN) | 4× | 170 MB | ~4 GB | **SOTA** — highest quality |
|
| 14 |
+
| `003_realSR_BSRGAN_DFOWMFC_s64w8_SwinIR-L_x4_GAN.pth` | SwinIR-Large | 4× | 142 MB | ~3 GB | Classical SR, clean images |
|
| 15 |
+
|
| 16 |
+
All models output 4× resolution. For 2× output, the upscaler runs at 4× then applies high-quality Lanczos downscaling.
|
| 17 |
+
|
| 18 |
+
## Usage in FFMPEGA
|
| 19 |
+
|
| 20 |
+
1. Set **llm_model** → `none`
|
| 21 |
+
2. Set **no_llm_mode** → `ai_upscale`
|
| 22 |
+
3. Choose **upscale_model** (e.g. `hat_x4` for best quality)
|
| 23 |
+
4. Choose **upscale_scale** (`4` or `2`)
|
| 24 |
+
5. Connect an image or video input and run
|
| 25 |
+
|
| 26 |
+
## Model Loading
|
| 27 |
+
|
| 28 |
+
Models are loaded via [spandrel](https://github.com/chaiNNer-org/spandrel), which auto-detects the architecture from the checkpoint file. No additional dependencies are needed beyond what ComfyUI already provides.
|
| 29 |
+
|
| 30 |
+
## Credits
|
| 31 |
+
|
| 32 |
+
- **Real-ESRGAN**: [xinntao/Real-ESRGAN](https://github.com/xinntao/Real-ESRGAN) — BSD-3-Clause
|
| 33 |
+
- **HAT**: [XPixelGroup/HAT](https://github.com/XPixelGroup/HAT) — MIT
|
| 34 |
+
- **SwinIR**: [JingyunLiang/SwinIR](https://github.com/JingyunLiang/SwinIR) — Apache 2.0
|