Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -1,3 +1,21 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
# AI Upscale Models for FFMPEGA
|
| 2 |
|
| 3 |
Pre-trained super-resolution models for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA)'s AI Upscale feature.
|
|
@@ -7,20 +25,20 @@ Models are automatically downloaded on first use — no manual setup required.
|
|
| 7 |
## Models
|
| 8 |
|
| 9 |
| File | Architecture | Scale | Size | VRAM | Best For |
|
| 10 |
-
|---|---|---|---|---|---|
|
| 11 |
| `RealESRGAN_x4plus.pth` | RRDBNet (GAN) | 4× | 67 MB | ~2 GB | General real-world photos |
|
| 12 |
| `RealESRGAN_x4plus_anime_6B.pth` | RRDBNet (compact) | 4× | 18 MB | ~1 GB | Anime, cartoon, illustration |
|
| 13 |
-
| `Real_HAT_GAN_SRx4.pth` | HAT (
|
| 14 |
-
| `003_realSR_BSRGAN_DFOWMFC_s64w8_SwinIR-L_x4_GAN.pth` | SwinIR-Large | 4× |
|
| 15 |
|
| 16 |
All models output 4× resolution. For 2× output, the upscaler runs at 4× then applies high-quality Lanczos downscaling.
|
| 17 |
|
| 18 |
## Usage in FFMPEGA
|
| 19 |
|
| 20 |
-
1. Set
|
| 21 |
-
2. Set
|
| 22 |
-
3. Choose
|
| 23 |
-
4. Choose
|
| 24 |
5. Connect an image or video input and run
|
| 25 |
|
| 26 |
## Model Loading
|
|
|
|
| 1 |
+
---
|
| 2 |
+
library_name: spandrel
|
| 3 |
+
license: other
|
| 4 |
+
license_name: mixed
|
| 5 |
+
license_link: LICENSE
|
| 6 |
+
tags:
|
| 7 |
+
- image-super-resolution
|
| 8 |
+
- super-resolution
|
| 9 |
+
- upscaling
|
| 10 |
+
- real-esrgan
|
| 11 |
+
- hat
|
| 12 |
+
- swinir
|
| 13 |
+
- comfyui
|
| 14 |
+
- ffmpega
|
| 15 |
+
- video-processing
|
| 16 |
+
pipeline_tag: image-to-image
|
| 17 |
+
---
|
| 18 |
+
|
| 19 |
# AI Upscale Models for FFMPEGA
|
| 20 |
|
| 21 |
Pre-trained super-resolution models for use with [ComfyUI-FFMPEGA](https://github.com/AEmotionStudio/ComfyUI-FFMPEGA)'s AI Upscale feature.
|
|
|
|
| 25 |
## Models
|
| 26 |
|
| 27 |
| File | Architecture | Scale | Size | VRAM | Best For |
|
| 28 |
+
|------|-------------|-------|------|------|----------|
|
| 29 |
| `RealESRGAN_x4plus.pth` | RRDBNet (GAN) | 4× | 67 MB | ~2 GB | General real-world photos |
|
| 30 |
| `RealESRGAN_x4plus_anime_6B.pth` | RRDBNet (compact) | 4× | 18 MB | ~1 GB | Anime, cartoon, illustration |
|
| 31 |
+
| `Real_HAT_GAN_SRx4.pth` | HAT (hybrid attention) | 4× | 170 MB | ~4 GB | SOTA quality, fine detail |
|
| 32 |
+
| `003_realSR_BSRGAN_DFOWMFC_s64w8_SwinIR-L_x4_GAN.pth` | SwinIR-Large | 4× | 48 MB | ~3 GB | Clean images, classical SR |
|
| 33 |
|
| 34 |
All models output 4× resolution. For 2× output, the upscaler runs at 4× then applies high-quality Lanczos downscaling.
|
| 35 |
|
| 36 |
## Usage in FFMPEGA
|
| 37 |
|
| 38 |
+
1. Set `llm_model` → `none`
|
| 39 |
+
2. Set `no_llm_mode` → `ai_upscale`
|
| 40 |
+
3. Choose `upscale_model` (e.g. `hat_x4` for best quality)
|
| 41 |
+
4. Choose `upscale_scale` (`4` or `2`)
|
| 42 |
5. Connect an image or video input and run
|
| 43 |
|
| 44 |
## Model Loading
|