third-eye-models / README.md
Jacid23's picture
Upload README.md with huggingface_hub
30d6a69 verified
---
license: cc-by-nc-sa-4.0
tags:
- third-eye
- image-restoration
- video-restoration
- super-resolution
- denoise
- deblur
- colorization
- frame-interpolation
library_name: pytorch
---
# Third Eye - Model Weights Bundle
Reliable mirror of AI model weights used by [Third Eye](https://github.com/Jacid23/Third_Eye), a media organizer with hidden editing dimensions.
These weights are downloaded automatically by `scripts/fetch_models.py` during installation. Self-hosting them here removes dependency on upstream Google Drive links and unreliable community mirrors.
## License
The bundle is tagged **CC BY-NC-SA 4.0** β€” the most restrictive license among the included models. By using these weights you agree to:
- **Non-commercial use only**
- Provide **attribution** to the original authors (listed below)
- Distribute any derivatives under the **same license**
## Files and Attribution
Every weight in this repo is a verbatim copy of the file released by its original author. Original sources and licenses below.
### Denoise / Deblur (NAFNet)
- `NAFNet-SIDD-width64.pth` β€” denoise model (SIDD dataset)
- `NAFNet-REDS-width64.pth` β€” deblur model (REDS dataset)
- `NAFNet-GoPro-width64.pth` β€” deblur model (GoPro dataset, alternative to REDS)
**Authors:** Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, Jian Sun (Megvii Research)
**Upstream:** https://github.com/megvii-research/NAFNet
**License:** MIT
**Paper:** "Simple Baselines for Image Restoration" (ECCV 2022)
### Frame Interpolation (RIFE)
- `flownet.pkl` β€” RIFE 4.6 weights
**Authors:** Zhewei Huang et al. (Practical-RIFE team)
**Upstream:** https://github.com/hzwer/Practical-RIFE
**License:** MIT (code) / non-commercial (weights, per author note)
**Paper:** "Real-Time Intermediate Flow Estimation for Video Frame Interpolation"
### Community RRDBNet Upscale Models
**4x variants:**
- `4x-UltraSharp.pth` β€” community upscale model by Kim2091
- `foolhardy_Remacri.pth` β€” community model by foolhardy
- `RealisticRescaler_100000_G.pth` β€” community upscale model
- `4x-UniScale-Balanced-72000g.pth` β€” UniScale community variant
- `4x-UniScale-Strong-42400g.pth` β€” UniScale community variant
- `4xJaypeg90.pth` β€” JPEG-focused 4x cleanup upscaler
- `4xLSDIRplus.pth` β€” LSDIR dataset upscaler
- `4xLSDIRplusR.pth` β€” LSDIR refined variant
- `CountryRoads_377000_G.pth` β€” general-purpose community upscaler
- `NMKD-Superscale-SP_178000_G.pth` β€” NMKD standard print
- `NMKDSuperscale_Artisoft_120000_G.pth` β€” NMKD artistic-soft
- `A_ESRGAN_Single.pth` β€” A-ESRGAN single-pass
- `Filmify4K_v2_325000_G.pth` β€” film-look upscaler
**8x variants:**
- `8x_NMKD-Superscale_150000_G.pth` β€” NMKD general 8x
- `8x_NMKD-Typescale_175k.pth` β€” NMKD optimised for text/UI
- `TGHQFace8x_500k.pth` β€” face-specific 8x
**1x detail enhancers:**
- `x1_ITF_SkinDiffDetail_Lite_v1.pth` β€” skin texture enhancement
**Upstream catalog:** https://openmodeldb.info/
**License:** CC BY-NC-SA 4.0 (community convention for ESRGAN-derived models)
Architecture is RRDBNet from Real-ESRGAN. Original Real-ESRGAN architecture:
- **Authors:** Xintao Wang et al. (Tencent ARC Lab)
- **Upstream:** https://github.com/xinntao/Real-ESRGAN
- **License:** BSD-3-Clause
### SwinIR (Swin Transformer Image Restoration)
Initial set wired through the engine:
- `classicalSR_DF2K_s64w8_SwinIR-M_x4.pth` β€” classical 4x super-resolution
- `classicalSR_DF2K_s64w8_SwinIR-M_x2.pth` β€” classical 2x super-resolution
- `lightweightSR_DIV2K_s64w8_SwinIR-S_x4.pth` β€” lightweight 4x (smaller/faster)
- `realSR_BSRGAN_DFOWMFC_s64w8_SwinIR-L_x4_GAN.pth` β€” real-world 4x (BSRGAN-trained GAN)
- `colorCAR_DFWB_s126w7_SwinIR-M_jpeg40.pth` β€” JPEG artifact removal (qfβ‰ˆ40)
- `colorDN_DFWB_s128w8_SwinIR-M_noise25.pth` β€” color denoise (sigma=25)
**Authors:** Jingyun Liang, Jiezhang Cao, Guolei Sun, Kai Zhang, Luc Van Gool, Radu Timofte
**Upstream:** https://github.com/JingyunLiang/SwinIR
**License:** Apache 2.0
**Paper:** "SwinIR: Image Restoration Using Swin Transformer" (ICCVW 2021)
Additional SwinIR checkpoints (JPEG qf=10/20/30, noise=15/50, grayscale variants, x3, x8) are available from the upstream releases and can be wired with one MODEL_CONFIGS entry each β€” the architecture supports all of them.
### Transformer Upscale Models (DAT / HAT-L / DRCT-L)
- `4xFFHQDAT.pth` β€” DAT architecture, trained on FFHQ
- `4xFaceUpSharpDAT.pth` β€” DAT, face sharpener
- `4xLSDIRDAT.pth` β€” DAT, LSDIR dataset
- `4xNomos8kHAT-L_otf.pth` β€” HAT-L architecture
- `4xNomos2_hq_drct-l.pth` β€” DRCT-L architecture
**Upstream catalog:** https://openmodeldb.info/
**License:** CC BY-NC-SA 4.0 (community convention)
These are mirrored here for download convenience, but Third Eye's engine does not yet implement the DAT, HAT-L, or DRCT-L architectures. They will be wired up in a future engine update.
Original transformer architecture papers:
- **DAT:** "Dual Aggregation Transformer for Image Super-Resolution" (ICCV 2023)
- **HAT:** "Activating More Pixels in Image Super-Resolution Transformer" (CVPR 2023)
- **DRCT:** "DRCT: Saving Image Super-Resolution away from Information Bottleneck"
## Usage
Download programmatically via the Third Eye installer:
```bat
install.bat
```
Or directly:
```bash
wget https://huggingface.co/Jacid23/third-eye-models/resolve/main/NAFNet-SIDD-width64.pth
```
## Source Code
Third Eye source: https://github.com/Jacid23/Third_Eye
Model download script: `scripts/fetch_models.py`
## Acknowledgements
All credit for the models goes to their original authors and research teams. This repository exists only to provide reliable download mirrors for an open-source application that integrates these models. No modifications have been made to any weight file.