Jacid23 commited on
Commit
956df8e
Β·
verified Β·
1 Parent(s): 23a4a45

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +41 -5
README.md CHANGED
@@ -32,8 +32,9 @@ Every weight in this repo is a verbatim copy of the file released by its origina
32
 
33
  ### Denoise / Deblur (NAFNet)
34
 
35
- - `NAFNet-SIDD-width64.pth` β€” denoise model
36
- - `NAFNet-REDS-width64.pth` β€” deblur model
 
37
 
38
  **Authors:** Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, Jian Sun (Megvii Research)
39
  **Upstream:** https://github.com/megvii-research/NAFNet
@@ -49,13 +50,30 @@ Every weight in this repo is a verbatim copy of the file released by its origina
49
  **License:** MIT (code) / non-commercial (weights, per author note)
50
  **Paper:** "Real-Time Intermediate Flow Estimation for Video Frame Interpolation"
51
 
52
- ### Community Upscale Models
53
 
 
54
  - `4x-UltraSharp.pth` β€” community upscale model by Kim2091
55
  - `foolhardy_Remacri.pth` β€” community model by foolhardy
56
  - `RealisticRescaler_100000_G.pth` β€” community upscale model
57
- - `4x-UniScale-Balanced [72000g].pth` β€” UniScale community variant
58
- - `4x-UniScale-Strong [42400g].pth` β€” UniScale community variant
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
59
 
60
  **Upstream catalog:** https://openmodeldb.info/
61
  **License:** CC BY-NC-SA 4.0 (community convention for ESRGAN-derived models)
@@ -65,6 +83,24 @@ Architecture is RRDBNet from Real-ESRGAN. Original Real-ESRGAN architecture:
65
  - **Upstream:** https://github.com/xinntao/Real-ESRGAN
66
  - **License:** BSD-3-Clause
67
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
68
  ## Usage
69
 
70
  Download programmatically via the Third Eye installer:
 
32
 
33
  ### Denoise / Deblur (NAFNet)
34
 
35
+ - `NAFNet-SIDD-width64.pth` β€” denoise model (SIDD dataset)
36
+ - `NAFNet-REDS-width64.pth` β€” deblur model (REDS dataset)
37
+ - `NAFNet-GoPro-width64.pth` β€” deblur model (GoPro dataset, alternative to REDS)
38
 
39
  **Authors:** Liangyu Chen, Xiaojie Chu, Xiangyu Zhang, Jian Sun (Megvii Research)
40
  **Upstream:** https://github.com/megvii-research/NAFNet
 
50
  **License:** MIT (code) / non-commercial (weights, per author note)
51
  **Paper:** "Real-Time Intermediate Flow Estimation for Video Frame Interpolation"
52
 
53
+ ### Community RRDBNet Upscale Models
54
 
55
+ **4x variants:**
56
  - `4x-UltraSharp.pth` β€” community upscale model by Kim2091
57
  - `foolhardy_Remacri.pth` β€” community model by foolhardy
58
  - `RealisticRescaler_100000_G.pth` β€” community upscale model
59
+ - `4x-UniScale-Balanced-72000g.pth` β€” UniScale community variant
60
+ - `4x-UniScale-Strong-42400g.pth` β€” UniScale community variant
61
+ - `4xJaypeg90.pth` β€” JPEG-focused 4x cleanup upscaler
62
+ - `4xLSDIRplus.pth` β€” LSDIR dataset upscaler
63
+ - `4xLSDIRplusR.pth` β€” LSDIR refined variant
64
+ - `CountryRoads_377000_G.pth` β€” general-purpose community upscaler
65
+ - `NMKD-Superscale-SP_178000_G.pth` β€” NMKD standard print
66
+ - `NMKDSuperscale_Artisoft_120000_G.pth` β€” NMKD artistic-soft
67
+ - `A_ESRGAN_Single.pth` β€” A-ESRGAN single-pass
68
+ - `Filmify4K_v2_325000_G.pth` β€” film-look upscaler
69
+
70
+ **8x variants:**
71
+ - `8x_NMKD-Superscale_150000_G.pth` β€” NMKD general 8x
72
+ - `8x_NMKD-Typescale_175k.pth` β€” NMKD optimised for text/UI
73
+ - `TGHQFace8x_500k.pth` β€” face-specific 8x
74
+
75
+ **1x detail enhancers:**
76
+ - `x1_ITF_SkinDiffDetail_Lite_v1.pth` β€” skin texture enhancement
77
 
78
  **Upstream catalog:** https://openmodeldb.info/
79
  **License:** CC BY-NC-SA 4.0 (community convention for ESRGAN-derived models)
 
83
  - **Upstream:** https://github.com/xinntao/Real-ESRGAN
84
  - **License:** BSD-3-Clause
85
 
86
+ ### Transformer Upscale Models (DAT / HAT-L / DRCT-L)
87
+
88
+ - `4xFFHQDAT.pth` β€” DAT architecture, trained on FFHQ
89
+ - `4xFaceUpSharpDAT.pth` β€” DAT, face sharpener
90
+ - `4xLSDIRDAT.pth` β€” DAT, LSDIR dataset
91
+ - `4xNomos8kHAT-L_otf.pth` β€” HAT-L architecture
92
+ - `4xNomos2_hq_drct-l.pth` β€” DRCT-L architecture
93
+
94
+ **Upstream catalog:** https://openmodeldb.info/
95
+ **License:** CC BY-NC-SA 4.0 (community convention)
96
+
97
+ These are mirrored here for download convenience, but Third Eye's engine does not yet implement the DAT, HAT-L, or DRCT-L architectures. They will be wired up in a future engine update.
98
+
99
+ Original transformer architecture papers:
100
+ - **DAT:** "Dual Aggregation Transformer for Image Super-Resolution" (ICCV 2023)
101
+ - **HAT:** "Activating More Pixels in Image Super-Resolution Transformer" (CVPR 2023)
102
+ - **DRCT:** "DRCT: Saving Image Super-Resolution away from Information Bottleneck"
103
+
104
  ## Usage
105
 
106
  Download programmatically via the Third Eye installer: