model details and training updates to readme
Browse files- README_ben.md +70 -3
README_ben.md
CHANGED
|
@@ -2,16 +2,83 @@
|
|
| 2 |
license: apache-2.0
|
| 3 |
datasets:
|
| 4 |
- eugenesiow/Div2k
|
| 5 |
-
- eugenesiow/Set5
|
| 6 |
language:
|
| 7 |
- en
|
| 8 |
tags:
|
| 9 |
- RyzenAI
|
| 10 |
-
-
|
| 11 |
- SISR
|
| 12 |
-
-
|
|
|
|
| 13 |
---
|
| 14 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 15 |
|
| 16 |
## Quantitative Analyses
|
| 17 |
|
|
|
|
| 2 |
license: apache-2.0
|
| 3 |
datasets:
|
| 4 |
- eugenesiow/Div2k
|
|
|
|
| 5 |
language:
|
| 6 |
- en
|
| 7 |
tags:
|
| 8 |
- RyzenAI
|
| 9 |
+
- Super Resolution
|
| 10 |
- SISR
|
| 11 |
+
- SESR
|
| 12 |
+
- ONNX
|
| 13 |
---
|
| 14 |
|
| 15 |
+
# 🚀 SESR-S on AMD AI PC NPU
|
| 16 |
+
|
| 17 |
+
[Bhardwaj et al. (2022)](https://arxiv.org/abs/2103.09404) introduced the Super-Efficient Super Resolution (SESR) model to solve a classic computer vision problem: to take a low-resolution input image and output a high-resolution image. The SESR model is based on a "linear overparameterization of CNNs and creates an efficient model architecture for [Single Image Super Resolution (SISR)]." The official code can be found at their accompanying GitHub repository: https://github.com/ARM-software/sesr. One of the main ideas behind the model was to make it very computationally efficient.
|
| 18 |
+
|
| 19 |
+
This version of the model is the SESR-S (Small) version; it has been converted from PyTorch format to ONNX, and then quantized to INT8 to run on an AMD AI PC NPU with Ryzen AI software. The model in its current form natively accepts a 256x256 RGB image and outputs a 512x512 RGB image; however, alternate versions of the model could accept 1920x1080 and upscale to 3840x2160 (4K) or 7680x4320 (8K).
|
| 20 |
+
|
| 21 |
+
|
| 22 |
+
| Model Details | Description |
|
| 23 |
+
| ----------- | ----------- |
|
| 24 |
+
| Person or organization developing model | [Tong Shen (AMD)](https://rocm.blogs.amd.com/authors/tong-shen.html), [Benjamin Consolvo (AMD)](https://huggingface.co/bconsolvo) |
|
| 25 |
+
| Model date | January 9, 2026 |
|
| 26 |
+
| Model version | 1 |
|
| 27 |
+
| Model type | Super-Resolution (Image-to-Image) |
|
| 28 |
+
| Information about training algorithms, parameters, fairness constraints or other applied approaches, and features | The \\(\times2\\) SESR was trained for "300 epochs using ADAM optimizer with a constant learning rate of \\(5 \times 10^{-4}\\) and a batch size of 32 on DIV2K training set." And the \\(\times4\\) SESR starts with the pretrained \\(\times2\\) SESR and replaces "the final layer of \\(5 \times 5 \times f \times 4\\) with a \\(5 \times 5 \times f \times 16\\) and then perform the depth-to-space operation twice" ([Bhardwaj et al., 2022](https://arxiv.org/abs/2103.09404)). |
|
| 29 |
+
| Paper or other resource for more information| [Bhardwaj, K., Milosavljevic, M., O'Neil, L., Gope, D., Matas, R., Chalfin, A., ... & Loh, D. (2022). Collapsible linear blocks for super-efficient super resolution. Proceedings of machine learning and systems, 4, 529-547](https://arxiv.org/abs/2103.09404) |
|
| 30 |
+
| License | [Apache 2.0](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/apache-2.0.md) |
|
| 31 |
+
| Where to send questions or comments about the model | [Community Tab](https://huggingface.co/amd/sesr/discussions) and [AMD Developer Community Discord](https://discord.gg/amd-dev)|
|
| 32 |
+
|
| 33 |
+
|
| 34 |
+
## ⚡ Intended Use
|
| 35 |
+
|
| 36 |
+
| Intended Use | Description |
|
| 37 |
+
| ----------- | ----------- |
|
| 38 |
+
| Primary intended uses | The model can be used to create high-resolution images from low-resolution images. The model has been converted to ONNX format and quantized for optimized performance on AMD AI PC NPUs. |
|
| 39 |
+
| Primary intended users | Anyone using or evaluating super-resolution models on AMD AI PCs. |
|
| 40 |
+
| Out-of-scope uses | This model is not intended for generating misinformation or disinformation, impersonating others, facilitating or inciting harassment or violence, any use that could lead to the violation of a human right. |
|
| 41 |
+
|
| 42 |
+
### How to Use
|
| 43 |
+
|
| 44 |
+
#### 📐 Hardware Prerequisites
|
| 45 |
+
|
| 46 |
+
Before getting started, make sure you meet the minimum hardware and OS requirements:
|
| 47 |
+
|
| 48 |
+
| Series | Codename | Abbreviation | Launch Year | Windows 11 | Linux |
|
| 49 |
+
|--------|----------|--------------|----------------|-------------|---------|
|
| 50 |
+
| Ryzen AI Max PRO 300 Series | Strix Halo | STX | 2025 | ☑️ | |
|
| 51 |
+
| Ryzen AI PRO 300 Series | Strix Point / Krackan Point | STX/KRK | 2025 | ☑️ | |
|
| 52 |
+
| Ryzen AI Max 300 Series | Strix Halo | STX | 2025 | ☑️ | |
|
| 53 |
+
| Ryzen AI 300 Series | Strix Point | STX | 2025 | ☑️ | |
|
| 54 |
+
| Ryzen Pro 200 Series | Hawk Point | HPT | 2025 | ☑️ | |
|
| 55 |
+
| Ryzen 200 Series | Hawk Point | HPT | 2025 | ☑️ | |
|
| 56 |
+
| Ryzen PRO 8000 Series | Hawk Point | HPT | 2024 | ☑️ | |
|
| 57 |
+
| Ryzen 8000 Series | Hawk Point | HPT | 2024 | ☑️ | |
|
| 58 |
+
| Ryzen Pro 7000 Series | Phoenix | PHX | 2023 | ☑️ | |
|
| 59 |
+
| Ryzen 7000 Series | Phoenix | PHX | 2023 | ☑️ | |
|
| 60 |
+
|
| 61 |
+
#### Getting Started
|
| 62 |
+
|
| 63 |
+
1. Follow the instructions here to download necessary NPU drivers and Ryzen AI software: [Ryzen AI SW Installation Instructions](https://ryzenai.docs.amd.com/en/latest/inst.html). Please allow for around **30 minutes** to install all of the necessary components of Ryzen AI SW. The tested working version as of writing is Ryzen AI 1.7.0.
|
| 64 |
+
|
| 65 |
+
2. Activate the previously installed conda environment from Ryzen AI (RAI) SW, and set the RAI environment variable to your installation path:
|
| 66 |
+
```powershell
|
| 67 |
+
conda activate ryzen-ai-1.7.0
|
| 68 |
+
$Env:RYZEN_AI_INSTALLATION_PATH = 'C:/Program Files/RyzenAI/1.7.0/'
|
| 69 |
+
```
|
| 70 |
+
3. Clone the Hugging Face model repository:
|
| 71 |
+
|
| 72 |
+
```powershell
|
| 73 |
+
git clone https://hf.co/amd/real-esrgan_npu
|
| 74 |
+
```
|
| 75 |
+
|
| 76 |
+
4. Install the prerequisites:
|
| 77 |
+
```powershell
|
| 78 |
+
pip install -r requirements.txt
|
| 79 |
+
```
|
| 80 |
+
|
| 81 |
+
|
| 82 |
|
| 83 |
## Quantitative Analyses
|
| 84 |
|