jwheo committed on
Commit fa3fe84 · verified · 1 Parent(s): 30d3936

Upload Stage 1 and Stage 2 checkpoints

Files changed (1)
  1. README.md +39 -0
README.md ADDED
@@ -0,0 +1,39 @@
+ ---
+ library_name: pytorch
+ license: other
+ tags:
+ - super-resolution
+ - latent-diffusion
+ - pytorch
+ - rocm
+ - research
+ ---
+
+ # sr-diffusion
+
+ Research checkpoint storage for the `sr-diffusion` project.
+
+ GitHub: https://github.com/BitIntx/sr-diffusion
+
+ This project trains a vision-only x4 latent diffusion super-resolution pipeline
+ from scratch. It does not use a pretrained text-to-image diffusion model.
+
+ Current artifacts are study/research checkpoints. They are not a production SR
+ model and are not intended for commercial use. Dataset license constraints
+ should be reviewed before redistributing derived weights publicly.
+
+ ## Artifacts
+
+ | Path | Source | SHA256 |
+ | --- | --- | --- |
+ | `checkpoints/stage1_autoencoder_best_eval_recon.pt` | `best_eval_recon.pt` | `6f01ec6d1ded` |
+ | `configs/autoencoder_photo10k.yaml` | `autoencoder_photo10k.yaml` | `b1743ae89efc` |
+ | `checkpoints/stage2_latent_pretrain_step_0004000.pt` | `step_0004000.pt` | `0bf569e909d5` |
+ | `configs/latent_pretrain_photo10k.yaml` | `latent_pretrain_photo10k.yaml` | `85175fa83f41` |
+ | `metrics/stage2_step_0004000_metrics.json` | `step_0004000_metrics.json` | `189c98b4c35c` |
+
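The SHA256 column appears to list only the first 12 hex characters of each digest, so a downloaded file can be checked by prefix rather than by full hash. The following is a minimal sketch of such a check, not part of the project itself: the helper names `sha256_hex` and `verify_artifacts` are hypothetical, and the sketch assumes the files sit under a local directory that mirrors the paths in the table.

```python
import hashlib
from pathlib import Path

# Truncated digests copied from the Artifacts table (12 hex characters each).
EXPECTED_PREFIXES = {
    "checkpoints/stage1_autoencoder_best_eval_recon.pt": "6f01ec6d1ded",
    "configs/autoencoder_photo10k.yaml": "b1743ae89efc",
    "checkpoints/stage2_latent_pretrain_step_0004000.pt": "0bf569e909d5",
    "configs/latent_pretrain_photo10k.yaml": "85175fa83f41",
    "metrics/stage2_step_0004000_metrics.json": "189c98b4c35c",
}


def sha256_hex(path, chunk_size=1 << 20):
    """Stream a file through SHA-256 so large checkpoints are never fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_artifacts(root, expected=EXPECTED_PREFIXES):
    """Return {relative_path: status}, where status is 'ok', 'mismatch', or 'missing'."""
    results = {}
    for rel_path, prefix in expected.items():
        file_path = Path(root) / rel_path
        if not file_path.is_file():
            results[rel_path] = "missing"
        elif sha256_hex(file_path).startswith(prefix):
            results[rel_path] = "ok"
        else:
            results[rel_path] = "mismatch"
    return results
```

Calling `verify_artifacts(".")` from the root of a local copy of the repository would report one status per table row. A 12-character prefix is enough to catch accidental corruption, but it is weaker than comparing full digests.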
+ ## Stages
+
+ - Stage 1: factor-4 VAE / Autoencoder over 512px HR crops.
+ - Stage 2: deterministic LR-to-HR-latent pretraining with the Stage 1 VAE frozen.
+ - Stage 3: conditional latent diffusion U-Net, planned.
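
The spatial sizes implied by the stage list can be worked out with a little arithmetic. This sketch rests on two assumptions the README does not confirm: that "factor-4" means the autoencoder downsamples each spatial dimension by 4, and that LR inputs are the 512px HR crops downscaled by the x4 super-resolution factor. The function `stage_shapes` is a hypothetical helper, and channel counts are left out because they are not stated.

```python
def stage_shapes(hr_crop=512, sr_scale=4, ae_factor=4):
    """Spatial sizes (height, width) under the assumptions named above."""
    if hr_crop % sr_scale or hr_crop % ae_factor:
        raise ValueError("hr_crop must be divisible by both factors")
    return {
        "hr_image": (hr_crop, hr_crop),
        # Assumption: LR input is the HR crop downscaled by the SR factor.
        "lr_image": (hr_crop // sr_scale, hr_crop // sr_scale),
        # Assumption: the autoencoder shrinks each spatial side by ae_factor.
        "hr_latent": (hr_crop // ae_factor, hr_crop // ae_factor),
    }


shapes = stage_shapes()
print(shapes)
# {'hr_image': (512, 512), 'lr_image': (128, 128), 'hr_latent': (128, 128)}
```

If both assumptions hold, the LR image and the HR latent share the same 128x128 grid, so the Stage 2 mapping would preserve spatial resolution.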