update readme
Browse files
README.md
CHANGED
|
@@ -7,6 +7,15 @@ This repository contains the model weights and configuration files for the [**Sp
|
|
| 7 |
> [!Note]
|
| 8 |
> These model weights have been **reproduced** with the released code and yield slightly different evaluation results compared to those reported in the original paper.
|
| 9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 10 |
Download model checkpoints and put them in `./workspace/experiments`.
|
| 11 |
The directory tree should look like this:
|
| 12 |
|
|
@@ -19,3 +28,71 @@ The directory tree should look like this:
|
|
| 19 |
|── sphere-l-imagenet
|
| 20 |
|── sphere-xl-imagenet
|
| 21 |
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
> [!Note]
|
| 8 |
> These model weights have been **reproduced** with the released code and yield slightly different evaluation results compared to those reported in the original paper.
|
| 9 |
|
| 10 |
+
## Model Card
|
| 11 |
+
|
| 12 |
+
| dataset | :hugs: hf model repo | params |
|
| 13 |
+
|:--:|:--:|:--:|
|
| 14 |
+
| Animal-Faces | [`sphere-l-af`](sphere-l-af) | 642M |
|
| 15 |
+
| Oxford-Flowers | [`sphere-l-of`](sphere-l-of) | 948M |
|
| 16 |
+
| ImageNet | [`sphere-l-imagenet`](sphere-l-imagenet) | 950M |
|
| 17 |
+
| ImageNet | [`sphere-xl-imagenet`](sphere-xl-imagenet) | 1.3B |
|
| 18 |
+
|
| 19 |
Download model checkpoints and put them in `./workspace/experiments`.
|
| 20 |
The directory tree should look like this:
|
| 21 |
|
|
|
|
| 28 |
|── sphere-l-imagenet
|
| 29 |
|── sphere-xl-imagenet
|
| 30 |
```
|
| 31 |
+
|
| 32 |
+
## Evaluation Results
|
| 33 |
+
|
| 34 |
+
Evaluate **ImageNet** models with `CFG = 1.4`:
|
| 35 |
+
|
| 36 |
+
```bash
|
| 37 |
+
# --job_dir can be
|
| 38 |
+
# sphere-l-imagenet, or sphere-xl-imagenet
|
| 39 |
+
|
| 40 |
+
./run.sh eval.py \
|
| 41 |
+
--job_dir sphere-xl-imagenet \
|
| 42 |
+
--forward_steps 1 4 \
|
| 43 |
+
--report_fid rfid gfid \
|
| 44 |
+
--use_cfg True \
|
| 45 |
+
--cfg_min 1.4 \
|
| 46 |
+
--cfg_max 1.4 \
|
| 47 |
+
--cfg_position combo \
|
| 48 |
+
--rm_folder_after_eval True
|
| 49 |
+
```
|
| 50 |
+
|
| 51 |
+
The evaluation results will be saved in `./workspace/experiments/sphere-xl-imagenet/eval/`:
|
| 52 |
+
|
| 53 |
+
| dataset | model | steps | rFID ↓ | gFID ↓ | IS ↑ |
|
| 54 |
+
|:--:|:--|:--:|:--:|--:|:--:|
|
| 55 |
+
ImageNet 256x256 | Sphere-L | 1 | 0.62 | 15.69 | 274.5 |
|
| 56 |
+
|| Sphere-L | 4 | - | 4.78 | 259.1 |
|
| 57 |
+
|| Sphere-XL | 1 | 0.62 | 14.52 | 299.3 |
|
| 58 |
+
|| Sphere-XL | 4 | - | 4.05 | 266.0 |
|
| 59 |
+
|
| 60 |
+
Evaluate unconditional **Animal-Faces** model:
|
| 61 |
+
|
| 62 |
+
```bash
|
| 63 |
+
./run.sh eval.py \
|
| 64 |
+
--job_dir sphere-l-af \
|
| 65 |
+
--forward_steps 1 4 \
|
| 66 |
+
--report_fid gfid \
|
| 67 |
+
--rm_folder_after_eval True
|
| 68 |
+
```
|
| 69 |
+
|
| 70 |
+
| dataset | model | steps | rFID ↓ | gFID ↓ | IS ↑ |
|
| 71 |
+
|:--:|:--|:--:|:--:|:--:|:--:|
|
| 72 |
+
Animal-Faces 256x256 | Sphere-L | 1 | - | 21.56 | 8.3 |
|
| 73 |
+
|| Sphere-L | 4 | - | 18.73 | 9.8 |
|
| 74 |
+
|
| 75 |
+
Evaluate **Oxford-Flowers** model with `CFG = 1.4`:
|
| 76 |
+
|
| 77 |
+
```bash
|
| 78 |
+
./run.sh eval.py \
|
| 79 |
+
--job_dir sphere-l-of \
|
| 80 |
+
--forward_steps 1 4 \
|
| 81 |
+
--report_fid gfid \
|
| 82 |
+
--use_cfg True \
|
| 83 |
+
--cfg_min 1.6 \
|
| 84 |
+
--cfg_max 1.6 \
|
| 85 |
+
--cfg_position combo \
|
| 86 |
+
--num_eval_samples 51000 \
|
| 87 |
+
--rm_folder_after_eval True \
|
| 88 |
+
--cache_sampling_noise False \
|
| 89 |
+
```
|
| 90 |
+
|
| 91 |
+
`--num_eval_samples = 51000` are set for 102 classes such that each class has 500 samples for evaluation on 8 gpus.
|
| 92 |
+
Adjust them accordingly if you have different number of gpus or want to evaluate on different number of samples.
|
| 93 |
+
|
| 94 |
+
| dataset | model | steps | rFID ↓ | gFID ↓ | IS ↑ |
|
| 95 |
+
|:--:|:--|:--:|:--:|:--:|:--:|
|
| 96 |
+
| Oxford-Flowers 256x256 | Sphere-L | 1 | - | 25.10 | 3.4 |
|
| 97 |
+
|| Sphere-L | 4 | - | 11.27 | 3.2 |
|
| 98 |
+
|