sugarquark
/

u-shaped-masked-dit

Model card Files Files and versions

Masked model

U-shaped transformer model in CIELAB color space. The model reconstructs the input image.

LAB input, RGB output
8 channel latent

As in the image restoration model, only a random subset of the image patches are taught to the model, which shortens the learning time.

The upsample layers generate images (at different resolution):

heatmap from labels (as in CLIP retrieval)
RGB image
optional, edge detection

It prioritizes color accuracy.

Datasets

Pixiv_1024

References

Downloads last month: -; Downloads are not tracked for this model. How to track

Safetensors

Model size

0.3B params

Tensor type

F32

·