YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing

Project Page

Model on HFDataset on HF

LOTS

This is the official implementation of the LOTS adapter from the paper "LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing", published as Oral at ICCV25 in Honolulu.

To access the Sketchy dataset, refer to the HuggingFace repository

Road Map

  • Code release
  • Weights release
  • Platform release

Repository Structure

  1. ckpts folder
  • Contains the pre-trained weights of the LOTS adapter.
  1. scripts folder
  • Contains all the scripts for training and inference with LOTS on Sketchy.
  1. src folder
  • Contains all the source code for the classes, models, and dataloaders used in the scripts.

Installation

Clone the repository

git clone https://huggingface.co/federicogirella/lots
cd lots

We advise creating a Conda environment as follows

  • conda create -n lots python=3.12
  • conda activate lots
  • pip install torch==2.5.1 torchvision==0.20.1 torchaudio==2.5.1 --index-url https://download.pytorch.org/whl/cu121
  • pip install -r requirements.txt
  • pip install -e .

Training

We provide the script to train LOTS on our Sketchy dataset in scripts/lots/train_lots.py. For an example of usage, check run_train.sh, which contains the default parameters used in our experiments.

Inference

You can test our pre-trained model with the inference script in scripts/lots/inference_lots.py. For an example, check run_inference.sh. This script generates an image for each item in the test split of Sketchy, and saves them in a structured folder, with each item identified by its unique ID.

Citation

If you find our work useful, please cite our work:

@inproceedings{girella2025lots,
  author    = {Girella, Federico and Talon, Davide and Lie, Ziyue and Ruan, Zanxi and Wang, Yiming and Cristani, Marco},
  title     = {LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing},
  journal   = {Proceedings of the International Conference on Computer Vision},
  year      = {2025},
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support