ControlNet β€” conditional diffusion 🚧 not trained yet

Steer Stable Diffusion with a structure map (edges / pose / depth).

Status β€” documented recipe (placeholder). A production-grade pipeline from Ropedia Academy for an advanced, GPU-heavy task. Everything below β€” base model, objective, dataset, config, the exact evaluation β€” is specified; the weights / metrics / figures land here automatically when you run the notebook on a GPU (one click below). Try the trained models live in the Ropedia demos Space.

At a glance

Base model SD 1.5 / SDXL + a ControlNet (pretrained)
Task structure-conditioned image generation
Training objective Structure-conditioned generation (edges / depth / pose) β€” inference.
Track LM Β· Language & multimodal
Built on huggingface/diffusers
Notebook Open In Colab
Compute / storage / time GPU required β€” see the Compute Β· storage Β· time table in the notebook

Dataset

  • Source: Your condition maps + prompts.

Training config

GPU-scale β€” the notebook ships a demo profile (free Colab T4) and a full profile, with an exact Compute Β· storage Β· time table. Hyperparameters (optimizer, steps, batch, LoRA rank, …) are in the training cell.

Evaluation results

⏳ Pending β€” run the notebook on a GPU to fill this in. This lab reports condition fidelity (edge IoU / depth err) Β· CLIP score on a held-out split (see its Evaluate cell).

Inference example

No weights are published yet. After a GPU run, load the checkpoint/adapter the notebook saves (it also has a ready inference cell). Base model: SD 1.5 / SDXL + a ControlNet (pretrained).

How to fill this repo

  1. Open the notebook in Colab β†’ Runtime β†’ GPU β†’ Run all (runs the real pipeline).
  2. Run its Publish to the Hugging Face Hub step (or HfApi().upload_folder(...)) β€” the checkpoint + metrics.json + figures replace this placeholder.
  • Train / run on a GPU Β· [ ] upload weights Β· [ ] add metrics.json Β· [ ] add figures Β· [ ] swap in the real results card

Limitations

Not yet trained β€” no numbers to report. The pipeline is GPU-heavy (see the compute table); on free Colab use the demo-scale settings. This is an educational, reproducible recipe, not a tuned production release.

License

Code: MIT (this repository). The base model (huggingface/diffusers) and dataset are each under their own licenses β€” check the upstream source before redistribution.

Citation

@misc{ropedia_academy,
  title  = {Ropedia Academy: an interactive course on embodied & spatial AI},
  author = {Ropedia Academy},
  year   = {2026},
  howpublished = {\url{https://chaoyue0307.github.io/ropedia-academy/}}
}

Method / original work: Zhang et al., ControlNet, ICCV 2023.

Related assets


Documented placeholder in the Ropedia Academy collection β€” train it on a GPU to publish the real model. Contributions welcome on GitHub.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for cy0307/ropedia-lm-controlnet

Finetuned
(2)
this model

Collection including cy0307/ropedia-lm-controlnet