| --- |
| license: apache-2.0 |
| library_name: comfyui |
| pipeline_tag: image-to-image |
| tags: |
| - hdr |
| - diffusion |
| - comfyui |
| - vae-decoder |
| - openexr |
| - radiance |
| - rudra |
| --- |
| |
| # RUDRA — HDR Decoders for Diffusion Models |
|
|
| **Radiometric Dynamic-Range Conditioning for HDR-Aware Diffusion Models** |
| FXTD Studios / Radiance Research |
|
|
| Distilled decoders that turn diffusion **latents into scene-linear HDR / OpenEXR** instead |
| of tone-mapped SDR. They replace the standard VAE decode inside the *Radiance HDR VAE |
| Decode* ComfyUI node, preserving highlights, exposure, and wide-gamut color. |
|
|
| ➡️ **Code, training, and docs:** https://github.com/fxtdstudios/RUDRA |
|
|
| ## Files |
|
|
| Each file is a trained `RadianceTurboDecoder` / `RadianceFullDecoder` for one backbone: |
|
|
| ``` |
| rudra_{turbo|full}_decoder_{backbone}_ema.safetensors |
| ``` |
|
|
| | Backbone | Recommended file | Quality (PSNR_log) | |
| |---|---|---| |
| | Flux.1 | `rudra_full_decoder_flux_ema.safetensors` | 29.77 | |
| | Wan | `rudra_full_decoder_wan_ema.safetensors` | 32.45 | |
| | SDXL | `rudra_turbo_decoder_sdxl_ema.safetensors` | 33.86 | |
| | Qwen-Image | `rudra_turbo_decoder_qwen_ema.safetensors` | 26.67 | |
| | Flux.2 Klein | `rudra_turbo_decoder_flux2-klein_ema.safetensors` | 28.57 | |
| | LTX (2.3) | `rudra_full_decoder_ltx-video_ema.safetensors` | 25.47 | |
| | Z-Image | use the Flux decoder (shares the FLUX.1 VAE) | — | |
| |
| `turbo` (~0.5 M params) is fast and strong on simple latents (SDXL); `full` (~5.6 M) wins |
| on Flux/Wan/LTX. Both are provided where trained. |
| |
| ## Usage (ComfyUI) |
| |
| 1. Download into `ComfyUI/models/radiance/`: |
| ```bash |
| huggingface-cli download fxtdstudios/RUDRA --include "rudra_*.safetensors" \ |
| --local-dir "ComfyUI/models/radiance" |
| ``` |
| 2. In the **Radiance HDR VAE Decode** node: set `rudra_decoder = Enabled`, pick |
| `decoder_size` (`rudra_turbo` or `rudra_full`) per the table above, and set |
| `target_space` to your output color space (Linear / ACEScg / Rec.2020 / LogC4…). |
| |
| ## Notes |
| |
| - **Backbone-specific:** a decoder is tied to its VAE latent space — use the matching file |
| for the model feeding the node (Flux decoder for a Flux workflow, etc.). |
| - **Flux.2 Klein** uses a 128-channel / 16× VAE, so its decoder has an extra upsample stage |
| (requires the updated `fast_vae.py` from the GitHub repo). |
| - Quality is reported as held-out **log-space PSNR**; perceptual HDR evaluation uses |
| **ColorVideoVDP** (JOD). SDXL/Qwen/Klein were trained on a smaller pair set and can be |
| improved with more data. |
| |
| ## Citation |
| |
| > RUDRA: Radiometric Dynamic-Range Conditioning for HDR-Aware Diffusion Models. |
| > FXTD Studios / Radiance Research. |
| |
| License: change the `license:` field above to match your release t |