RUDRA / README.md
fxtdstudios's picture
Upload folder using huggingface_hub
7e59e1c verified
|
Raw
History Blame Contribute Delete
2.68 kB
---
license: apache-2.0
library_name: comfyui
pipeline_tag: image-to-image
tags:
- hdr
- diffusion
- comfyui
- vae-decoder
- openexr
- radiance
- rudra
---
# RUDRA — HDR Decoders for Diffusion Models
**Radiometric Dynamic-Range Conditioning for HDR-Aware Diffusion Models**
FXTD Studios / Radiance Research
Distilled decoders that turn diffusion **latents into scene-linear HDR / OpenEXR** instead
of tone-mapped SDR. They replace the standard VAE decode inside the *Radiance HDR VAE
Decode* ComfyUI node, preserving highlights, exposure, and wide-gamut color.
➡️ **Code, training, and docs:** https://github.com/fxtdstudios/RUDRA
## Files
Each file is a trained `RadianceTurboDecoder` / `RadianceFullDecoder` for one backbone:
```
rudra_{turbo|full}_decoder_{backbone}_ema.safetensors
```
| Backbone | Recommended file | Quality (PSNR_log) |
|---|---|---|
| Flux.1 | `rudra_full_decoder_flux_ema.safetensors` | 29.77 |
| Wan | `rudra_full_decoder_wan_ema.safetensors` | 32.45 |
| SDXL | `rudra_turbo_decoder_sdxl_ema.safetensors` | 33.86 |
| Qwen-Image | `rudra_turbo_decoder_qwen_ema.safetensors` | 26.67 |
| Flux.2 Klein | `rudra_turbo_decoder_flux2-klein_ema.safetensors` | 28.57 |
| LTX (2.3) | `rudra_full_decoder_ltx-video_ema.safetensors` | 25.47 |
| Z-Image | use the Flux decoder (shares the FLUX.1 VAE) | — |
`turbo` (~0.5 M params) is fast and strong on simple latents (SDXL); `full` (~5.6 M) wins
on Flux/Wan/LTX. Both are provided where trained.
## Usage (ComfyUI)
1. Download into `ComfyUI/models/radiance/`:
```bash
huggingface-cli download fxtdstudios/RUDRA --include "rudra_*.safetensors" \
--local-dir "ComfyUI/models/radiance"
```
2. In the **Radiance HDR VAE Decode** node: set `rudra_decoder = Enabled`, pick
`decoder_size` (`rudra_turbo` or `rudra_full`) per the table above, and set
`target_space` to your output color space (Linear / ACEScg / Rec.2020 / LogC4…).
## Notes
- **Backbone-specific:** a decoder is tied to its VAE latent space — use the matching file
for the model feeding the node (Flux decoder for a Flux workflow, etc.).
- **Flux.2 Klein** uses a 128-channel / 16× VAE, so its decoder has an extra upsample stage
(requires the updated `fast_vae.py` from the GitHub repo).
- Quality is reported as held-out **log-space PSNR**; perceptual HDR evaluation uses
**ColorVideoVDP** (JOD). SDXL/Qwen/Klein were trained on a smaller pair set and can be
improved with more data.
## Citation
> RUDRA: Radiometric Dynamic-Range Conditioning for HDR-Aware Diffusion Models.
> FXTD Studios / Radiance Research.
License: change the `license:` field above to match your release t