data-archetype/capacitor_decoder

@@ -18,15 +18,18 @@ architecture.
 ## Decode Speed
 | Resolution | Speedup vs FLUX.2 | Peak VRAM Reduction | capacitor_decoder (ms/image) | FLUX.2 VAE (ms/image) | capacitor_decoder peak VRAM | FLUX.2 peak VRAM |
 |---:|---:|---:|---:|---:|---:|---:|
-| `512x512` | `1.85x` | `59.3%` | `11.40` | `21.14` | `391.6 MiB` | `961.9 MiB` |
-| `1024x1024` | `3.28x` | `79.1%` | `26.31` | `86.24` | `601.4 MiB` | `2876.4 MiB` |
-| `2048x2048` | `4.70x` | `86.4%` | `86.29` | `405.84` | `1437.4 MiB` | `10531.4 MiB` |
-These measurements are decode-only. Each image is first encoded once with the
-same FLUX.2 encoder, latents are cached in memory, and then both decoders are
-timed over the same cached latent set.
 ## 2k PSNR Benchmark
@@ -124,4 +127,3 @@ upstream and call `decode(..., latents_are_flux2_whitened=False)`.
   url     = {https://huggingface.co/data-archetype/capacitor_decoder},
 }
 ```

 ## Decode Speed
+### RTX 5090
 | Resolution | Speedup vs FLUX.2 | Peak VRAM Reduction | capacitor_decoder (ms/image) | FLUX.2 VAE (ms/image) | capacitor_decoder peak VRAM | FLUX.2 peak VRAM |
 |---:|---:|---:|---:|---:|---:|---:|
+| `512x512` | `6.15x` | `61.5%` | `3.89` | `23.94` | `356.2 MiB` | `925.5 MiB` |
+| `1024x1024` | `11.98x` | `80.8%` | `9.86` | `118.19` | `540.2 MiB` | `2815.2 MiB` |
+| `2048x2048` | `10.81x` | `87.7%` | `52.12` | `563.28` | `1277.8 MiB` | `10371.8 MiB` |
+These measurements are decode-only and were run on an `NVIDIA GeForce RTX 5090`.
+Each image is first encoded once with the same FLUX.2 encoder, latents are
+cached in memory, and then both decoders are timed over the same cached latent
+set.
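
The decode-only procedure described above can be sketched as follows. This is a minimal illustration, not the benchmark script behind the table: `encode` and the decoder callables are hypothetical stand-ins, the warm-up pass and the optional `sync` hook are assumptions, and all helper names are invented for this example.

```python
import time
from typing import Any, Callable, Dict, Optional, Sequence


def time_decoder(
    decode: Callable[[Any], Any],
    latents: Sequence[Any],
    warmup: int = 1,
    sync: Optional[Callable[[], None]] = None,
) -> float:
    """Return mean decode time in milliseconds per latent."""
    # Assumed warm-up pass so one-time setup cost is not included in the timing.
    for z in latents[:warmup]:
        decode(z)
    if sync is not None:
        sync()  # e.g. torch.cuda.synchronize for GPU decoders
    start = time.perf_counter()
    for z in latents:
        decode(z)
    if sync is not None:
        sync()
    elapsed = time.perf_counter() - start
    return 1000.0 * elapsed / len(latents)


def compare_decoders(
    encode: Callable[[Any], Any],
    decoders: Dict[str, Callable[[Any], Any]],
    images: Sequence[Any],
    sync: Optional[Callable[[], None]] = None,
) -> Dict[str, float]:
    """Encode each image once, cache the latents, time every decoder on the same cache."""
    cached = [encode(img) for img in images]  # encoding is excluded from the timing
    return {name: time_decoder(fn, cached, sync=sync) for name, fn in decoders.items()}


def speedup(baseline_ms: float, candidate_ms: float) -> float:
    """Ratio of baseline time to candidate time (the 'Speedup' column)."""
    return baseline_ms / candidate_ms


def vram_reduction_pct(baseline_mib: float, candidate_mib: float) -> float:
    """Percentage reduction in peak memory (the 'Peak VRAM Reduction' column)."""
    return 100.0 * (1.0 - candidate_mib / baseline_mib)
```

With the `512x512` row of the table, `speedup(23.94, 3.89)` rounds to `6.15` and `vram_reduction_pct(925.5, 356.2)` rounds to `61.5`, matching the reported columns. The published ratios may have been computed from unrounded timings, so other rows can differ in the last digit.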
 ## 2k PSNR Benchmark
   url     = {https://huggingface.co/data-archetype/capacitor_decoder},
 }
 ```