Buckets:
71.8 GB
8 files
Updated 7 days ago
Ctrl+K
| Name | Size | Uploaded | Xet hash |
|---|---|---|---|
| .gitattributes | 1.95 kB xet | 1565e08d | |
| README.md | 2.65 kB xet | 9c15dc38 | |
| bernini_r_high_noise_14B-Q4_K_M.gguf | 9.66 GB xet | 817b7f14 | |
| bernini_r_high_noise_14B-Q5_K_M.gguf | 10.8 GB xet | abb905ae | |
| bernini_r_high_noise_14B-Q8_0.gguf | 15.4 GB xet | 2067374b | |
| bernini_r_low_noise_14B-Q4_K_M.gguf | 9.66 GB xet | ce1cb80d | |
| bernini_r_low_noise_14B-Q5_K_M.gguf | 10.8 GB xet | ee62d6ec | |
| bernini_r_low_noise_14B-Q8_0.gguf | 15.4 GB xet | ce733f36 |
Bernini-R โ GGUF (high & low noise experts)
๐ฌ Discord โ updates, roadmaps, projects, or just to chat ๐งฌ Bernini-R Wan2.2-T2V-A14B
GGUF quantizations of ByteDance/Bernini-R
(= Wan2.2-T2V-A14B + source-id RoPE + multi-condition APG guidance), for use in ComfyUI with
ComfyUI-BerniniR + city96/ComfyUI-GGUF.
Bernini-R is a dual-expert model (Wan 2.2): a high-noise expert sets the composition and a low-noise expert refines detail. Both are quantized here. GGUF carries no fp8 tensors, so the two experts coexist in 24 GB VRAM without the offload crash the fp8 build hits.
Files
| File | Expert | Quant | Size |
|---|---|---|---|
bernini_r_high_noise_14B-Q4_K_M.gguf |
high-noise | Q4_K_M | 9.66 GB |
bernini_r_high_noise_14B-Q5_K_M.gguf |
high-noise | Q5_K_M | 10.8 GB |
bernini_r_high_noise_14B-Q8_0.gguf |
high-noise | Q8_0 | 15.4 GB |
bernini_r_low_noise_14B-Q4_K_M.gguf |
low-noise | Q4_K_M | 9.66 GB |
bernini_r_low_noise_14B-Q5_K_M.gguf |
low-noise | Q5_K_M | 10.8 GB |
bernini_r_low_noise_14B-Q8_0.gguf |
low-noise | Q8_0 | 15.4 GB |
Q5_K_M is the recommended balance; Q8_0 for best quality, Q4_K_M for the lowest VRAM.
Usage in ComfyUI
- Install ComfyUI-BerniniR and ComfyUI-GGUF.
- Put the
.gguffiles inComfyUI/models/unet/. You also need the Wan VAE (wan_2.1_vae.safetensors) and the UMT5 text encoder (umt5_xxl_fp8_e4m3fn_scaled.safetensors). - t2v / t2i (
source_id=0is identical to stock Wan 2.2): oneUnetLoaderGGUFโ your sampler. - Editing (i2i / v2v), both experts: load each GGUF with
UnetLoaderGGUF, send each through BerniniR ยท Apply Patches, then into BerniniR ยท Guider (model= high,model_low= low). The guider switches expert by timestep (t=875) and runs the APG guidance. Ready-made graph:workflows/ui/bernini_i2i_gguf_dual.jsonin the node repo.
License: Apache-2.0 (same as the base model).
- Total size
- 71.8 GB
- Files
- 8
- Last updated
- Jun 20
- Pre-warmed CDN
- US EU US EU