Instructions to use neuregex/Bernini-R-GGUF with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Wan2.2
How to use neuregex/Bernini-R-GGUF with Wan2.2:
# No code snippets available yet for this library. # To use this model, check the repository files and the library's documentation. # Want to help? PRs adding snippets are welcome at: # https://github.com/huggingface/huggingface.js
- Notebooks
- Google Colab
- Kaggle
Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -1,15 +1,51 @@
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
base_model: ByteDance/Bernini-R
|
| 4 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 5 |
---
|
| 6 |
|
| 7 |
-
# Bernini-R — GGUF (high
|
|
|
|
|
|
|
| 8 |
|
| 9 |
GGUF quantizations of **[ByteDance/Bernini-R](https://huggingface.co/ByteDance/Bernini-R)**
|
| 10 |
-
(Wan2.2-T2V-A14B + source-id RoPE + APG) for use with
|
| 11 |
-
**[ComfyUI-BerniniR](https://github.com/neuregex/ComfyUI-BerniniR)** + `ComfyUI-GGUF`.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 12 |
|
| 13 |
-
|
| 14 |
-
Load each with `UnetLoaderGGUF` then `BerniniR · Apply Patches`. GGUF avoids the fp8
|
| 15 |
-
dual-expert memory crash, so both experts run in 24 GB.
|
|
|
|
| 1 |
---
|
| 2 |
license: apache-2.0
|
| 3 |
base_model: ByteDance/Bernini-R
|
| 4 |
+
pipeline_tag: text-to-video
|
| 5 |
+
library_name: gguf
|
| 6 |
+
tags:
|
| 7 |
+
- gguf
|
| 8 |
+
- wan2.2
|
| 9 |
+
- comfyui
|
| 10 |
+
- bernini-r
|
| 11 |
+
- text-to-video
|
| 12 |
+
- image-editing
|
| 13 |
+
- video-editing
|
| 14 |
---
|
| 15 |
|
| 16 |
+
# Bernini-R — GGUF (high & low noise experts)
|
| 17 |
+
|
| 18 |
+
**[💻 Code / ComfyUI nodes](https://github.com/neuregex/ComfyUI-BerniniR)** · **[💬 Discord — updates, roadmaps, projects, or just to chat](https://discord.gg/HxfP9TnctJ)** · **[🧬 Bernini-R](https://huggingface.co/ByteDance/Bernini-R)** · **[Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B)**
|
| 19 |
|
| 20 |
GGUF quantizations of **[ByteDance/Bernini-R](https://huggingface.co/ByteDance/Bernini-R)**
|
| 21 |
+
(= Wan2.2-T2V-A14B + source-id RoPE + multi-condition APG guidance), for use in ComfyUI with
|
| 22 |
+
**[ComfyUI-BerniniR](https://github.com/neuregex/ComfyUI-BerniniR)** + [`city96/ComfyUI-GGUF`](https://github.com/city96/ComfyUI-GGUF).
|
| 23 |
+
|
| 24 |
+
Bernini-R is a **dual-expert** model (Wan 2.2): a high-noise expert sets the composition and a
|
| 25 |
+
low-noise expert refines detail. Both are quantized here. GGUF carries no fp8 tensors, so the two
|
| 26 |
+
experts coexist in **24 GB VRAM** without the offload crash the fp8 build hits.
|
| 27 |
+
|
| 28 |
+
## Files
|
| 29 |
+
|
| 30 |
+
| File | Expert | Quant | Size |
|
| 31 |
+
|------|--------|-------|------|
|
| 32 |
+
| `bernini_r_high_noise_14B-Q4_K_M.gguf` | high-noise | Q4_K_M | 9.66 GB |
|
| 33 |
+
| `bernini_r_high_noise_14B-Q5_K_M.gguf` | high-noise | Q5_K_M | 10.8 GB |
|
| 34 |
+
| `bernini_r_high_noise_14B-Q8_0.gguf` | high-noise | Q8_0 | 15.4 GB |
|
| 35 |
+
| `bernini_r_low_noise_14B-Q4_K_M.gguf` | low-noise | Q4_K_M | 9.66 GB |
|
| 36 |
+
| `bernini_r_low_noise_14B-Q5_K_M.gguf` | low-noise | Q5_K_M | 10.8 GB |
|
| 37 |
+
| `bernini_r_low_noise_14B-Q8_0.gguf` | low-noise | Q8_0 | 15.4 GB |
|
| 38 |
+
|
| 39 |
+
`Q5_K_M` is the recommended balance; `Q8_0` for best quality, `Q4_K_M` for the lowest VRAM.
|
| 40 |
+
|
| 41 |
+
## Usage in ComfyUI
|
| 42 |
+
|
| 43 |
+
1. Install **[ComfyUI-BerniniR](https://github.com/neuregex/ComfyUI-BerniniR)** and **[ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF)**.
|
| 44 |
+
2. Put the `.gguf` files in `ComfyUI/models/unet/`. You also need the Wan VAE (`wan_2.1_vae.safetensors`) and the UMT5 text encoder (`umt5_xxl_fp8_e4m3fn_scaled.safetensors`).
|
| 45 |
+
3. **t2v / t2i** (`source_id=0` is identical to stock Wan 2.2): one `UnetLoaderGGUF` → your sampler.
|
| 46 |
+
4. **Editing (i2i / v2v), both experts:** load each GGUF with `UnetLoaderGGUF`, send each through
|
| 47 |
+
**BerniniR · Apply Patches**, then into **BerniniR · Guider** (`model` = high, `model_low` = low).
|
| 48 |
+
The guider switches expert by timestep (t=875) and runs the APG guidance.
|
| 49 |
+
Ready-made graph: `workflows/ui/bernini_i2i_gguf_dual.json` in the node repo.
|
| 50 |
|
| 51 |
+
License: Apache-2.0 (same as the base model).
|
|
|
|
|
|