neuregex commited on
Commit
560ea3c
·
verified ·
1 Parent(s): 9c6e373

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +43 -7
README.md CHANGED
@@ -1,15 +1,51 @@
1
  ---
2
  license: apache-2.0
3
  base_model: ByteDance/Bernini-R
4
- tags: [gguf, wan2.2, comfyui, bernini-r, text-to-video, image-editing]
 
 
 
 
 
 
 
 
 
5
  ---
6
 
7
- # Bernini-R — GGUF (high / low noise experts)
 
 
8
 
9
  GGUF quantizations of **[ByteDance/Bernini-R](https://huggingface.co/ByteDance/Bernini-R)**
10
- (Wan2.2-T2V-A14B + source-id RoPE + APG) for use with
11
- **[ComfyUI-BerniniR](https://github.com/neuregex/ComfyUI-BerniniR)** + `ComfyUI-GGUF`.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
 
13
- Two experts (Wan 2.2 high/low-noise), quants: `Q4_K_M`, `Q5_K_M`, `Q8_0`.
14
- Load each with `UnetLoaderGGUF` then `BerniniR · Apply Patches`. GGUF avoids the fp8
15
- dual-expert memory crash, so both experts run in 24 GB.
 
1
  ---
2
  license: apache-2.0
3
  base_model: ByteDance/Bernini-R
4
+ pipeline_tag: text-to-video
5
+ library_name: gguf
6
+ tags:
7
+ - gguf
8
+ - wan2.2
9
+ - comfyui
10
+ - bernini-r
11
+ - text-to-video
12
+ - image-editing
13
+ - video-editing
14
  ---
15
 
16
+ # Bernini-R — GGUF (high & low noise experts)
17
+
18
+ **[💻 Code / ComfyUI nodes](https://github.com/neuregex/ComfyUI-BerniniR)** · **[💬 Discord — updates, roadmaps, projects, or just to chat](https://discord.gg/HxfP9TnctJ)** · **[🧬 Bernini-R](https://huggingface.co/ByteDance/Bernini-R)** · **[Wan2.2-T2V-A14B](https://huggingface.co/Wan-AI/Wan2.2-T2V-A14B)**
19
 
20
  GGUF quantizations of **[ByteDance/Bernini-R](https://huggingface.co/ByteDance/Bernini-R)**
21
+ (= Wan2.2-T2V-A14B + source-id RoPE + multi-condition APG guidance), for use in ComfyUI with
22
+ **[ComfyUI-BerniniR](https://github.com/neuregex/ComfyUI-BerniniR)** + [`city96/ComfyUI-GGUF`](https://github.com/city96/ComfyUI-GGUF).
23
+
24
+ Bernini-R is a **dual-expert** model (Wan 2.2): a high-noise expert sets the composition and a
25
+ low-noise expert refines detail. Both are quantized here. GGUF carries no fp8 tensors, so the two
26
+ experts coexist in **24 GB VRAM** without the offload crash the fp8 build hits.
27
+
28
+ ## Files
29
+
30
+ | File | Expert | Quant | Size |
31
+ |------|--------|-------|------|
32
+ | `bernini_r_high_noise_14B-Q4_K_M.gguf` | high-noise | Q4_K_M | 9.66 GB |
33
+ | `bernini_r_high_noise_14B-Q5_K_M.gguf` | high-noise | Q5_K_M | 10.8 GB |
34
+ | `bernini_r_high_noise_14B-Q8_0.gguf` | high-noise | Q8_0 | 15.4 GB |
35
+ | `bernini_r_low_noise_14B-Q4_K_M.gguf` | low-noise | Q4_K_M | 9.66 GB |
36
+ | `bernini_r_low_noise_14B-Q5_K_M.gguf` | low-noise | Q5_K_M | 10.8 GB |
37
+ | `bernini_r_low_noise_14B-Q8_0.gguf` | low-noise | Q8_0 | 15.4 GB |
38
+
39
+ `Q5_K_M` is the recommended balance; `Q8_0` for best quality, `Q4_K_M` for the lowest VRAM.
40
+
41
+ ## Usage in ComfyUI
42
+
43
+ 1. Install **[ComfyUI-BerniniR](https://github.com/neuregex/ComfyUI-BerniniR)** and **[ComfyUI-GGUF](https://github.com/city96/ComfyUI-GGUF)**.
44
+ 2. Put the `.gguf` files in `ComfyUI/models/unet/`. You also need the Wan VAE (`wan_2.1_vae.safetensors`) and the UMT5 text encoder (`umt5_xxl_fp8_e4m3fn_scaled.safetensors`).
45
+ 3. **t2v / t2i** (`source_id=0` is identical to stock Wan 2.2): one `UnetLoaderGGUF` → your sampler.
46
+ 4. **Editing (i2i / v2v), both experts:** load each GGUF with `UnetLoaderGGUF`, send each through
47
+ **BerniniR · Apply Patches**, then into **BerniniR · Guider** (`model` = high, `model_low` = low).
48
+ The guider switches expert by timestep (t=875) and runs the APG guidance.
49
+ Ready-made graph: `workflows/ui/bernini_i2i_gguf_dual.json` in the node repo.
50
 
51
+ License: Apache-2.0 (same as the base model).