Improve validation summary readability
--- a/README.md
+++ b/README.md
@@ -37,16 +37,26 @@ For Wan checkpoints, MLX-Gen loads transformer and VAE weights at BF16 runtime p
 
 ## Local Validation
 
-Validation used 384x224, 17 frames, 12 denoising steps, guidance 4, guidance-2 3, fps 8, and seed 4242 on Apple Silicon.
-[old lines 41-49: nine further removed lines whose text did not survive extraction]
+Validation used 384x224, 17 frames, 12 denoising steps, guidance 4, guidance-2 3, fps 8, and seed 4242 on Apple Silicon. The memory numbers cover the full run from model init through video save. MLX peak is the MLX allocator peak; physical peak is the Darwin process physical footprint, which better reflects Apple Silicon unified-memory pressure than RSS alone.
+
+Bottom line:
+
+- The BF16 package reduces storage, not runtime memory. It is useful when you want a smaller, uploadable package with byte-identical output to the original source-cache run.
+- The mixed q8/BF16 package reduces both storage and runtime memory. Use it when memory footprint matters.
+
+| Layout | Disk | Runtime Memory | Improvement |
+| --- | ---: | --- | --- |
+| Original source snapshot | 118 GiB | Baseline | Baseline. |
+| This BF16 package | 64 GiB | Same class as original | Storage only; output was byte-identical. |
+| Mixed q8/BF16 package | 40 GiB | Lower | Storage and memory; side-by-side quality validation passed. |
+
+Compared with the original source snapshot, this BF16 package cuts disk usage by about 46% but does not materially reduce generation memory. The mixed q8/BF16 package cuts disk usage by about 66% and physical peak memory by about 35% in this validation run.
+
+Raw measurements:
+
+- Original source snapshot: 32.99 GiB MLX peak, 48.90 GiB physical peak, 108.31 s.
+- This BF16 package: 32.98 GiB MLX peak, 45.12 GiB physical peak, 114.39 s.
+- Mixed q8/BF16 package: 20.84 GiB MLX peak, 31.75 GiB physical peak, 110.34 s.
 
 ## Compatibility
 
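The percentages quoted in the added README text can be checked against the disk sizes and raw measurements in the same hunk. The following is a minimal sketch of that arithmetic, not part of the commit. The dictionary keys and the helper names `reduction_pct` and `summarize` are hypothetical, chosen only for illustration. The numbers are copied from the diff.

```python
# Figures copied from the README diff; key names are illustrative only.
DISK_GIB = {"original": 118.0, "bf16": 64.0, "mixed_q8_bf16": 40.0}
MLX_PEAK_GIB = {"original": 32.99, "bf16": 32.98, "mixed_q8_bf16": 20.84}
PHYSICAL_PEAK_GIB = {"original": 48.90, "bf16": 45.12, "mixed_q8_bf16": 31.75}


def reduction_pct(baseline: float, value: float) -> float:
    """Return how much smaller `value` is than `baseline`, in percent."""
    return 100.0 * (baseline - value) / baseline


def summarize() -> dict:
    """Compute reductions relative to the original source snapshot."""
    rows = {}
    for name in ("bf16", "mixed_q8_bf16"):
        rows[name] = {
            "disk": reduction_pct(DISK_GIB["original"], DISK_GIB[name]),
            "mlx_peak": reduction_pct(
                MLX_PEAK_GIB["original"], MLX_PEAK_GIB[name]
            ),
            "physical_peak": reduction_pct(
                PHYSICAL_PEAK_GIB["original"], PHYSICAL_PEAK_GIB[name]
            ),
        }
    return rows


if __name__ == "__main__":
    for name, row in summarize().items():
        print(
            f"{name}: disk -{row['disk']:.1f}%, "
            f"MLX peak -{row['mlx_peak']:.1f}%, "
            f"physical peak -{row['physical_peak']:.1f}%"
        )
```

Rounded to whole percent, the disk reductions and the mixed-package physical peak reduction match the "about 46%", "about 66%", and "about 35%" figures in the README text. The BF16 package's MLX peak is essentially unchanged from the original, which is consistent with the "storage only" description.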