lpalbou commited on
Commit
a1c9286
·
verified ·
1 Parent(s): 14622af

Update TI2V-5B memory and validation card

Browse files
Files changed (1) hide show
  1. README.md +44 -16
README.md CHANGED
@@ -8,6 +8,7 @@ tags:
8
  - mlx-gen
9
  - mflux
10
  - apple-silicon
 
11
  - wan
12
  - wan2.2
13
  - video-generation
@@ -16,31 +17,54 @@ tags:
16
  ---
17
  # wan2.2-ti2v-5b-diffusers-bf16
18
 
19
- This repository contains MLX-Gen saved weights for `Wan-AI/Wan2.2-TI2V-5B-Diffusers`. The checkpoint is designed for local Apple Silicon inference with [`mlx-gen`](https://github.com/lpalbou/mlx-gen).
 
 
 
20
 
21
- It uses the mflux/MLX saved-weight layout. It is not a Diffusers or Transformers `from_pretrained()` checkpoint.
 
22
 
23
  ## Source Model
24
 
25
  Original model: [`Wan-AI/Wan2.2-TI2V-5B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B-Diffusers).
26
 
27
- ## License and Access
28
-
29
  This prepared derivative follows the Apache 2.0 license of the source model.
30
 
31
- ## Quantization
 
 
 
 
 
 
 
 
32
 
33
- This checkpoint stores MLX-Gen Wan2.2 TI2V-5B weights without an explicit quantization level.
34
 
35
- For Wan checkpoints, MLX-Gen loads transformer and VAE weights at BF16 runtime precision. The UMT5 text encoder is preserved from the source model. Wan supports text-to-video and selected image-to-video routes depending on the source model.
36
 
37
- ## Compatibility
 
38
 
39
- Requires `mlx-gen >= 0.18.10`.
 
 
 
 
40
 
41
- Generated with `mlx-gen 0.18.10`.
 
42
 
43
- Use the `mlxgen` command and Python import path for new MLX-Gen projects.
 
 
 
 
 
 
 
44
 
45
  ## Usage
46
 
@@ -51,19 +75,23 @@ mlxgen download --model AbstractFramework/wan2.2-ti2v-5b-diffusers-bf16
51
 
52
  mlxgen generate \
53
  --model AbstractFramework/wan2.2-ti2v-5b-diffusers-bf16 \
54
- --prompt "Your video prompt here" \
 
55
  --width 1280 \
56
  --height 704 \
57
- --frames 121 \
58
- --steps 50 \
59
  --guidance 5 \
60
  --fps 24 \
61
- --seed 42 \
62
  --output video.mp4
63
  ```
64
 
 
 
65
  ## Attribution
66
 
67
- MLX-Gen is based on [mflux](https://github.com/filipstrand/mflux) by Filip Strand and the original mflux contributors. This model card is generated by MLX-Gen so derived checkpoints keep that attribution visible.
 
68
 
69
  Prepared and contributed by [@lpalbou](https://huggingface.co/lpalbou).
 
8
  - mlx-gen
9
  - mflux
10
  - apple-silicon
11
+ - bf16
12
  - wan
13
  - wan2.2
14
  - video-generation
 
17
  ---
18
  # wan2.2-ti2v-5b-diffusers-bf16
19
 
20
+ This repository contains BF16 MLX-Gen saved weights for
21
+ [`Wan-AI/Wan2.2-TI2V-5B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B-Diffusers).
22
+ It is designed for local Apple Silicon inference with
23
+ [`mlx-gen`](https://github.com/lpalbou/mlx-gen).
24
 
25
+ It uses the mflux/MLX saved-weight layout. It is not a Diffusers or Transformers
26
+ `from_pretrained()` checkpoint.
27
 
28
  ## Source Model
29
 
30
  Original model: [`Wan-AI/Wan2.2-TI2V-5B-Diffusers`](https://huggingface.co/Wan-AI/Wan2.2-TI2V-5B-Diffusers).
31
 
 
 
32
  This prepared derivative follows the Apache 2.0 license of the source model.
33
 
34
+ ## Precision
35
+
36
+ The upstream TI2V-5B source snapshot is not uniformly 16-bit on disk: the transformer and VAE
37
+ safetensors are FP32, while the UMT5 text encoder is BF16. MLX-Gen loads Wan transformer/VAE
38
+ weights at BF16 runtime precision, so this prepared BF16 package reduces storage and download size
39
+ but is not a runtime-memory optimization versus source generation.
40
+
41
+ Use this package when you want a smaller reusable MLX-Gen folder that preserves source behavior.
42
+ Use the mixed q8/BF16 package when you want a smaller model footprint.
43
 
44
+ ## Measurements
45
 
46
+ Measured on 2026-06-04 with `mlx-gen 0.18.10` on an Apple M5 Max with 128 GiB unified memory.
47
 
48
+ Validation profile: `1280x704`, 17 frames, 20 denoising steps, guidance `5`, 24 fps, seed `321`,
49
+ explicit empty negative prompt.
50
 
51
+ | Layout | Storage | Logical Model | Full-Process Physical Peak | Max RSS | MLX Peak | Total Time | Output |
52
+ | --- | ---: | ---: | ---: | ---: | ---: | ---: | --- |
53
+ | Upstream source snapshot | 31.9 GiB | 10.6 GiB | 102.7 GiB | 13.7 GiB | 58.5 GiB | 216.2 s | [base-source.mp4](validation/ti2v5b-clean/base-source.mp4) |
54
+ | This BF16 package | 21.2 GiB | 10.6 GiB | 102.6 GiB | 14.5 GiB | 58.5 GiB | 261.6 s | [prepared-bf16.mp4](validation/ti2v5b-clean/prepared-bf16.mp4) |
55
+ | Mixed q8/BF16 package | 16.9 GiB | 6.3 GiB | 103.7 GiB | 13.8 GiB | 54.2 GiB | 243.4 s | [mixed-q8-bf16.mp4](validation/ti2v5b-clean/mixed-q8-bf16.mp4) |
56
 
57
+ The source and this BF16 package produced byte-identical decoded MP4 frames. The mixed q8/BF16
58
+ package stayed visually in the same family with mean frame MAE `1.66` versus source/BF16.
59
 
60
+ `Storage` is the Hugging Face repository total. `Logical Model` is the loaded Wan transformer plus
61
+ VAE tensor footprint measured from MLX arrays. `Full-Process Physical Peak` is Darwin
62
+ `phys_footprint` sampled from model initialization through MP4 save and health validation.
63
+
64
+ Validation assets:
65
+
66
+ - [contact-sheet.png](validation/ti2v5b-clean/contact-sheet.png)
67
+ - [metrics.json](validation/ti2v5b-clean/metrics.json)
68
 
69
  ## Usage
70
 
 
75
 
76
  mlxgen generate \
77
  --model AbstractFramework/wan2.2-ti2v-5b-diffusers-bf16 \
78
+ --prompt "A short cinematic video of a glowing orange glass sphere floating above calm teal water, soft reflections, gentle camera movement" \
79
+ --negative-prompt "" \
80
  --width 1280 \
81
  --height 704 \
82
+ --frames 17 \
83
+ --steps 20 \
84
  --guidance 5 \
85
  --fps 24 \
86
+ --seed 321 \
87
  --output video.mp4
88
  ```
89
 
90
+ TI2V-5B also supports first-frame image-to-video in MLX-Gen when one input image is supplied.
91
+
92
  ## Attribution
93
 
94
+ MLX-Gen is based on [mflux](https://github.com/filipstrand/mflux) by Filip Strand and the original
95
+ mflux contributors.
96
 
97
  Prepared and contributed by [@lpalbou](https://huggingface.co/lpalbou).