lpalbou commited on
Commit
161bb7a
·
verified ·
1 Parent(s): 177422b

Release Wan2.2 T2V A14B q8/BF16 MLX-Gen package

Browse files
README.md CHANGED
@@ -13,14 +13,13 @@ tags:
13
  - wan2.2
14
  - video-generation
15
  - text-to-video
16
- - image-to-video
17
  - wan-a14b
18
  ---
19
  # wan2.2-t2v-a14b-diffusers-8bit
20
 
21
  This repository contains MLX-Gen saved weights for `Wan-AI/Wan2.2-T2V-A14B-Diffusers`. The checkpoint is designed for local Apple Silicon inference with [`mlx-gen`](https://github.com/lpalbou/mlx-gen).
22
 
23
- It uses the mflux/MLX saved-weight layout and MLX quantization tensors. It is not a Diffusers or Transformers `from_pretrained()` checkpoint.
24
 
25
  ## Source Model
26
 
@@ -36,7 +35,7 @@ This is an MLX q8 checkpoint for Wan2.2 A14B. MLX-Gen uses 8-bit quantization fo
36
 
37
  - q8 for quantizable Wan transformer attention and feed-forward modules.
38
  - BF16 for the Wan VAE.
39
- - BF16 for Wan transformer conditioning/output projection linears, the UMT5 text encoder, scheduler metadata, tokenizer files, norms, and other non-quantizable parameters.
40
 
41
  Wan q4 quality and any possible mixed q4/q8 policy are still under validation. Prefer q8 for publishable Wan checkpoints until the q4 policy is documented.
42
 
@@ -44,31 +43,30 @@ See the [MLX-Gen quantization docs](https://github.com/lpalbou/mlx-gen/blob/main
44
 
45
  ## Local Validation
46
 
47
- Validation used 384x224, 17 frames, 12 denoising steps, guidance 4, guidance-2 3, fps 8, and seed 4242 on Apple Silicon. The memory numbers cover the full run from model init through video save. MLX peak is the MLX allocator peak; physical peak is the Darwin process physical footprint, which better reflects Apple Silicon unified-memory pressure than RSS alone.
48
 
49
- Bottom line:
50
-
51
- - The BF16 package reduces storage, not runtime memory.
52
- - This mixed q8/BF16 package reduces both storage and runtime memory. This is the package to use when generation memory footprint matters.
53
-
54
- | Layout | Disk | MLX Peak | Physical Peak | Time | Result |
55
- | --- | ---: | ---: | ---: | ---: | --- |
56
- | Original source snapshot | 118 GiB | 33.0 GiB | 48.9 GiB | 108.3 s | Baseline. |
57
- | BF16 package | 64 GiB | 33.0 GiB | 45.1 GiB | 114.4 s | Storage only; output was byte-identical. |
58
- | This mixed q8/BF16 package | 40 GiB | 20.8 GiB | 31.8 GiB | 110.3 s | Storage and memory; side-by-side quality validation passed. |
59
-
60
- Compared with the original source snapshot, this mixed q8/BF16 package cuts disk usage by about 66%, MLX peak memory by about 37%, and physical peak memory by about 35% in this validation run. It is not byte-identical to BF16, but the validation contact sheet stayed in the same visual family. The prepared q8/BF16 output was byte-identical to running `--quantize 8` from the upstream source snapshot.
61
 
62
  ## Compatibility
63
 
64
- Requires `mlx-gen >= 0.18.8`.
65
 
66
- Generated with `mlx-gen 0.18.8`.
67
 
68
  Use the `mlxgen` command and Python import path for new MLX-Gen projects.
69
 
70
  ## Usage
71
 
 
 
72
  ```bash
73
  python -m pip install -U mlx-gen
74
 
@@ -78,14 +76,15 @@ mlxgen generate \
78
  --model AbstractFramework/wan2.2-t2v-a14b-diffusers-8bit \
79
  --task text-to-video \
80
  --prompt "Your video prompt here" \
81
- --width 1280 \
82
- --height 720 \
83
- --frames 81 \
84
- --steps 40 \
85
  --guidance 4 \
86
  --guidance-2 3 \
87
- --fps 16 \
88
- --seed 42 \
 
89
  --output video.mp4
90
  ```
91
 
 
13
  - wan2.2
14
  - video-generation
15
  - text-to-video
 
16
  - wan-a14b
17
  ---
18
  # wan2.2-t2v-a14b-diffusers-8bit
19
 
20
  This repository contains MLX-Gen saved weights for `Wan-AI/Wan2.2-T2V-A14B-Diffusers`. The checkpoint is designed for local Apple Silicon inference with [`mlx-gen`](https://github.com/lpalbou/mlx-gen).
21
 
22
+ It uses the mflux/MLX saved-weight layout. Quantized checkpoints include MLX quantization tensors. It is not a Diffusers or Transformers `from_pretrained()` checkpoint.
23
 
24
  ## Source Model
25
 
 
35
 
36
  - q8 for quantizable Wan transformer attention and feed-forward modules.
37
  - BF16 for the Wan VAE.
38
+ - BF16 for Wan transformer conditioning/output projection linears, the UMT5 text encoder, scheduler metadata, tokenizer files, norms, convolutions, and other non-quantizable parameters.
39
 
40
  Wan q4 quality and any possible mixed q4/q8 policy are still under validation. Prefer q8 for publishable Wan checkpoints until the q4 policy is documented.
41
 
 
43
 
44
  ## Local Validation
45
 
46
+ These measurements are validation-sized release checks for this uploaded package. They verify package loading, video integrity, and prompt influence for this profile only; they do not claim full-size `1280x720`, 81-frame, 40-step readiness.
47
 
48
+ | Measurement | Value |
49
+ |---|---:|
50
+ | Package disk usage | 39.5 GiB |
51
+ | Validation profile | 384x224, 33 frames, 12 steps, 8.0 fps, seed 4242, `--low-ram` |
52
+ | Prompt pair | scientist scene / red car scene |
53
+ | Video health | 33 / 33 frames decoded, 8.0 fps, nonblank |
54
+ | Mean temporal delta | 5.6 / 3.2 luma |
55
+ | Prompt delta | 102.0 mean abs RGB |
56
+ | Generation time | 162.2 s / 319.6 s |
 
 
 
57
 
58
  ## Compatibility
59
 
60
+ Requires `mlx-gen >= 0.18.9`.
61
 
62
+ Generated with `mlx-gen 0.18.9`.
63
 
64
  Use the `mlxgen` command and Python import path for new MLX-Gen projects.
65
 
66
  ## Usage
67
 
68
+ The q8 A14B example below is intentionally validation-sized. Do not use this card to claim full-size `1280x720`, 81-frame, 40-step readiness until that exact path has passed video integrity and quality validation.
69
+
70
  ```bash
71
  python -m pip install -U mlx-gen
72
 
 
76
  --model AbstractFramework/wan2.2-t2v-a14b-diffusers-8bit \
77
  --task text-to-video \
78
  --prompt "Your video prompt here" \
79
+ --width 384 \
80
+ --height 224 \
81
+ --frames 33 \
82
+ --steps 12 \
83
  --guidance 4 \
84
  --guidance-2 3 \
85
+ --fps 8 \
86
+ --seed 4242 \
87
+ --metadata \
88
  --output video.mp4
89
  ```
90
 
transformer/0.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:d324244b04b73bd63a5e41775a9a5fc0e38a0fdc6f6f641d274496fed8d8d727
3
  size 2126914156
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:778d017f1aaec61aa14ac84ee8305f47cd6b8c7ee988b213adea84cdbcd84da8
3
  size 2126914156
transformer/1.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:df5419e4f564780aaa48b01874bae29a765c3df5466f5d8878fd2b85537f6cd9
3
  size 2129375449
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4d515a2ca0dd66597b72baf73badfcfb2bcce966fb9c725d2e3e324aa827efe4
3
  size 2129375449
transformer/2.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:cd1774082f5db81dca8f8ed514c241e0ab58f3b434b108c579ab72c9298cc4be
3
  size 2109848506
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec9e66ad6f169635400291353eca81e8566bdc19e66c6950ad77c39ff1dd5679
3
  size 2109848506
transformer/3.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b85b62cfbe121c9ba895d83e71fb22df7f793f907ba6d462da7d86e000aa7294
3
  size 2137714721
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a935fc1df9a54abd8b3c95ab963797e73e813de307df9e4fc378cad6f48cb215
3
  size 2137714721
transformer/4.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1c966f65563f26fcce793a4cf823949926f987a60ed8ddb837eddff91f133642
3
  size 2129375671
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2ba701f70de8516d08cc0290341edadc298e89878b14b47374fd0e7200075274
3
  size 2129375671
transformer/5.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:43cccf0b11fc18715f05c530d2cfe8e4ee6c1c1e7c8fbc57f13d02d6c535468d
3
  size 2081985022
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:34de1e3c49b2d6b32768925b31a7bfd21ec11ec68f43e6ddc50d4a8378717825
3
  size 2081985022
transformer/6.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:401c3d202660a8ab6d75d119cf1b7a019c94e9aed131b478a7d67cc5d71e5747
3
  size 2137714713
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:92d9390b17d0a97ca5029b99cec0ca1c10de21a5c4ed6cc4ae934919eae05218
3
  size 2137714713
transformer/7.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3684a00b35ab9db7a5c103704ef99943a12d60ad18ad9e2932d126f40afd0abc
3
  size 552561394
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:658d8b09c15faa69ab0baa75fd28c8b743cbe14317fa93e17060984cdfacb1d3
3
  size 552561394
transformer/model.safetensors.index.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "metadata": {
3
  "quantization_level": "8",
4
- "mflux_version": "0.18.8"
5
  },
6
  "weight_map": {
7
  "rope.freqs_cos": "0.safetensors",
 
1
  {
2
  "metadata": {
3
  "quantization_level": "8",
4
+ "mflux_version": "0.18.9"
5
  },
6
  "weight_map": {
7
  "rope.freqs_cos": "0.safetensors",
transformer_2/0.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2732c943575c11fef937956a9e8400ced38374295954b58fbc5a1af01d32f986
3
  size 2126914156
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dbeaaaf67edd24e54ea4e9f80d314df1be530be819243e227a81547704ab4c03
3
  size 2126914156
transformer_2/1.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3e2f216038257ac1b3a21def6062756d862ce266fd571f1c4a9cd03ca1b7e0b7
3
  size 2129375449
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ba39c2e3bfcd1c5a704e2ddec57f49ea45803c12ea50418841fd475e1049d864
3
  size 2129375449
transformer_2/2.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e94647aeaca6371b650257c9879bf1d16cb273f4b158176ab1169a93703adad0
3
  size 2109848506
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:df2cd8228f861d6e2a21e59c12037cfb5014edc9f509483699af5dd0fdfda46a
3
  size 2109848506
transformer_2/3.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:544ac869de8cf8205875ca1677617ec28f300b440cb8b414a8c868bc2d45b1e4
3
  size 2137714721
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cb84b2a865ce243d3f86da5919626c10bf14c878103d7905a6988bea287b15e1
3
  size 2137714721
transformer_2/4.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0efb68e19c4b89ad3d822c4d1d5c8971f3be063da25785cdf98f4ef3d23973c3
3
  size 2129375671
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f9e27a54ee311ea9a0eb9c347b5da1b198a6c5797d024b4968d3b80ed12c2f7b
3
  size 2129375671
transformer_2/5.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f50af08070a793cca64ade0eb3209d928511fa524e229eeb090cee13f62d1af4
3
  size 2081985022
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2ee7573a19f22cdb1b35985ce0628b652584f304eaa66d41400ed8e13ed2be6b
3
  size 2081985022
transformer_2/6.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:68d30462066067fa5e8e8c0386f56257e71a721303e6575e7395af8e7e0c68c3
3
  size 2137714713
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7d95c814e12f9f8a55a182b09f9822bbd59a1b17a48a185964d0650092cdd728
3
  size 2137714713
transformer_2/7.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fc7a2a4ef35f0d608768caf516d2a0dd3b2d8da782b6598294adeb5272ecaed1
3
  size 552561394
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4c5f350f81e43166976120fad70c218d45907474ced78002636ffcee4c2e977
3
  size 552561394
transformer_2/model.safetensors.index.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "metadata": {
3
  "quantization_level": "8",
4
- "mflux_version": "0.18.8"
5
  },
6
  "weight_map": {
7
  "rope.freqs_cos": "0.safetensors",
 
1
  {
2
  "metadata": {
3
  "quantization_level": "8",
4
+ "mflux_version": "0.18.9"
5
  },
6
  "weight_map": {
7
  "rope.freqs_cos": "0.safetensors",
vae/0.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:039ca54426d7cbfb4ecf852f06b0d943c2450f34a6c92c357824c69d8449587f
3
- size 253807817
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e3d43fd7e6e6370daf762a357d4fab09f34118d0b9f9dc383f322fffe0db0411
3
+ size 253807820
vae/model.safetensors.index.json CHANGED
@@ -1,7 +1,7 @@
1
  {
2
  "metadata": {
3
- "quantization_level": "8",
4
- "mflux_version": "0.18.8"
5
  },
6
  "weight_map": {
7
  "encoder.conv_in.conv3d.weight": "0.safetensors",
 
1
  {
2
  "metadata": {
3
+ "quantization_level": "None",
4
+ "mflux_version": "0.18.9"
5
  },
6
  "weight_map": {
7
  "encoder.conv_in.conv3d.weight": "0.safetensors",