Commit 46f851f by xocialize · verified · 1 Parent(s): fa6e717

Add SeedVR2-3B MLX-Swift weights (fp16)

Files changed (5)
  1. README.md +42 -0
  2. config.json +13 -0
  3. pos_emb.safetensors +3 -0
  4. transformer.safetensors +3 -0
  5. vae.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,42 @@
+ ---
+ license: apache-2.0
+ library_name: mlx
+ tags:
+ - mlx
+ - mlx-swift
+ - super-resolution
+ - image-upscaling
+ - diffusion
+ - apple-silicon
+ pipeline_tag: image-to-image
+ ---
+
+ # SeedVR2-3B (MLX-Swift) — fp16
+
+ **MLX-Swift** weights for **SeedVR2-3B**, ByteDance's one-step diffusion **super-resolution /
+ restoration** model (ICLR 2026). For on-device upscaling on Apple Silicon via the
+ [`seedvr2-mlx-swift`](https://github.com/xocialize/seedvr2-mlx-swift) package (built for
+ **MLXEngine / ForgeUpscaler**). int8 variant: [`SeedVR2-3B-mlx-int8`](https://huggingface.co/mlx-community/SeedVR2-3B-mlx-int8).
+
+ - **Files:** `transformer.safetensors` (DiT, fp16, ~7.9 GB) · `vae.safetensors` (3D-causal-conv VAE, fp16) · `pos_emb.safetensors` (precomputed text embedding) · `config.json`.
+ - **Precision:** fp16. Parity vs the mflux reference (CPU): transformer `t_out` max_abs **2.1e-4**, VAE encode/decode **3.5e-3 / 7.2e-3**, RNG/scheduler **0.0**.
+
+ ## Usage
+
+ ```swift
+ import SeedVR2MLX // github.com/xocialize/seedvr2-mlx-swift
+ let upscaler = try SeedVR2Upscaler(directory: weightsDir) // downloaded from this repo
+ let out = upscaler.upscale(processedImage: img, seed: 42) // [-1,1], dims padded to /16
+ ```
+
+ (Preprocess — resolution/softness bicubic resize — and LAB color-correction are host-side;
+ VAE tiling for large images is handled by the host, e.g. ForgeUpscaler's tile processor.)
+
+ ## Provenance & license
+
+ Chain: **ByteDance Seed** — *SeedVR2: One-Step Video Restoration via Diffusion Adversarial
+ Post-Training* (ICLR 2026), [ByteDance-Seed/SeedVR](https://github.com/ByteDance-Seed/SeedVR),
+ **Apache-2.0** → PyTorch fp16 redistribution [`numz/SeedVR2_comfyUI`](https://huggingface.co/numz/SeedVR2_comfyUI)
+ → MLX reference impl [`filipstrand/mflux`](https://github.com/filipstrand/mflux) → MLX-Swift
+ port + weight conversion by **MVS Collective (xocialize)**. These are format-converted weight
+ artifacts (not a new model); Apache-2.0 applies. Credit ByteDance Seed (original), cite the paper.
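
Reviewer note: the usage snippet in the README says the output dimensions are "padded to /16". A host that needs to crop the result back may want the padded size up front. The sketch below assumes that padding rounds each dimension up to the next multiple of 16; the README does not say where the padding is placed (right/bottom or symmetric), and `paddedToMultiple` is a hypothetical helper, not part of `seedvr2-mlx-swift`.

```swift
// Hypothetical helper: round a pixel dimension up to the next multiple of 16.
// Assumption: the upscaler pads upward; padding placement is not specified in the README.
func paddedToMultiple(_ value: Int, of multiple: Int = 16) -> Int {
    precondition(value >= 0 && multiple > 0, "dimension must be non-negative, multiple positive")
    // Integer ceil-division, then scale back up to the multiple.
    return (value + multiple - 1) / multiple * multiple
}

let inputWidth = 1920
let inputHeight = 1080
let paddedWidth = paddedToMultiple(inputWidth)    // already a multiple of 16
let paddedHeight = paddedToMultiple(inputHeight)  // rounded up
print("padded size: \(paddedWidth)x\(paddedHeight)")
```

Under that assumption a 1920x1080 input would come back as 1920x1088, so the host would crop 8 rows.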
config.json ADDED
@@ -0,0 +1,13 @@
+ {
+   "model_type": "seedvr2",
+   "variant": "seedvr2-3b",
+   "transformer_overrides": {},
+   "pos_emb_shape": [
+     58,
+     5120
+   ],
+   "dtype": "float16",
+   "upstream": "ByteDance-Seed/SeedVR (Apache-2.0)",
+   "mlx_reference": "filipstrand/mflux",
+   "pytorch_weights": "numz/SeedVR2_comfyUI"
+ }
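
Reviewer note: a minimal sketch of reading the `config.json` above with Foundation's `JSONDecoder`, for anyone consuming these weights outside the package. The `SeedVR2Config` struct and its property names are hypothetical and are not taken from `seedvr2-mlx-swift`. `transformer_overrides` is an empty object in this commit, so its value type is unknown and the sketch leaves it undecoded.

```swift
import Foundation

// Hypothetical mirror of the config.json added in this commit.
struct SeedVR2Config: Decodable {
    let modelType: String
    let variant: String
    let posEmbShape: [Int]
    let dtype: String
    let upstream: String
    let mlxReference: String
    let pytorchWeights: String

    // Map the snake_case JSON keys onto Swift property names.
    enum CodingKeys: String, CodingKey {
        case modelType = "model_type"
        case variant
        case posEmbShape = "pos_emb_shape"
        case dtype
        case upstream
        case mlxReference = "mlx_reference"
        case pytorchWeights = "pytorch_weights"
    }
}

// Inline copy of the file contents so the sketch runs without the repo checked out.
let configJSON = """
{
  "model_type": "seedvr2",
  "variant": "seedvr2-3b",
  "transformer_overrides": {},
  "pos_emb_shape": [58, 5120],
  "dtype": "float16",
  "upstream": "ByteDance-Seed/SeedVR (Apache-2.0)",
  "mlx_reference": "filipstrand/mflux",
  "pytorch_weights": "numz/SeedVR2_comfyUI"
}
"""

let config = try! JSONDecoder().decode(SeedVR2Config.self, from: Data(configJSON.utf8))

// fp16 uses 2 bytes per element, so the raw tensor payload is the element count times 2.
let posEmbElements = config.posEmbShape.reduce(1, *)
let posEmbBytes = posEmbElements * 2
print("\(config.variant): pos_emb \(config.posEmbShape) -> \(posEmbBytes) bytes of fp16 data")
```

The computed payload is 593,920 bytes, 88 bytes less than the 594008-byte size recorded for `pos_emb.safetensors`; the difference is presumably the safetensors header.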
pos_emb.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d2e06cadb10f1f149b5aa781185b250eb60f0e64327f12efd17886cecc9c2984
+ size 594008
transformer.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:f4b96e02638210c807b2e3748665a6d71a47330cdbb112ffaf238f6b05cd55ab
+ size 7936589830
vae.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a2efbfb38a82a99a9cd16432ca5d564510678d228ce6209692d528aa32285cfb
+ size 501324772
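
Reviewer note: the three `.safetensors` entries in this diff are Git LFS pointer files, not the weights themselves. A minimal sketch of parsing one, so a downloader can compare the recorded size (and, separately, the SHA-256) against the file it fetched. `LFSPointer` and `parseLFSPointer` are hypothetical names; the sketch handles only the three keys that appear in this commit.

```swift
import Foundation

// Hypothetical value type for the fields of a Git LFS pointer file.
struct LFSPointer: Equatable {
    let oid: String   // hex SHA-256 digest, without the "sha256:" prefix
    let size: Int64   // size of the real file in bytes
}

// Parse "key value" lines; return nil if a required field is missing or malformed.
func parseLFSPointer(_ text: String) -> LFSPointer? {
    var fields: [String: String] = [:]
    for line in text.split(separator: "\n") {
        let parts = line.split(separator: " ", maxSplits: 1)
        guard parts.count == 2 else { continue }
        fields[String(parts[0])] = String(parts[1])
    }
    guard fields["version"] == "https://git-lfs.github.com/spec/v1",
          let oidField = fields["oid"], oidField.hasPrefix("sha256:"),
          let sizeField = fields["size"], let size = Int64(sizeField)
    else { return nil }
    return LFSPointer(oid: String(oidField.dropFirst("sha256:".count)), size: size)
}

// Pointer contents for transformer.safetensors, copied from the diff above.
let pointerText = """
version https://git-lfs.github.com/spec/v1
oid sha256:f4b96e02638210c807b2e3748665a6d71a47330cdbb112ffaf238f6b05cd55ab
size 7936589830
"""

let pointer = parseLFSPointer(pointerText)!
let gigabytes = Double(pointer.size) / 1e9
print("transformer.safetensors: \(pointer.size) bytes (about \(gigabytes) GB)")
```

The parsed size is consistent with the "~7.9 GB" figure in the README when GB is read as 10^9 bytes.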