File size: 2,021 Bytes
46f851f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
---
license: apache-2.0
library_name: mlx
tags:
  - mlx
  - mlx-swift
  - super-resolution
  - image-upscaling
  - diffusion
  - apple-silicon
pipeline_tag: image-to-image
---

# SeedVR2-3B (MLX-Swift) — fp16

**MLX-Swift** weights for **SeedVR2-3B**, ByteDance's one-step diffusion **super-resolution /
restoration** model (ICLR 2026). For on-device upscaling on Apple Silicon via the
[`seedvr2-mlx-swift`](https://github.com/xocialize/seedvr2-mlx-swift) package (built for
**MLXEngine / ForgeUpscaler**). int8 variant: [`SeedVR2-3B-mlx-int8`](https://huggingface.co/mlx-community/SeedVR2-3B-mlx-int8).

- **Files:** `transformer.safetensors` (DiT, fp16, ~7.9 GB) · `vae.safetensors` (3D-causal-conv VAE, fp16) · `pos_emb.safetensors` (precomputed text embedding) · `config.json`.
- **Precision:** fp16. Parity vs the mflux reference (CPU): transformer `t_out` max_abs **2.1e-4**, VAE encode/decode **3.5e-3 / 7.2e-3**, RNG/scheduler **0.0**.

## Usage

```swift
import SeedVR2MLX   // github.com/xocialize/seedvr2-mlx-swift
let upscaler = try SeedVR2Upscaler(directory: weightsDir)   // downloaded from this repo
let out = upscaler.upscale(processedImage: img, seed: 42)   // [-1,1], dims padded to /16
```

(Preprocess — resolution/softness bicubic resize — and LAB color-correction are host-side;
VAE tiling for large images is handled by the host, e.g. ForgeUpscaler's tile processor.)

## Provenance & license

Chain: **ByteDance Seed** — *SeedVR2: One-Step Video Restoration via Diffusion Adversarial
Post-Training* (ICLR 2026), [ByteDance-Seed/SeedVR](https://github.com/ByteDance-Seed/SeedVR),
**Apache-2.0** → PyTorch fp16 redistribution [`numz/SeedVR2_comfyUI`](https://huggingface.co/numz/SeedVR2_comfyUI)
→ MLX reference impl [`filipstrand/mflux`](https://github.com/filipstrand/mflux) → MLX-Swift
port + weight conversion by **MVS Collective (xocialize)**. These are format-converted weight
artifacts (not a new model); Apache-2.0 applies. Credit ByteDance Seed (original), cite the paper.