Add model card, link to paper and GitHub repository
#1
by nielsr HF Staff - opened
README.md
ADDED
|
@@ -0,0 +1,66 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
pipeline_tag: unconditional-image-generation
|
| 3 |
+
---
|
| 4 |
+
|
| 5 |
+
# Parallel Rollout Approximation (PRA)
|
| 6 |
+
|
| 7 |
+
This repository contains the weights for **Parallel Rollout Approximation (PRA)**, a scalable framework for class-conditional pixel-space autoregressive image generation.
|
| 8 |
+
|
| 9 |
+
More details can be found in the paper [Parallel Rollout Approximation for Pixel-Space Autoregressive Image Generation](https://huggingface.co/papers/2606.27978).
|
| 10 |
+
|
| 11 |
+
* **Repository:** [GitHub Repository](https://github.com/MangataX/PRA)
|
| 12 |
+
* **Paper:** [arXiv:2606.27978](https://huggingface.co/papers/2606.27978)
|
| 13 |
+
|
| 14 |
+
## Model Description
|
| 15 |
+
|
| 16 |
+
Parallel Rollout Approximation (PRA) is a pixel-space continuous-token autoregressive (AR) generation model. PRA generates low-dimensional intermediate states instead of high-dimensional pixel patches, mapping them back to pixel-space tokens with a pixel decoder. It effectively mitigates error accumulation during autoregressive steps by approximating the pixel-feedback interface encountered during inference-time rollout while retaining parallel teacher-forced training.
|
| 17 |
+
|
| 18 |
+
## Model Checkpoints
|
| 19 |
+
|
| 20 |
+
The following checkpoints are available:
|
| 21 |
+
|
| 22 |
+
| Model | Params | FID (256x256) | Weight |
|
| 23 |
+
|:---:|:---:|:---:|:---:|
|
| 24 |
+
| PRA-S | 135M | 2.58 | [PRA_S.pt](https://huggingface.co/MangataX/PRA/blob/main/PRA_S.pt) |
|
| 25 |
+
| PRA-B | 250M | 2.21 | [PRA_B.pt](https://huggingface.co/MangataX/PRA/blob/main/PRA_B.pt) |
|
| 26 |
+
| PRA-L | 511M | 1.94 | [PRA_L.pt](https://huggingface.co/MangataX/PRA/blob/main/PRA_L.pt) |
|
| 27 |
+
|
| 28 |
+
## Environment & Usage
|
| 29 |
+
|
| 30 |
+
For environment setup, training, and evaluation scripts, please refer to the official [GitHub Repository](https://github.com/MangataX/PRA).
|
| 31 |
+
|
| 32 |
+
### Sampling Example
|
| 33 |
+
|
| 34 |
+
You can run distributed class-balanced sampling using the `sample_ddp.py` script provided in the repository:
|
| 35 |
+
|
| 36 |
+
```shell
|
| 37 |
+
ckpt=your_ckpt_path
|
| 38 |
+
sample_dir=your_result_path
|
| 39 |
+
|
| 40 |
+
torchrun --nnodes=1 --nproc_per_node=4 --node_rank=0 \
|
| 41 |
+
sample_ddp.py \
|
| 42 |
+
--ckpt $ckpt \
|
| 43 |
+
--sample-dir $sample_dir \
|
| 44 |
+
--model PRA-L \
|
| 45 |
+
--image-size 256 \
|
| 46 |
+
--patch-size 16 \
|
| 47 |
+
--latent-dim 16 \
|
| 48 |
+
--cfg-scale 4.1 \
|
| 49 |
+
--sample-steps 100 \
|
| 50 |
+
--sampler euler_maruyama \
|
| 51 |
+
--per-proc-batch-size 200 \
|
| 52 |
+
--sample-mask-rate 0.9 \
|
| 53 |
+
--token-mask-rate 0.5 \
|
| 54 |
+
--save-png
|
| 55 |
+
```
|
| 56 |
+
|
| 57 |
+
## Citation
|
| 58 |
+
|
| 59 |
+
```bibtex
|
| 60 |
+
@article{xu2026parallel,
|
| 61 |
+
title={Parallel Rollout Approximation for Pixel-Space Autoregressive Image Generation},
|
| 62 |
+
author={Xu, Jiayi and He, Di and Ke, Guolin},
|
| 63 |
+
journal={arXiv preprint arXiv:2606.27978},
|
| 64 |
+
year={2026}
|
| 65 |
+
}
|
| 66 |
+
```
|