GPIC: A Giant Permissive Image Corpus for Visual Generation
Paper: arXiv:2605.30341
This repository contains the reference baseline models for GPIC: A Giant Permissive Image Corpus for Visual Generation.
GPIC is a large-scale image corpus comprising approximately 28 trillion pixels and 100M training examples, all under permissive licenses. These models provide a reference baseline for pixel-space flow matching trained on this dataset.
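For readers new to the term, the following is a minimal sketch of a flow matching training objective applied directly to pixel values. It is not the code used to train these baselines. It assumes a straight-line path from Gaussian noise (t = 0) to the image (t = 1) and a mean-squared-error loss on the path velocity, which is one common formulation; the formulation in the paper may differ. The function names (`interpolate`, `target_velocity`, `flow_matching_loss`) and the `predict_velocity` callable are hypothetical and exist only for illustration.

```python
import random


def interpolate(noise, image, t):
    """Point on the straight path from noise (t=0) to image (t=1)."""
    return [(1.0 - t) * n + t * x for n, x in zip(noise, image)]


def target_velocity(noise, image):
    """Velocity of the straight path, which is constant in t."""
    return [x - n for n, x in zip(noise, image)]


def flow_matching_loss(predict_velocity, image, rng):
    """Single-sample estimate of the loss for one flattened image.

    predict_velocity is a hypothetical stand-in for a network: it takes
    the noisy pixels x_t and the time t and returns one value per pixel.
    """
    noise = [rng.gauss(0.0, 1.0) for _ in image]  # assumed Gaussian prior
    t = rng.random()                              # t drawn uniformly from [0, 1)
    x_t = interpolate(noise, image, t)
    predicted = predict_velocity(x_t, t)
    target = target_velocity(noise, image)
    # Mean squared error between predicted and target velocity.
    return sum((p - v) ** 2 for p, v in zip(predicted, target)) / len(image)


if __name__ == "__main__":
    rng = random.Random(0)
    toy_image = [0.5, -0.25, 1.0, -1.0]  # flattened pixels scaled to [-1, 1]
    zero_model = lambda x_t, t: [0.0] * len(x_t)  # untrained placeholder
    print(flow_matching_loss(zero_model, toy_image, rng))
```

In a real setup the placeholder would be a neural network operating on image tensors, and the loss would be averaged over a batch of images and time samples.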
@misc{chandrasegaran2026gpic,
  title={GPIC: A Giant Permissive Image Corpus for Visual Generation},
  author={Keshigeyan Chandrasegaran and Kyle Sargent and Suchir Agarwal and Michael Jang and Michael Poli and Juan Carlos Niebles and Justin Johnson and Jiajun Wu and Li Fei-Fei},
  year={2026},
  eprint={2605.30341},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2605.30341},
}