Lumatrix
/

prx-pixel-fp8

custom-inference

Model card Files Files and versions

prx-pixel-fp8 / README.md

Lumatrix's picture

Update README.md

45d08d0 verified 21 days ago

|

History Blame Contribute Delete

2.74 kB

	---
	tags:
	- text-to-image
	- comfyui
	- prxpixel
	- fp8
	- safetensors
	- custom-inference
	pipeline_tag: text-to-image
	base_model:
	- Photoroom/prxpixel-t2i
	---

	# PRX Pixel FP8 Mixed Transformer

	This repository contains a single-file PRX Pixel transformer checkpoint converted from `Photoroom/prxpixel-t2i` into a mixed fp8 safetensors format for the `lumina_prx_pixel` custom ComfyUI runtime.

	## What This File Is

	- This is not the full original model repo.
	- This file contains the transformer only.
	- The tokenizer and text encoder / clip side should still be loaded from the original base model source:
	`Photoroom/prxpixel-t2i`

	## Mixed FP8 Means

	This checkpoint uses a mixed fp8 storage recipe, not full end-to-end fp8 for every tensor.

	- Large transformer projection weights are stored in `float8_e4m3fn`
	- More sensitive tensors stay in higher precision
	- Runtime is mixed precision
	fp8 weights are used for selected large linear layers, while activations / outputs remain in bf16 or the loader compute dtype

	In other words:

	- weight storage is reduced for selected heavy layers
	- compute is still mixed, not pure fp8 everywhere

	## Intended Runtime

	This file is intended for:

	- `lumina_prx_pixel`
	- `Load PRX Pixel model`
	- `Load prx clip model only`

	from the matching custom ComfyUI node pack.

	It is not meant for stock diffusers loading.

	## How To Use In ComfyUI

	1. Place the `.safetensors` file inside `ComfyUI/models/diffusion_models` or `ComfyUI/models/unet`
	2. Load it with `Load PRX Pixel model`
	3. Load the text side with `Load prx clip model only`
	4. Set that clip source to `Photoroom/prxpixel-t2i`
	5. Connect both into `lumina_prx_pixel`

	## Runtime Notes

	- For the real fp8 matmul path, you want a CUDA setup with fp8 support in PyTorch
	- On unsupported backends, the runtime may fall back to dequantized math for execution
	- This is still useful for packaging and selective storage reduction, but the best VRAM win is on supported CUDA fp8 paths

	## Conversion Notes

	This file was produced from the original PRX Pixel transformer shards and packed into a custom single-file format used by the local runtime.

	The embedded metadata includes:

	- transformer config
	- scheduler config
	- PRX Pixel generation settings
	- fp8 scale metadata for converted layers

	## Limitations

	- This is a converted checkpoint, not an original upstream release artifact
	- Output quality and numerical behavior can differ slightly from the source bf16 weights
	- Compatibility is tied to the custom `lumina_prx_pixel` runtime that knows how to read this format

	## Credit

	- Base model: `Photoroom/prxpixel-t2i`
	- FP8 mixed single-file packaging and runtime support: `Lumnatrix`