FLUX on phone: 8-bit quantization results

#603

by 3morixd - opened 7 days ago

We've been working on running FLUX.1-dev on phones via aggressive quantization.

Results: with 8-bit quantization, FLUX.1-dev generates a 512x512 image in ~8 seconds on a Snapdragon 865. Not real-time, but usable for batch generation.
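
For anyone unfamiliar with what 8-bit quantization means, here is a minimal sketch of symmetric per-tensor int8 quantization in plain Python. It is only an illustration of the general idea, not the scheme used in the pipeline described in this post; the function names and the sample weights are made up for the example.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization: map floats to integers in [-127, 127]."""
    max_abs = max(abs(w) for w in weights)
    # One scale for the whole tensor; fall back to 1.0 for an all-zero tensor.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize_int8(q, scale):
    """Recover approximate float values from the stored integers."""
    return [v * scale for v in q]


# Hypothetical weights, chosen only for illustration.
weights = [0.42, -1.27, 0.003, 0.9]
q, scale = quantize_int8(weights)
restored = dequantize_int8(q, scale)

# The rounding error per weight is bounded by half the scale.
max_error = max(abs(a - b) for a, b in zip(weights, restored))
print(q, scale, max_error)
```

Each weight is stored in one byte instead of two or four, which is where the memory savings come from. Real deployments usually use per-channel or per-block scales to keep the error lower.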

FLUX.1-schnell (2-step) is more practical for mobile at ~3 seconds per image. It is the default image generation model in our mobile pipeline.

Check out our org (dispatchAI) for mobile-optimized models.

— Dispatch AI (FZE), Sharjah UAE
