Instructions to use fal/AuraFlow with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
  - Diffusers
How to use fal/AuraFlow with Diffusers:
```bash
pip install -U diffusers transformers accelerate
```

```python
import torch
from diffusers import DiffusionPipeline

# switch to "mps" for Apple devices
pipe = DiffusionPipeline.from_pretrained(
    "fal/AuraFlow",
    dtype=torch.bfloat16,
    device_map="cuda",
)

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]
```

- Inference Providers
- Notebooks
  - Google Colab
  - Kaggle
- Local Apps
  - Draw Things
  - DiffusionBee
sub 16GB and sub 8GB versions
Sub-16GB and sub-8GB versions would be really nice in the democratic & friendly OSS spirit.

Also, a code example with accelerate and CPU offloading in the model card would be nice, and one with different text encoders too (when they're available).
thx
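For reference, here is a minimal sketch of what an accelerate/CPU-offloading example could look like, using diffusers' built-in `enable_model_cpu_offload()` (which relies on accelerate under the hood). The model name and prompt come from the snippet above; the rest is an assumption about how such an example might be written, not an official recommendation.

```python
import torch
from diffusers import DiffusionPipeline

# Load in bfloat16, but leave the pipeline on the CPU initially.
pipe = DiffusionPipeline.from_pretrained("fal/AuraFlow", dtype=torch.bfloat16)

# Move each submodule to the GPU only while it is running, then back to CPU.
# Requires `accelerate` to be installed; trades speed for much lower peak VRAM.
pipe.enable_model_cpu_offload()

prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k"
image = pipe(prompt).images[0]
```

For even tighter memory budgets, `enable_sequential_cpu_offload()` offloads at the parameter level at a larger speed cost. Swapping text encoders would presumably be a matter of passing a `text_encoder=` override to `from_pretrained()` once alternatives exist, but that part is speculation until the model card documents it.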
I'm hoping they make a <=2B version of the model so we can compare it fairly with other existing models, run it on low-end hardware, and maybe even run it on mobile one day with some quantization and optimizations.
And since this is supposed to be truly open source, releasing various epochs along the way would be nice for those planning on finetuning on top of it. Just my guess, but trying to make an anime-finetuned version of the model using the last epoch would probably not offer the best results. Who knows, though.
I'm going to try to use my custom code to convert it to 8-bit, which would cut the required VRAM in half. I'll report back to this thread if I'm successful.
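To illustrate the idea (a generic sketch, not the poster's custom code): symmetric per-tensor int8 weight quantization stores one byte per weight instead of two for bf16/fp16, which is where the roughly 2x VRAM saving comes from. The helper names below are hypothetical.

```python
import torch

def quantize_int8(weight: torch.Tensor):
    # Symmetric per-tensor quantization: one int8 tensor plus a single scale.
    scale = weight.abs().max().clamp(min=1e-8) / 127.0
    q = torch.round(weight / scale).clamp(-127, 127).to(torch.int8)
    return q, scale

def dequantize_int8(q: torch.Tensor, scale: torch.Tensor, dtype=torch.bfloat16):
    # Recover an approximate float tensor for use in the forward pass.
    return q.to(dtype) * scale

# Example: a bf16 layer weight drops from 2 bytes to ~1 byte per parameter.
w = torch.randn(4096, 4096, dtype=torch.bfloat16)
q, s = quantize_int8(w.float())
w_approx = dequantize_int8(q, s)
```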
One of the potential improvements we may make down the road is a smaller model. Stay tuned for updates. In the meantime, good luck with all the quantization work, and let us know if you need any help!