Text generation quality is low with diffusers

by perk11 - opened Jan 14

Discussion

perk11

Jan 14

Running the test diffusers code, I'm getting this:

Even simpler examples seem to have errors in the text:

tengjiayan

Z.ai org Jan 14

Because the model uses an autoregressive generation structure, it exhibits greater diversity compared to diffusion models. Therefore, you can try generating multiple images, and occasional random errors in individual letters are within normal expectations.
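The "generate multiple images" advice can be sketched roughly as below. This is a minimal sketch, not the model's official usage: `sample_candidates` and the `generate` callable are hypothetical names, and the commented wiring assumes an already-loaded diffusers pipeline `pipe` that accepts a `generator` argument, which may not hold for every pipeline.

```python
from typing import Any, Callable, Dict, Iterable


def sample_candidates(
    generate: Callable[[str, int], Any],
    prompt: str,
    seeds: Iterable[int],
) -> Dict[int, Any]:
    """Call `generate(prompt, seed)` once per seed and return {seed: result}."""
    results: Dict[int, Any] = {}
    for seed in seeds:
        # Each seed gives an independent sample, so a misspelled letter in one
        # image does not have to appear in the others.
        results[seed] = generate(prompt, seed)
    return results


# Hypothetical wiring to a diffusers pipeline (assumption: `pipe` is already
# loaded and its __call__ accepts `prompt` and `generator`):
#
#   import torch
#
#   def generate(prompt, seed):
#       g = torch.Generator(device="cuda").manual_seed(seed)
#       return pipe(prompt=prompt, generator=g).images[0]
#
#   images = sample_candidates(generate, 'A shop sign that says "OPEN"', range(4))
#   for seed, image in images.items():
#       image.save(f"candidate_{seed}.png")
```

Keeping the seed alongside each image makes it possible to reproduce whichever candidate renders the text correctly.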

perk11

Jan 14

I expected better text quality based on benchmark results, but completely understandable. Thank you for the explanation.

krigeta

Jan 15

> Because the model uses an autoregressive generation structure, it exhibits greater diversity compared to diffusion models. Therefore, you can try generating multiple images, and occasional random errors in individual letters are within normal expectations.

How can we train LoRAs for this model? And when will the optimisations be released?
