[Solved] "Undertrained" Loras

by CreatorHuggins - opened Jan 25

Jan 25

Hi! I would like to draw attention to an issue that has already been raised on Reddit. It seems that some (hopefully not all) Loras are not sufficiently trained. I have already been able to confirm this with a few Loras. It seems that the further away they are, the less recognizable some characters become as they approach the base checkpoint. For example, the character becomes an Asian character. This circumstance then disappears again in portraits. To support my thesis, I used the same prompt to generate an image of Jennifer Aniston with your Lora and an image with Malcom's Lora. I think the differences are obvious.

Malcom:

nphSi

Owner Jan 25

Can you please share the prompt and settings so i can reproduce this?

CreatorHuggins

Jan 25

Can you please share the prompt and settings so i can reproduce this?

Sure buddy,

photo realistic, high quality photo, realistic, full length photograph, 16k, HDR, uber detailed.
A woman is wearing a pink long-sleeved high-collar fuzzy sweater, red with black dots flannel mini-skirt and red stiletto strappy sandals. She has gold earrings and carries a red leather purse. She is on the balcony of a high rise Manhattan apartment building looking out at night over Times Square. The balcony has glass walls. Ultra-detailed environment, natural textures,
vibrant lighting, clean architectural lines.

Euler Beta, 8 Steps, CFG 1

nphSi

Owner Jan 25

•

edited Jan 25

You need to use the full lora name in the prompt. I caption in this way to reduce character bleeding/merging. Try this:

photo realistic, high quality photo, realistic, full length photograph, 16k, HDR, uber detailed.
Jennifer Aniston (vrtlJenniferAniston) is wearing a pink long-sleeved high-collar fuzzy sweater, red with black dots flannel mini-skirt and red stiletto strappy sandals. She has gold earrings and carries a red leather purse. She is on the balcony of a high rise Manhattan apartment building looking out at night over Times Square. The balcony has glass walls. Ultra-detailed environment, natural textures,
vibrant lighting, clean architectural lines.

CreatorHuggins

Jan 25

Ahh! Much better. Thanks for the info. Does that apply to all Loras? So first and last name? And what does that mean in brackets: vrtlJenniferAniston (vrtl)

nphSi

Owner Jan 25

(vrtlxxxxx) is my unique token for every lora i make. Everything the model learns is associated with that token. The base model itself knows a bit about i.e. Jennifer Aniston, so using Jennifer Aniston (vrtlJenniferAniston) blends base model knowledge and lora knowledge together while "Woman" still is the standard asian of the z-image checkpoint.

nphSi

Owner Jan 25

My version:

Frontal photo of Jennifer Aniston (vrtlJenniferAniston) wearing pink cashmere pullover and red tartan patterned mini skirt, red high heel sandals, red handbag, standing on the balcony in the fifth floor of a skyscraper, a city with skyscrapers at dark night in the background, looking down at the street, neon signs and steamy air

nphSi changed discussion title from "Undertrained" Loras to [Solved] "Undertrained" Loras Jan 25

CreatorHuggins

Jan 25

(vrtlxxxxx) is my unique token for every lora i make. Everything the model learns is associated with that token. The base model itself knows a bit about i.e. Jennifer Aniston, so using Jennifer Aniston (vrtlJenniferAniston) blends base model knowledge and lora knowledge together while "Woman" still is the standard asian of the z-image checkpoint.

So with token you mean "trigger" word or is it something else? Just curious as I'm training loras myself using the yaml file from malcom. I often have the problem that if there are more than 2 people in the image, the lora start bleeding which means for me that the second person looks like the lora character. Is that what you meant? I always thought this was something that couldn't be changed. Or do you mean something else with bleeding?

nphSi

Owner Jan 25

•

edited Jan 25

Thats exactly what i mean. And yes token means trigger word. Its not a 100% fix for the same face problem but it helps. The models are not well trained for different unique people interacting. Good prompting can help too. Current models really really want it described precisely.

scruffynerf

Jan 29

Just to be clear: Malcolm's loras have a pretty standard trigger that is the same on ALL of his loras. There is a real benefit to doing it the way @nphSi does it, if you want to combine people in one image. (Honestly, you still need to mask and/or use some sort of regional/mask conditioning, but unique triggers are better.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment