Why weight avg instead of lora merge?

by totally-not-an-llm - opened Jul 11, 2023

Discussion

totally-not-an-llm

Jul 11, 2023

Airoboros is a qlora, why not just merge the lora into chronos?

Henk717

Owner Jul 11, 2023

Partly it was the tools I am familiar with, and partly that I did not notice it was a qlora. But in this case the merge ratio is 75% chronos, so it is not just applying the lora. I tried varying percentages and settled on one I liked.
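To make the difference concrete, here is a toy sketch in plain Python. It assumes both models share one base weight and that each fine-tune can be written as that base plus a delta; that framing, the variable names, and all the numbers are made up for illustration and are not taken from the actual models.

```python
# One scalar stands in for a whole weight tensor; every value here is hypothetical.
base = 1.0            # shared base-model weight (assumption)
chronos_delta = 0.5   # change introduced by the chronos fine-tune (hypothetical)
lora_delta = -0.25    # change the airoboros qlora would add (hypothetical)

chronos = base + chronos_delta
airoboros = base + lora_delta

# Applying the lora directly onto chronos adds the full lora delta.
lora_applied = chronos + lora_delta

# Averaging at 75% chronos keeps 75% of the chronos delta
# and only 25% of the lora delta.
averaged = 0.75 * chronos + 0.25 * airoboros

print(lora_applied, averaged)
```

Under these assumptions the two results differ, which is why picking a percentage is a separate knob from simply applying the lora.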

7erminalVelociraptor

Jul 11, 2023

Hijacking this thread a bit, but speaking of merging models, can you share the method or script you used to accomplish this? I'm trying to do something similar and just can't get it working right for some reason.

Henk717

Owner Jul 11, 2023

Scripts are here: https://github.com/ontocord/MDEL/tree/main/Model%20Merge%20And%20Analysis%20Tools

For this model I used the Enhanced Merger, not the more advanced ones that let you do individual layers. The script variable was edited to merge with 0.75, and airoboros was selected as the first model.
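For anyone who wants the general idea without the scripts, a weighted average of two models can be sketched roughly as below. This is not the Enhanced Merger's code: the function name, the parameter name, and the use of plain Python lists in place of tensors are all assumptions for illustration. How the script's 0.75 variable maps onto the first or second model is not shown here; the sketch simply follows the "75% chronos" figure from this thread.

```python
def weighted_average(first, second, first_weight):
    """Blend two state dicts: first_weight * first + (1 - first_weight) * second.

    Hypothetical helper; plain lists of floats stand in for weight tensors.
    """
    if first.keys() != second.keys():
        raise ValueError("models must have the same parameter names")
    merged = {}
    for name in first:
        a, b = first[name], second[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for {name}")
        # Element-wise linear interpolation between the two models.
        merged[name] = [
            first_weight * x + (1.0 - first_weight) * y for x, y in zip(a, b)
        ]
    return merged


# Tiny made-up "models" with a single parameter each.
chronos = {"layer.weight": [1.0, 2.0]}
airoboros = {"layer.weight": [3.0, -2.0]}

merged = weighted_average(chronos, airoboros, first_weight=0.75)
print(merged)
```

Both models need the same architecture and parameter names for this to make sense, which is why the sketch raises an error on a mismatch instead of guessing.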

7erminalVelociraptor

Jul 11, 2023

Thank you, that is very helpful.
