Thoughts

by HAV0X1014 - opened Mar 9

Mar 9

I like this model! Its writing style isn't terrible and doesn't make me roll my eyes every time I see a repeated line. I use it with thinking disabled (--reasoning-budget 0 in llama.cpp) and I've noticed that it ignores the first prompt the user sends sometimes. I'm not sure if that's a qwen thing, a finetune thing, or a lack of thinking thing.

I like that it can behave as a character and actually be helpful for coding. I've used some mistral 24b models that could write really well but couldn't do any assistant tasks whatsoever. The qwen 3.5 base really helps here.

I think a full-er finetune on more character + assistant tasks would make this even better, if that's how it works.

zerofata

Owner Mar 12

Thank you for the feedback!

There's definitely more potential in the model than what this current finetune shows, how exactly to draw that out I'm still figuring out how to do though.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment