You know that already but

#2
by SerialKicked - opened

This is probably the best merge of (non-CoT) MS models I've come across. I'm a bit late commenting on your model, but it's impressively good. I normally don't evaluate merges unless they've been fine-tuned on top, but this one is too good to pass up.

It keeps most (if not all) of the advantages of MS (good instruction following, task-oriented queries), while still being good as a general chat partner. It passed my personal battery of tests (a bunch of Q&A + haystack + function calling + menu navigation + general text manipulation). Of course it's not exempt from the usual stylistic issues of MS models, but damn, your merge should definitely be used as a base for further improvements, as it has a really good balance of intelligence and creativity.

I'd be really curious to see what you could make happen with more recent model bases (Hermes 4.3 and Olmo).
