Post: Good news, llama.cpp seems to be close to supporting MTP (multi-token prediction) on Qwen models. Bad news, every single GGUF will have to be redone when it lands.
Post: I'll never understand why people merge reasoning models with non-reasoning models. It's worse every time. You have to train reasoning models on reasoning data, and merge reasoning with reasoning.
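For what it's worth, "merge reasoning with reasoning" usually means interpolating the weights of two same-architecture checkpoints. Here's a minimal sketch of a plain linear merge, using toy Python dicts in place of real tensor state dicts (the layer names and values are made up for illustration; this isn't any particular tool's API):

```python
def linear_merge(state_a, state_b, alpha=0.5):
    """Linearly interpolate two model state dicts, parameter by parameter.

    Both models must share an architecture (same keys, same shapes).
    Merging a reasoning model with a non-reasoning sibling still passes
    this structural check, but the learned behaviors can interfere,
    which is the complaint above.
    """
    assert state_a.keys() == state_b.keys(), "architectures must match"
    return {k: alpha * state_a[k] + (1 - alpha) * state_b[k] for k in state_a}

# Toy "state dicts" with scalar parameters standing in for weight tensors.
reasoning_a = {"layer0.weight": 1.0, "layer1.weight": 3.0}
reasoning_b = {"layer0.weight": 2.0, "layer1.weight": 1.0}
merged = linear_merge(reasoning_a, reasoning_b, alpha=0.5)
print(merged)  # {'layer0.weight': 1.5, 'layer1.weight': 2.0}
```

Real merge tooling (SLERP, TIES, etc.) is more involved, but the same constraint holds: the checkpoints being merged should share the behavior you care about.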
Post: arcee-ai/Virtuoso-Lite is really good. That's all, lol.
Reply: If this is prompting with special code, forgive my dumb question: how does one turn that into a usable fine-tune? By using the layers to mass-generate DPO pairs for a detox dataset?
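One common shape for that idea: run each prompt through the model twice, once with the detox steering applied (the "chosen" response) and once without (the "rejected" response), then write the triples out as a DPO-style JSONL dataset. A minimal sketch, where `generate_steered` and `generate_baseline` are hypothetical stand-ins for whatever the actual prompting/layer trick is:

```python
import json

def generate_steered(prompt):
    # Placeholder: stand-in for generation WITH the detox steering applied.
    return f"[detoxified reply to: {prompt}]"

def generate_baseline(prompt):
    # Placeholder: stand-in for plain generation WITHOUT steering.
    return f"[raw reply to: {prompt}]"

def build_dpo_pairs(prompts):
    """Yield DPO-style records: steered output as 'chosen', raw as 'rejected'."""
    for prompt in prompts:
        yield {
            "prompt": prompt,
            "chosen": generate_steered(prompt),
            "rejected": generate_baseline(prompt),
        }

prompts = ["Tell me about my coworker.", "Summarize this argument."]
with open("detox_dpo.jsonl", "w") as f:
    for record in build_dpo_pairs(prompts):
        f.write(json.dumps(record) + "\n")
```

The resulting JSONL follows the prompt/chosen/rejected layout that DPO trainers commonly accept; from there it's a standard preference-tuning run rather than anything specific to the steering trick.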