21 Qwen 3.5 fine-tunes (thinking and instruct), regular and uncensored (2B to 27B), that exceed the benchmarks of, and work better than, the original models.
All are benchmarked against the original model, and many exceed it on every benchmark. Includes Claude, GLM, Gemini and other distills, in both thinking and dedicated instruct versions.
Core goals: raise benchmark scores and cut down long thinking blocks.
Highlights:
The 9B and 27B instruct "Claude" versions hit 624 and 675, respectively, on "ARC-C" (the hard challenge set).
Thinking fine-tunes exceed the original model's performance (in thinking mode).
In many cases there is a drastic reduction in thinking block size.
I recently open-sourced opensoul, an AI emotional companion product built on openclaw.
On this platform, you can create a "soulmate" that matches your personality and configure the skills and tools it has, as well as the platforms it integrates with (such as Telegram, Discord, etc.). You can also create group chats and invite multiple agents and your friends to talk about recent events, discuss projects together, and so on.
I hope it can be a better companion in daily life through its memory mechanism, its self-feedback and iteration mechanism, and its modeling of the user's emotions. I also hope it can help you with your work through its skills, tools, and ability to handle complex tasks.
The product has taken shape, but many areas still need adjustment and optimization. I hope to draw on the strength of the community to get AI emotional companionship right.