Can you convert Qwen 3.5 9b?

#3
by Regrin - opened

Hello!
I really like the RWKV architecture, but RWKV-7 7.2b produces very poor results.
I was interested in this converted model. But here's the problem: it's based on Qwen 2.5, which is a very outdated model.
Maybe you can convert Qwen 3.5 9b?

Although, it's better not to do that. You'd better wait until Qwen 3.6 9b comes out and convert it. It will be better that way.

I would be very grateful!
I would also like you to make the RWKV hidden state larger, if possible. RWKV-7 7.2b has about 16 megabytes, and I think that's a bit small. It doesn't hold context well...

Thanks in advance!

Sign up or log in to comment