Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
FlameF0X 
posted an update 1 day ago
Post
66
Greetings Hugging Face!

I started a new project called **FWKV** (Feed-forward Weighted Key Value, or Floored Weighted Key Value), a RWKV-style LM that uses FFNNs (Feed-Forward Neural Networks) instead of RNN and floor(W·K·V). I'm hoping to make it much more efficient and scalable than RWKV.

So far I have:

- FlameF0X/FWKV-29M — this one is undertrained and doesn't have a Space yet. In the attached image you can see its speed on a T4 compared to models with the same configuration.

The only model that's fully working right now is:
- FlameF0X/FWKV-TinyStories — trained on TinyStories for one epoch. The demo Space is FlameF0X/FWKV-demo.

That's really interesting, very impressive results! Is there a paper or a blog post about the methodology?

·

Not yet. I'm still experimenting.
Once I get something that I'm pleased with I'm going to write a blog.