Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
FlameF0X 
posted an update 1 day ago
Post
78
I did some testing on the scalability of FWKV. It hits a speed bottleneck at 1B due to the T4’s bandwidth limitations. Theoretically, it should match RWKV’s inference speed if the GPU had more bandwidth. So the 1B size is not accurate.
In this post