Text Generation
Transformers
Safetensors
fwkv

FWKV

FWKV (Feed-forward Weighted Key Value, or Floored Weighted Key Value) is a novel efficient language model architecture proposed on May 13, 2026 by Me.

state = W * state + k * v

Design Philosophy

FWKV is built on two simple ideas:

  • FFNN as the primary compute block — fast, parallelizable, hardware-friendly
  • floor(W·K·V) as the sole context mechanism — a multiplicative recurrent state with a learned decay W floored above zero, so the model can never fully forget

Sample

Inference speed (CUDA -- T4)

image

Training

Factorised tied weights, chunked CE, bf16 training. Best val perplexity: 143.62

Downloads last month
83
Safetensors
Model size
50.6M params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train FlameF0X/FWKV-50M

Space using FlameF0X/FWKV-50M 1

Collection including FlameF0X/FWKV-50M