Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
Quazim0t0 
posted an update 4 days ago
Post
658
Created a causal language model with a non-standard channel-mixing block. It keeps a conventional transformer backbone for token mixing (attention), but replaces the per-layer MLP with a QuazimotoBlock: a bank of coupled phase oscillators (Kuramoto dynamics) arranged in concentric rings, run for a few differentiable Euler steps and read out through [cos θ, sin θ].

Quazim0t0/Positronic-144M
In this post