Models for the paper LogitRouter: a novel Attention variant for reducing Myopic Routing in Mixture of Experts