Submitted by
Feng-Ting Liao
MediaTek Research (verified company)
AI & ML interests
None defined yet.
Papers
Revisiting the Shape Convention of Transformer Language Models
Rethinking the shape convention of an MLP