Details about expansion

by icoderzqliu - opened Mar 19, 2024

Mar 19, 2024

Hello, you mentioned 'After width expansion, there was a significant decline in the model's performance' in the blog, I would like to know some details about the width expansion, is it achieved by expanding the dimensions of the hidden layer? Or what method? Thank you!

itsliupeng

01-ai org Mar 21, 2024

Inspiration may be drawn from the insights presented in these two articles https://arxiv.org/abs/2112.11446， https://arxiv.org/abs/2110.07143

itsliupeng changed discussion status to closed Mar 21, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment