Use correct `gelu` function

by ybelkada - opened Nov 16, 2022

base: refs/heads/main ← from: refs/pr/5


ybelkada · Nov 16, 2022 · edited Nov 16, 2022

The previous config file used the `gelu` function instead of `gated-gelu`, which is set automatically when `is_gated_act` is forced to `True`, more specifically here
This is not a breaking change, since the fix only affects inference. Users who trained a model with `gelu` instead of `gated-gelu` should not be affected by this change. Note that using `gated-gelu` instead of `gelu` can give slightly different qualitative results, but it does not affect the overall performance of the model.
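To give a rough sense of why the outputs differ only slightly, here is a minimal sketch in plain Python. It assumes the two activations in question are the exact GELU and its tanh approximation, and that the gated feed-forward multiplies the activated branch by a second linear branch. The function names and scalar "weights" are hypothetical and for illustration only; this is not the model's actual code.

```python
import math


def gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def gelu_tanh(x: float) -> float:
    """Tanh approximation of GELU (assumed here to be the other variant)."""
    inner = math.sqrt(2.0 / math.pi) * (x + 0.044715 * x ** 3)
    return 0.5 * x * (1.0 + math.tanh(inner))


def gated_ff(x: float, w_gate: float, w_lin: float, w_out: float, act=gelu) -> float:
    """Gated feed-forward with scalar weights: act(x * w_gate) * (x * w_lin) * w_out."""
    return act(x * w_gate) * (x * w_lin) * w_out


# Compare the two activations inside the same gated block (hypothetical weights).
for x in (-2.0, -0.5, 0.5, 2.0):
    exact = gated_ff(x, 1.0, 0.5, 1.0, act=gelu)
    approx = gated_ff(x, 1.0, 0.5, 1.0, act=gelu_tanh)
    print(f"x={x:+.1f}  exact={exact:+.6f}  tanh={approx:+.6f}  diff={abs(exact - approx):.2e}")
```

Under these assumptions the two variants agree closely at every input, which is consistent with the description above: small numerical differences at inference, without a change in overall quality.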

Use correct `gelu` function (ff1a6e9e)

ybelkada changed pull request status to merged Nov 16, 2022
