Update TF weights

by joaogante - opened Jun 25, 2022

base: refs/heads/main

←

from: refs/pr/2

Discussion Files changed

-2

joaogante

Jun 25, 2022

Model converted by the transformers' pt_to_tf CLI.

All converted model outputs and hidden layers were validated against its Pytorch counterpart. Maximum crossload output difference=1.465e-03; Maximum converted output difference=1.465e-03.

Update TF weightsd5efb835

joaogante

Jun 25, 2022

•

edited Jun 25, 2022

@patrickvonplaten the weights I've uploaded before were built with an MVP of the pt-to-tf CLI, which was not converting (or checking) the model head. These weights have the model head converted properly.

Merging this PR unblocks the following GH PR. After we confirm that these weights unblock the PR above (through passing tests), we can push the conversion for other XGLM model sizes.

cc @Stancld

patrickvonplaten changed pull request status to merged Jun 25, 2022

patrickvonplaten

Jun 25, 2022

Thanks @joaogante

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment