bugfix: Update modeling_t5.T5Stack.forward() for Gradient Checkpointing

#2
by Panda-vid - opened

Update checkpoint() call such that parameters for the layer_module object are passed correctly.

plenz changed pull request status to closed

The feature only works with older transformer versions

Sign up or log in to comment