Gradient Checkpointing for HF Trainer

#23

by acon96 - opened Dec 16, 2023

←

Wire up existing checkpoint logic to work with transformers Trainer

Looking forward to the merge of this PR！

It would be great if you could merge this

Microsoft org Jan 9, 2024

Hello everyone!

Regards,
Gustavo.

gugarosa changed pull request status to closed Jan 9, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment