Wrap CrossEntropyLoss in callable that makes it appplicable to sequences 4d8fdac PeteBleackley commited on Oct 9, 2023
PyTorch implementation of HugggingFace PreTrainedModel class does not allow direct setting of base_model. Rejig constructors accordingly 519dfd1 PeteBleackley commited on Oct 9, 2023
Use dictionary coprehensions to do padding and return dictionaries instead of default dictionaries 7a61dc8 PeteBleackley commited on Oct 6, 2023