Commit History

fix: tie_weights accepts kwargs for transformers 1.21+
a6288c7
verified

harims95 committed on

fix: keep all_tied_weights_keys property pointing to _tied_weights_keys dict
2793b55
verified

harims95 committed on

fix: _tied_weights_keys as dict, add tie_weights method
3d8ead4
verified

harims95 committed on

fix forward: always return logits, accept attention_mask, ignore_index=-100 for SFT
11d39da
verified

harims95 committed on

expand model card with full architecture details and Parcae journey
6aea7c7
verified

harims95 committed on

fix: all_tied_weights_keys returns dict for transformers 1.21+
28272f2
verified

harims95 committed on

fix: add all_tied_weights_keys property for transformers 1.21+
f06477d
verified

harims95 committed on

fix: add _tied_weights_keys and output embedding hooks
17391db
verified

harims95 committed on

Initial release: LoopLM-135M-naive trained on FineWeb 4.6B tokens
12f0a98
verified

harims95 committed on

initial commit
dd6bcaf
verified

harims95 committed on