Spaces:
Runtime error
Runtime error
Commit History
feat: scan layers + gradient checkpointing (#161) 07a6f9a unverified
Merge branch 'main' of https://github.com/borisdayma/dalle-mini into main bcd360f
feat(text): support emojis (#154) 7ef7bd9 unverified
fix: smelu 7f2f8ed
fix: sinkformer 2c583b3
fix: support smelu a2dcee4
feat: allow relative position (#156) 769d20a unverified
feat: sinkhorn in lse mode (#155) 00d4661 unverified
fix: sinkformer gradient eed4896
feat(model): allow bias (#152) 361a994 unverified
feat: add sinkformer + custom final ln + pre-ln (#151) f139b0b unverified
feat: placeholders for more config 69bcbeb
feat: force final ln in encoder 32f4ba5
feat: allow more configurations 5bd4c20
fix: DeepNet doesn't scale weights of embedding/output layers (#150) 503d6b4 unverified
Shuming Ma Shuming Ma commited on
feat: remove unecessary LN 02824a7
feat: add cogview 472c4cc
fix(textnormalizer): consider utf8 on windows (#148) 3b8d8cb unverified
illtellyoulater commited on
feat: implement transformer variants (#144) 542378c unverified
feat(data): super conditioning (#141) 7939874 unverified
feat: support pod (#139) 803ccbf unverified
feat: handle gradient checkpointing 5173ec7
feat: load from bucket 1c4e839
feat: reduce artifact space + offset step 34cf91c
feat: restore weights on CPU 5f954fc
fix: position embedding for generate method ebac379
fix: typo 68cc185
fix: load from checkpoint 44b7c3e
feat(modeling): simplify abstract_init fa72aa7
feat: handle model parallel 1bb3269
style(tokenizer): remove unused variables 605df32
feat: use fast tokenizer 767d78a
feat(train): distributed_shampoo with pjit cc34d07
feat(model): clean way to load on cpu 12f323d
feat(train): no batch dimension with pjit df1fe19
feat: use_artifact if run existing a5ed112
Style (isort). f9d51f7
Pedro Cuenca commited on
Tokenizer, config, model can be loaded from wandb. 7e48337
Pedro Cuenca commited on
Change import order again. 2b2be9b
Pedro Cuenca commited on
Fix import order to make isort happy. 64d99b2
Pedro Cuenca commited on
Accept changes suggested by linter. 9f522b8
Pedro Cuenca commited on
Never consider local dirs as remote wandb references. 08dd098
Pedro Cuenca commited on
Store resolved path after loading model. 55a631d
Pedro Cuenca commited on
Override from_pretrained to support wandb artifacts. 1023afa
Pedro Cuenca commited on