Cleaned up code; added multi-seed training wrapper and PyTorch profiler training option; updated Gradio demo; revised research paper to match the new training techniques and results; architecture.md now explains all design decisions
90a2698
OliverPerrin committed
Added many improvements based on feedback from others
0d858b5
OliverPerrin committed
Clean up codebase and fix training bugs
1601799
OliverPerrin committed
Update LexiMind: improved training, model architecture, and evaluation
076bc18
OliverPerrin committed
Full training run, code cleanup, mypy/ruff fixes
a47e5cf
OliverPerrin committed
feat: Add FLAN-T5 compatibility with relative position bias