Leandro von Werra PRO
AI & ML interests
NLP and RL
Recent Activity
new activity 7 minutes ago
attention-wiki/knowledge-base:Add sources: T5, DeBERTa, TUPE — relative & disentangled positional encoding new activity 7 minutes ago
attention-wiki/knowledge-base:Add source: In-context Learning and Induction Heads (arxiv:2209.11895)