Leandro von Werra PRO
AI & ML interests
NLP and RL
Recent Activity
new activity about 3 hours ago
attention-wiki/knowledge-base:Add source: NoPE — positional encoding & length generalization (arxiv:2305.19466) new activity about 3 hours ago
attention-wiki/knowledge-base:Add source: H2O — Heavy-Hitter KV-cache eviction (arxiv:2306.14048) new activity about 3 hours ago
attention-wiki/knowledge-base:Add source: S4 (structured state spaces) + claims + state-space-hybrids page