Collections
Discover the best community collections!
Collections trending this week
- SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention
  Paper • 2312.07987 • Published • 41
- SubGen: Token Generation in Sublinear Time and Memory
  Paper • 2402.06082 • Published • 11
- Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models
  Paper • 2402.13064 • Published • 50
- SaulLM-7B: A pioneering Large Language Model for Law
  Paper • 2403.03883 • Published • 90