-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 84 -
Small-scale proxies for large-scale Transformer training instabilities
Paper • 2309.14322 • Published • 22 -
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval
Paper • 2309.15129 • Published • 7 -
Vision Transformers Need Registers
Paper • 2309.16588 • Published • 86
Collections
Discover the best community collections!
Collections trending this week
-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 84 -
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)
Paper • 2309.08968 • Published • 24 -
Vision Transformers Need Registers
Paper • 2309.16588 • Published • 86 -
Localizing and Editing Knowledge in Text-to-Image Generative Models
Paper • 2310.13730 • Published • 7
-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 84 -
Small-scale proxies for large-scale Transformer training instabilities
Paper • 2309.14322 • Published • 22 -
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval
Paper • 2309.15129 • Published • 7 -
Vision Transformers Need Registers
Paper • 2309.16588 • Published • 86
-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 84 -
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT)
Paper • 2309.08968 • Published • 24 -
Vision Transformers Need Registers
Paper • 2309.16588 • Published • 86 -
Localizing and Editing Knowledge in Text-to-Image Generative Models
Paper • 2310.13730 • Published • 7