impactful-papers
updated
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer
Learning
Paper
• 2311.11077
• Published • 29
Tensor Product Attention Is All You Need
Paper
• 2501.06425
• Published • 90
LoRA: Low-Rank Adaptation of Large Language Models
Paper
• 2106.09685
• Published • 60
ShortGPT: Layers in Large Language Models are More Redundant Than You
Expect
Paper
• 2403.03853
• Published • 66
DarwinLM: Evolutionary Structured Pruning of Large Language Models
Paper
• 2502.07780
• Published • 18
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in
Agentic Tasks
Paper
• 2502.08235
• Published • 59
StarCoder: may the source be with you!
Paper
• 2305.06161
• Published • 33
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Paper
• 2502.02737
• Published • 257
Small Models, Big Impact: Efficient Corpus and Graph-Based Adaptation of
Small Multilingual Language Models for Low-Resource Languages
Paper
• 2502.10140
• Published • 9
Fine-Tuning Small Language Models for Domain-Specific AI: An Edge AI
Perspective
Paper
• 2503.01933
• Published • 13
Fast Inference from Transformers via Speculative Decoding
Paper
• 2211.17192
• Published • 11
Neural Thickets: Diverse Task Experts Are Dense Around Pretrained Weights
Paper
• 2603.12228
• Published • 11