LLMs
updated
Ziya2: Data-centric Learning is All LLMs Need
Paper
• 2311.03301
• Published • 20
Co-training and Co-distillation for Quality Improvement and Compression
of Language Models
Paper
• 2311.02849
• Published • 8
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning
Paper
• 2311.02303
• Published • 12
ADaPT: As-Needed Decomposition and Planning with Language Models
Paper
• 2311.05772
• Published • 12
Prompt Engineering a Prompt Engineer
Paper
• 2311.05661
• Published • 22
FinGPT: Large Generative Models for a Small Language
Paper
• 2311.05640
• Published • 30
Language Models can be Logical Solvers
Paper
• 2311.06158
• Published • 20
Lumos: Learning Agents with Unified Data, Modular Design, and
Open-Source LLMs
Paper
• 2311.05657
• Published • 30
Exponentially Faster Language Modelling
Paper
• 2311.10770
• Published • 119
SparQ Attention: Bandwidth-Efficient LLM Inference
Paper
• 2312.04985
• Published • 40
PathFinder: Guided Search over Multi-Step Reasoning Paths
Paper
• 2312.05180
• Published • 10
EE-LLM: Large-Scale Training and Inference of Early-Exit Large Language
Models with 3D Parallelism
Paper
• 2312.04916
• Published • 7