taghizadeh's Collections: llms
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
Paper
• 2401.01055
• Published • 54
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Paper
• 2401.01335
• Published • 69
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper
• 2401.00908
• Published • 191
Multilingual Instruction Tuning With Just a Pinch of Multilinguality
Paper
• 2401.01854
• Published • 11
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper
• 2401.02038
• Published • 65
TinyLlama: An Open-Source Small Language Model
Paper
• 2401.02385
• Published • 95
LLaMA Pro: Progressive LLaMA with Block Expansion
Paper
• 2401.02415
• Published • 54
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
Paper
• 2401.04658
• Published • 27
The Impact of Reasoning Step Length on Large Language Models
Paper
• 2401.04925
• Published • 18
MaLA-500: Massive Language Adaptation of Large Language Models
Paper
• 2401.13303
• Published • 12
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding
Paper
• 2403.12895
• Published • 32
Localizing Paragraph Memorization in Language Models
Paper
• 2403.19851
• Published • 15
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Paper
• 2403.20041
• Published • 34
LLoCO: Learning Long Contexts Offline
Paper
• 2404.07979
• Published • 22
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models
Paper
• 2404.12387
• Published • 40
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
Paper
• 2404.14619
• Published • 126