VoladorLuYu 's Collections LLM Reports
updated
Nemotron-4 15B Technical Report
Paper
• 2402.16819
• Published • 46
InternLM2 Technical Report
Paper
• 2403.17297
• Published • 34
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Paper
• 2404.04167
• Published • 13
MobileLLM: Optimizing Sub-billion Parameter Language Models for
On-Device Use Cases
Paper
• 2402.14905
• Published • 134
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Paper
• 2404.07413
• Published • 38
Chinchilla Scaling: A replication attempt
Paper
• 2404.10102
• Published • 2
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
Phone
Paper
• 2404.14219
• Published • 259
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper
• 2405.00732
• Published • 122
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper
• 2406.06608
• Published • 68
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code
Intelligence
Paper
• 2406.11931
• Published • 69
Paper
• 2407.10671
• Published • 169
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published • 140