Large Language Models as Optimizers
Paper
• 2309.03409
• Published
• 79
FLM-101B: An Open LLM and How to Train It with $100K Budget
Paper
• 2309.03852
• Published
• 45
GPT Can Solve Mathematical Problems Without a Calculator
Paper
• 2309.03241
• Published
• 19
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
Paper
• 2309.03883
• Published
• 36
ImageBind-LLM: Multi-modality Instruction Tuning
Paper
• 2309.03905
• Published
• 18
Textbooks Are All You Need II: phi-1.5 technical report
Paper
• 2309.05463
• Published
• 89
NExT-GPT: Any-to-Any Multimodal LLM
Paper
• 2309.05519
• Published
• 79
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
Paper
• 2309.04564
• Published
• 17
Efficient Memory Management for Large Language Model Serving with PagedAttention
Paper
• 2309.06180
• Published
• 38
Language Modeling Is Compression
Paper
• 2309.10668
• Published
• 84
Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Paper
• 2309.10020
• Published
• 41