Self-improving LLMs
updated
Self-Taught Self-Correction for Small Language Models
Paper
• 2503.08681
• Published
• 15
Self-Improving Robust Preference Optimization
Paper
• 2406.01660
• Published
• 20
LADDER: Self-Improving LLMs Through Recursive Problem Decomposition
Paper
• 2503.00735
• Published
• 23
Meta-Rewarding Language Models: Self-Improving Alignment with
LLM-as-a-Meta-Judge
Paper
• 2407.19594
• Published
• 21
Self-Taught Optimizer (STOP): Recursively Self-Improving Code Generation
Paper
• 2310.02304
• Published
• 1
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four
Habits of Highly Effective STaRs
Paper
• 2503.01307
• Published
• 38
Large Language Models Can Self-Improve in Long-context Reasoning
Paper
• 2411.08147
• Published
• 65
B-STaR: Monitoring and Balancing Exploration and Exploitation in
Self-Taught Reasoners
Paper
• 2412.17256
• Published
• 47
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling
Verification
Paper
• 2502.01839
• Published
• 10
Enabling Scalable Oversight via Self-Evolving Critic
Paper
• 2501.05727
• Published
• 72
Symbolic Learning Enables Self-Evolving Agents
Paper
• 2406.18532
• Published
• 12
A Survey on Self-Evolution of Large Language Models
Paper
• 2404.14387
• Published
• 3
Gödel Agent: A Self-Referential Agent Framework for Recursive
Self-Improvement
Paper
• 2410.04444
• Published
• 3
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge
through Self-Teaching
Paper
• 2406.06326
• Published
• 2
Learning Evolving Tools for Large Language Models
Paper
• 2410.06617
• Published
• 2
LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as
Evolutionary Optimizers
Paper
• 2503.14434
• Published
• 7
Self-Rewarding Language Models
Paper
• 2401.10020
• Published
• 152