Papers
arxiv:2605.15301

Solvita: Enhancing Large Language Models for Competitive Programming via Agentic Evolution

Published on May 14
· Submitted by
Jiaheng Liu
on May 18
Authors:
,
,
,
,
,
,
,
,
,
,
,
,

Abstract

Solvita is an agentic evolution framework that enables continuous learning in code generation through reinforcement learning updates to graph-structured knowledge networks, achieving state-of-the-art performance on competitive programming benchmarks.

AI-generated summary

Large language models (LLMs) still struggle with the rigorous reasoning demands of hard competitive programming. While recent multi-agent frameworks attempt to bridge this reliability gap, they remain fundamentally stateless: they rely on static retrieval and discard the valuable problem-solving and debugging experience gained from previous tasks. To address this, we present Solvita, an agentic evolution framework that enables continuous learning without requiring weight updates to the underlying LLM. Solvita reorganizes problem-solving into a closed-loop system of strategy selection, program synthesis, certified supervision, and targeted hacking, executed by four specialized agents: Planner, Solver, Oracle, and Hacker. Crucially, each agent is paired with a trainable, graph-structured knowledge network. As the system operates, outcome signals, such as pass/fail verdicts, test certification quality, and adversarial vulnerabilities discovered by the Hacker, are recast as reinforcement learning updates to these network weights. This allows the agents to dynamically route future queries based on past successes and failures, effectively accumulating transferable reasoning experience over time. Evaluated across CodeContests, APPS, AetherCode, and live Codeforces rounds, Solvita establishes a new state-of-the-art among code-generation agents, outperforming existing multi-agent pipelines and nearly doubling the accuracy of single-pass baselines.

Community

Paper submitter

Large language models (LLMs) still struggle with the rigorous reasoning demands of hard competitive programming. While recent multi-agent frameworks attempt to bridge this reliability
gap, they remain fundamentally stateless: they rely on static retrieval and discard the valuable
problem-solving and debugging experience gained from previous tasks. To address this, we present
Solvita, an agentic evolution framework that enables continuous learning without requiring weight
updates to the underlying LLM. Solvita reorganizes problem-solving into a closed-loop system
of strategy selection, program synthesis, certified supervision, and targeted hacking, executed by
four specialized agents (Planner, Solver, Oracle, and Hacker).

image

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2605.15301
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2605.15301 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2605.15301 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2605.15301 in a Space README.md to link it from this page.

Collections including this paper 1