Abstract
True artificial agency requires internalized structures for goals, identity, decision-making, self-regulation, and learning, distinguishing autonomous systems from task-specific ones.
What is an agent? What constitutes agency? With the rise of Large Language Model (LLM) systems marketed as ``coding agents'', ``AI co-scientists'', and other ``agentic" tools that promise to drive up productivity, and at the same time, ``existential" concerns such as AI escaping human control with destructive power under a speculative ``machine agency" against humans, it has become essential to clarify where automation ends and agency begins, both for building capable systems and for understanding whether and what to fear. Drawing on Descartes' grounding of agency in independent thought, and on portrayals of autonomous beings in science fiction, we survey the current landscape of AI agents, and analyze agent architectures along five dimensions: goal, identity, decision-making, self-regulation, and learning. Specifically, we argue that genuine agency requires these structures to be internalized within the system itself rather than assembled through external scaffolding. This distinction between agentic systems, whose competence resides in engineered workflows, and agentive systems, whose capabilities (including social interaction) arise endogenously, defines the boundary between systems designed for prescribed tasks, and those capable of operating in the open world with true autonomy. Building on this analysis, we propose the Goal-Identity-Configurator (GIC) architecture for a general-purpose agent model, combining hierarchical goal decomposition, identity evolution, simulative reasoning grounded in a separately trained world model, learned self-regulation, and self-directed learning from both real and simulated experience. Furthermore, we share insight on the auditability, controllability, and safety of agentive systems that possess greater autonomy and ``agency", but remain under human oversight.
Community
Fable 5 and the upcoming GPT-5.6 promise exceptional "agentic" capabilities in software engineering and scientific research. Companies like Figure AI are racing towards humanoid robots.
We study a related but deeper question: what is the remaining ๐ด๐ฎ๐ฝ between current systems and fully autonomous agents?
We formally analyze today's AI agents along five axes: ๐ด๐ผ๐ฎ๐น, ๐ถ๐ฑ๐ฒ๐ป๐๐ถ๐๐, ๐ฑ๐ฒ๐ฐ๐ถ๐๐ถ๐ผ๐ป-๐บ๐ฎ๐ธ๐ถ๐ป๐ด, ๐๐ฒ๐น๐ณ-๐ฟ๐ฒ๐ด๐๐น๐ฎ๐๐ถ๐ผ๐ป, and ๐น๐ฒ๐ฎ๐ฟ๐ป๐ถ๐ป๐ด. We find that what separates these "agentic" systems from natural agents like you and me is whether capabilities arise from ๐ฒ๐ ๐๐ฒ๐ฟ๐ป๐ฎ๐น ๐๐ฐ๐ฎ๐ณ๐ณ๐ผ๐น๐ฑ๐ถ๐ป๐ด or ๐ถ๐ป๐๐ฒ๐ฟ๐ป๐ฎ๐น ๐ถ๐ป๐ถ๐๐ถ๐ฎ๐๐ถ๐๐ฒ, a distinction we formalize as ๐ฎ๐ด๐ฒ๐ป๐๐ถ๐ฐ vs. ๐ฎ๐ด๐ฒ๐ป๐๐ถ๐๐ฒ.
We propose the ๐๐ผ๐ฎ๐น-๐๐ฑ๐ฒ๐ป๐๐ถ๐๐-๐๐ผ๐ป๐ณ๐ถ๐ด๐๐ฟ๐ฎ๐๐ผ๐ฟ (๐๐๐) architecture for general-purpose agent models that internalize all of the following: hierarchical goals, evolving identity, simulative reasoning via a separate world model, a learned configurator for self-regulation, and self-directed learning from real + simulated experience.
Better agents don't come from better harnesses; they come from models that can harness themselves.
Get this paper in your agent:
hf papers read 2606.23991 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper