-
Executable Code Actions Elicit Better LLM Agents
Paper • 2402.01030 • Published • 187 -
ReAct: Synergizing Reasoning and Acting in Language Models
Paper • 2210.03629 • Published • 32 -
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering
Paper • 2405.15793 • Published • 7 -
DevBench: A Comprehensive Benchmark for Software Development
Paper • 2403.08604 • Published • 2
Realsid
RealSid
·
AI & ML interests
None yet
Recent Activity
liked
a model
6 days ago
onnx-community/canary-qwen-2.5b-ONNX
liked
a model
6 days ago
nvidia/canary-qwen-2.5b
liked
a Space
3 months ago
nanotron/ultrascale-playbook