-
LLM Agent Operating System
Paper • 2403.16971 • Published • 73 -
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification
Paper • 2508.05629 • Published • 188 -
Is Chain-of-Thought Reasoning of LLMs a Mirage? A Data Distribution Lens
Paper • 2508.01191 • Published • 240 -
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 263
Vivek
vikx01
AI & ML interests
None yet
Recent Activity
updated a collection 22 days ago
llm updated a collection 3 months ago
llm upvoted a paper 3 months ago
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-AgentsOrganizations
None yet