JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation Paper • 2510.00974 • Published Oct 1, 2025
DR-LoRA: Dynamic Rank LoRA for Mixture-of-Experts Adaptation Paper • 2601.04823 • Published Jan 8 • 6
Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning Paper • 2601.06943 • Published about 1 month ago • 211
RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction Paper • 2601.06966 • Published about 1 month ago • 8
FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments Paper • 2601.07853 • Published Jan 9 • 10
CloneMem: Benchmarking Long-Term Memory for AI Clones Paper • 2601.07023 • Published about 1 month ago • 3
Controlled Self-Evolution for Algorithmic Code Optimization Paper • 2601.07348 • Published 29 days ago • 114
KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions Paper • 2601.04745 • Published Jan 8 • 58
MemGovern: Enhancing Code Agents through Learning from Governed Human Experiences Paper • 2601.06789 • Published about 1 month ago • 78
GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging Paper • 2508.18993 • Published Aug 26, 2025 • 4
SE-Agent: Self-Evolution Trajectory Optimization in Multi-Step Reasoning with LLM-Based Agents Paper • 2508.02085 • Published Aug 4, 2025 • 2
RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving Paper • 2505.21577 • Published May 27, 2025 • 3
ShieldLearner: A New Paradigm for Jailbreak Attack Defense in LLMs Paper • 2502.13162 • Published Feb 16, 2025