Yuanzhe Shen
OceanSky
·
AI & ML interests
None yet
Recent Activity
submitted
a paper
about 22 hours ago
TRIP-Bench: A Benchmark for Long-Horizon Interactive Agents in Real-World Scenarios
authored
a paper
about 23 hours ago
Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction
authored
a paper
about 23 hours ago
RECAST: Expanding the Boundaries of LLMs' Complex Instruction Following with Multi-Constraint Data