huxueyu
huxueyu
AI & ML interests
Large Language Models
Recent Activity
upvoted a paper 30 days ago
EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies submitted
a paper
30 days ago
EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies upvoted a paper about 1 month ago
AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios