arxiv:2606.10479
Qianjia Cheng
CajZella
AI & ML interests
None yet
Recent Activity
authored a paper about 5 hours ago
ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics upvoted a paper about 5 hours ago
ComBench: A Benchmark for Rigorous Proof Reasoning and Constructive Realization in Olympiad-Level Combinatorics upvoted a paper 20 days ago
π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon WorkflowsOrganizations
None yet