Gloria Geng

GloriaGeng

2 6

·

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

GloriaGeng/CausalT5K

upvoted a paper about 2 months ago

REALM-Bench: A Benchmark for Evaluating Multi-Agent Systems on Real-world, Dynamic Planning and Scheduling Tasks

upvoted a paper about 2 months ago

CausalT5K: Diagnosing and Informing Refusal for Trustworthy Causal Reasoning of Skepticism, Sycophancy, Detection-Correction, and Rung Collapse

View all activity

Organizations

upvoted 2 papers about 2 months ago

REALM-Bench: A Benchmark for Evaluating Multi-Agent Systems on Real-world, Dynamic Planning and Scheduling Tasks

Paper • 2502.18836 • Published Aug 5, 2025 • 1

CausalT5K: Diagnosing and Informing Refusal for Trustworthy Causal Reasoning of Skepticism, Sycophancy, Detection-Correction, and Rung Collapse

Paper • 2602.08939 • Published Feb 9 • 1