Yuhan Research

university

https://teafrogsf.github.io

AI & ML interests

None defined yet.

Recent Activity

sunyiyou authored a paper about 9 hours ago

OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection

sunyiyou authored a paper about 9 hours ago

Scattered Forest Search: Smarter Code Space Exploration with LLMs

sunyiyou authored a paper about 9 hours ago

Can LLMs Design Good Questions Based on Context?

View all activity

authored 9 papers about 9 hours ago

OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection

Paper • 2306.09301 • Published Jun 15, 2023 • 1

Scattered Forest Search: Smarter Code Space Exploration with LLMs

Paper • 2411.05010 • Published Oct 22, 2024 • 1

Can LLMs Design Good Questions Based on Context?

Paper • 2501.03491 • Published Jan 7, 2025

Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?

Paper • 2504.11741 • Published Apr 16, 2025 • 1

OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization

Paper • 2506.18880 • Published Jun 23, 2025 • 4

Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought

Paper • 2510.24941 • Published Oct 28, 2025 • 4

Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents

Paper • 2602.13379 • Published Feb 13 • 3

Rethinking Domain Generalization for Face Anti-spoofing: Separability and Alignment

Paper • 2303.13662 • Published Mar 23, 2023

Agents' Last Exam

Paper • 2606.05405 • Published 6 days ago • 1

published a dataset 9 months ago

yuhan-research/explorative-B-easy

Viewer • Updated Sep 9, 2025 • 10 • 13

updated a dataset 9 months ago

yuhan-research/explorative-B-easy

Viewer • Updated Sep 9, 2025 • 10 • 13