wyh

naonaowyh

AI & ML interests

None yet

Recent Activity

upvoted a paper 28 days ago

ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research

Paper • 2606.07591 • Published May 28 • 99

authored a paper 6 months ago

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Paper • 2512.22334 • Published Dec 26, 2025 • 37

upvoted a paper 6 months ago

SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence

Paper • 2512.22334 • Published Dec 26, 2025 • 37

upvoted a paper 7 months ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published Dec 18, 2025 • 121

upvoted a collection 7 months ago

SGI-Bench

Collection

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows • 12 items • Updated May 6 • 35

liked a Space 7 months ago

SciEval Leaderboard

🥇

Open, science-focus leaderboards benchmarking LLMs and VLMs

updated a collection 7 months ago

SciEval-Bench

Collection

4 items • Updated May 6 • 5

upvoted a collection 7 months ago

SciEval-Bench

Collection

4 items • Updated May 6 • 5

updated a dataset 7 months ago

SJTU-CILAB/MTGBench

Preview • Updated Dec 11, 2025 • 12

updated a Space 7 months ago

SciEval Leaderboard

🥇

Open, science-focus leaderboards benchmarking LLMs and VLMs

published a Space 7 months ago

SciEval Leaderboard

🥇

Open, science-focus leaderboards benchmarking LLMs and VLMs

published a dataset 7 months ago

SJTU-CILAB/MTGBench

Preview • Updated Dec 11, 2025 • 12

updated a dataset 7 months ago

InternScience/SciEval

Preview • Updated Dec 4, 2025 • 240 • 1

published a dataset 7 months ago

InternScience/SciEval

Preview • Updated Dec 4, 2025 • 240 • 1

updated a dataset 7 months ago

PrismaX/PrismaEval

Preview • Updated Nov 25, 2025 • 112

published a dataset 7 months ago

PrismaX/PrismaEval

Preview • Updated Nov 25, 2025 • 112

upvoted a paper 10 months ago

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Paper • 2509.15185 • Published Sep 18, 2025 • 29

upvoted a paper 12 months ago

StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling

Paper • 2507.05240 • Published Jul 7, 2025 • 49

upvoted a paper about 1 year ago

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Paper • 2506.10521 • Published Jun 12, 2025 • 75
