Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments • shanchen • Jan 13 • 10
What We Learned About LLM/VLMs in Healthcare AI Evaluation: • shanchen • Nov 8, 2024 • 16