DISPROTBENCH: A Disorder-Aware, Task-Rich Benchmark for Evaluating Protein Structure Prediction in Realistic Biological Contexts Paper • 2507.02883 • Published Jun 18, 2025
Scientific Hypothesis Generation and Validation: Methods, Datasets, and Future Directions Paper • 2505.04651 • Published May 6, 2025
HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs Paper • 2601.18753 • Published 1 day ago • 1
HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs Paper • 2601.18753 • Published 1 day ago • 1
Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning Paper • 2505.16122 • Published May 22, 2025 • 5
LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection Paper • 2505.03793 • Published May 1, 2025 • 1