Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution Paper • 2603.05308 • Published Mar 5 • 2
CT-Bench: A Benchmark for Multimodal Lesion Understanding in Computed Tomography Paper • 2602.14879 • Published Feb 19
Rethinking Visual Attribution for Chest X-ray Reasoning in Large Vision Language Models Paper • 2605.20158 • Published May 19 • 2
MedHopQA: A Disease-Centered Multi-Hop Reasoning Benchmark and Evaluation Framework for LLM-Based Biomedical Question Answering Paper • 2605.12361 • Published May 12
DeepEvidence: Empowering Biomedical Discovery with Deep Knowledge Graph Research Paper • 2601.11560 • Published Dec 23, 2025
Knowledge-guided Contextual Gene Set Analysis Using Large Language Models Paper • 2506.04303 • Published Jun 4, 2025
GeneGPT: Augmenting Large Language Models with Domain Tools for Improved Access to Biomedical Information Paper • 2304.09667 • Published Apr 19, 2023 • 1
RAG-Gym: Optimizing Reasoning and Search Agents with Process Supervision Paper • 2502.13957 • Published Feb 19, 2025 • 1
Benchmarking Retrieval-Augmented Generation for Chemistry Paper • 2505.07671 • Published May 12, 2025 • 1
Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning Paper • 2506.02911 • Published Jun 3, 2025 • 1
AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning Paper • 2402.13225 • Published Feb 20, 2024 • 1
TrialPanorama: Database and Benchmark for Systematic Review and Design of Clinical Trials Paper • 2505.16097 • Published May 22, 2025
PubMedQA: A Dataset for Biomedical Research Question Answering Paper • 1909.06146 • Published Sep 13, 2019 • 4
BioCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval Paper • 2307.00589 • Published Jul 2, 2023 • 1
PubTator 3.0: an AI-powered Literature Resource for Unlocking Biomedical Knowledge Paper • 2401.11048 • Published Jan 19, 2024 • 2
Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science Paper • 2402.04247 • Published Feb 6, 2024 • 2
PMC-Patients: A Large-scale Dataset of Patient Notes and Relations Extracted from Case Reports in PubMed Central Paper • 2202.13876 • Published Feb 28, 2022
Benchmarking Retrieval-Augmented Generation for Medicine Paper • 2402.13178 • Published Feb 20, 2024 • 10
Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions Paper • 2408.00727 • Published Aug 1, 2024 • 2