OpenCTEval Benchmark Datasets
A collection that supports the development of the OpenCTEval Benchmark, a medical dataset tailored to LLM reasoning over Clinical Trial (CT) data.
  araag2/MedNLI • Viewer • Updated Jul 28, 2025 • 42.1k • 144
  araag2/MedQA • Viewer • Updated Jul 28, 2025 • 38.2k • 60
  araag2/MedMCQA • Viewer • Updated Jul 31, 2025 • 579k • 294 • 4
  araag2/PubMedQA • Viewer • Updated Jul 31, 2025 • 821k • 23
TAI-P2
  google/gemma-3-4b-it • Image-Text-to-Text • 4B • Updated Mar 21, 2025 • 1.64M • 1.38k
  mistralai/Mistral-7B-Instruct-v0.2 • Text Generation • 7B • Updated Jul 24, 2025 • 1.05M • 3.17k
  meta-llama/Llama-3.2-3B-Instruct • Text Generation • 3B • Updated Oct 24, 2024 • 2M • 2.28k
Medical-LLMs
  Qwen/Qwen3-8B • Text Generation • 8B • Updated Jul 26, 2025 • 13.5M • 1.16k
  mistralai/Ministral-8B-Instruct-2410 • 8B • Updated Jul 31, 2025 • 295k • 583
  google/gemma-3n-E4B-it • Image-Text-to-Text • 8B • Updated Jul 14, 2025 • 21k • 919
  Qwen/Qwen3-4B-Instruct-2507 • Text Generation • 4B • Updated Sep 17, 2025 • 5.46M • 886