CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare Paper • 2603.24157 • Published 5 days ago • 8
CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare Paper • 2603.24157 • Published 5 days ago • 8
CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare Paper • 2603.24157 • Published 5 days ago • 8
CURE-Med: Curriculum-Informed Reinforcement Learning for Multilingual Medical Reasoning Paper • 2601.13262 • Published Jan 19 • 2
CURE-Med: Curriculum-Informed Reinforcement Learning for Multilingual Medical Reasoning Paper • 2601.13262 • Published Jan 19 • 2
CLINIC: Evaluating Multilingual Trustworthiness in Language Models for Healthcare Paper • 2512.11437 • Published Dec 12, 2025 • 4
CLINIC: Evaluating Multilingual Trustworthiness in Language Models for Healthcare Paper • 2512.11437 • Published Dec 12, 2025 • 4
CLINIC: Evaluating Multilingual Trustworthiness in Language Models for Healthcare Paper • 2512.11437 • Published Dec 12, 2025 • 4
left|,circlearrowright,text{BUS},right|: A Large and Diverse Multimodal Benchmark for evaluating the ability of Vision-Language Models to understand Rebus Puzzles Paper • 2511.01340 • Published Nov 3, 2025 • 13