Standardized benchmark and leaderboard for Clinical Named Entity Recognition
AI & ML interests
None defined yet.
Model evaluation framework for Clinical Application
-
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
Paper β’ 2409.07314 β’ Published β’ 56 -
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks
Paper β’ 2407.21072 β’ Published β’ 2 -
Named Clinical Entity Recognition Benchmark
Paper β’ 2410.05046 β’ Published β’ 16 -
MEDIC Benchmark
π53View and compare medical LLM evaluations
SoTA clinical LLMs and the techniques to build them.
-
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
Paper β’ 2409.14988 β’ Published β’ 22 -
Med42-v2: A Suite of Clinical LLMs
Paper β’ 2408.06142 β’ Published β’ 52 -
Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient Approaches
Paper β’ 2404.14779 β’ Published β’ 1 -
m42-health/Llama3-Med42-70B
Text Generation β’ 71B β’ Updated β’ 1.6k β’ β’ 68
Standardized benchmark and leaderboard for Clinical Named Entity Recognition
SoTA clinical LLMs and the techniques to build them.
-
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
Paper β’ 2409.14988 β’ Published β’ 22 -
Med42-v2: A Suite of Clinical LLMs
Paper β’ 2408.06142 β’ Published β’ 52 -
Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient Approaches
Paper β’ 2404.14779 β’ Published β’ 1 -
m42-health/Llama3-Med42-70B
Text Generation β’ 71B β’ Updated β’ 1.6k β’ β’ 68
Model evaluation framework for Clinical Application
-
MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications
Paper β’ 2409.07314 β’ Published β’ 56 -
Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks
Paper β’ 2407.21072 β’ Published β’ 2 -
Named Clinical Entity Recognition Benchmark
Paper β’ 2410.05046 β’ Published β’ 16 -
MEDIC Benchmark
π53View and compare medical LLM evaluations