Update README.md
Browse files
README.md
CHANGED
|
@@ -12,6 +12,6 @@ While some data errors are tolerable, perhaps even desirable, for general ML mod
|
|
| 12 |
|
| 13 |
Complicating matters, shifting medical facts may invalidate training data and model knowledge. What was true last year may be false today. For instance, in April 2024 the U.S. Preventive Services Task Force reversed its longstanding advice and now urges biennial mammograms starting at age 40 -- down from the previous benchmark of 50 -- for average-risk women, citing rising breast-cancer incidence in younger patients.
|
| 14 |
|
| 15 |
-
Accurate annotation of medical data is challenging. Even Google DeepMind's relabeled effort of MedQA from 2024 contains errors, which we uncovered.
|
| 16 |
|
| 17 |
This is why HotpotBio exists: to provide rigorously validated, expert-curated datasets and benchmarks in pursuit of advancing ML/AI in clinical and broader biomedical applications.
|
|
|
|
| 12 |
|
| 13 |
Complicating matters, shifting medical facts may invalidate training data and model knowledge. What was true last year may be false today. For instance, in April 2024 the U.S. Preventive Services Task Force reversed its longstanding advice and now urges biennial mammograms starting at age 40 -- down from the previous benchmark of 50 -- for average-risk women, citing rising breast-cancer incidence in younger patients.
|
| 14 |
|
| 15 |
+
Accurate annotation of medical data is challenging and demands verification by experts based on the latest guidelines. Even Google DeepMind's relabeled effort of MedQA from 2024 contains errors, which we uncovered.
|
| 16 |
|
| 17 |
This is why HotpotBio exists: to provide rigorously validated, expert-curated datasets and benchmarks in pursuit of advancing ML/AI in clinical and broader biomedical applications.
|