Post-2019 MeSH Terms

#5
by irtazaaslam - opened

I'm a data scientist working on a project for the NLM to automatically assign MeSH terms to catalog records using WellcomeBertMesh.

The model works well overall but since it was trained on data up to 2019, it has no knowledge of MeSH terms added after that. This is a significant gap for our use case as many of our records cover these newer topics.

Is there a way to support post-2019 MeSH terms without fully retraining the model? like COVID-19

For example, fine-tuning on new terms or updating the label vocabulary in a meaningful way?

Any guidance would be appreciated. Thank you!

Sign up or log in to comment