Article Where Does the Signal Live? A Web Data Recipe for Medical Encoder Pretraining bofenghuang • 6 days ago • 5
doctolib-lab/finemed-entity-extractor-fr Token Classification • 0.3B • Updated 4 days ago • 15 • 2
doctolib-lab/finemed-subdomain-classifier-fr Text Classification • 0.1B • Updated 4 days ago • 9
DoctoBERT-fr Collection French medical encoders pretrained from scratch on curated and LLM-rephrased medical web data. • 4 items • Updated 4 days ago • 8
FineMed-fr Collection A French medical pretraining corpus, its LLM-rephrased variant, and the annotators that built them. • 6 items • Updated 4 days ago • 2