UMCU committed · Commit f29ace4 · verified · 1 Parent(s): 15189ee

Update README.md


Llama-3.2-1B-Instruct with domain-adaptive pretraining (DAPT), also called continued pre-training (CPT), on a Dutch medical corpus.

Trained for one full epoch with a batch size of 256, a maximum sequence length of 768, and a linear-cosine learning-rate schedule (details to follow).
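A minimal sketch of such a CPT setup, assuming a single device (batch size 256 reached via gradient accumulation), a `text` column in the dataset, and a linear warmup followed by cosine decay for the "linear-cosine" schedule; the warmup ratio is an assumption, not stated in this commit:

```python
from datasets import load_dataset
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "meta-llama/Llama-3.2-1B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Dutch medical corpus; the "text" column name is an assumption.
dataset = load_dataset("UMCU/DutchMedicalText", split="train")

def tokenize(batch):
    # Truncate to the stated maximum sequence length of 768 tokens.
    return tokenizer(batch["text"], truncation=True, max_length=768)

tokenized = dataset.map(tokenize, batched=True, remove_columns=dataset.column_names)

args = TrainingArguments(
    output_dir="llama-3.2-1b-dutch-medical-cpt",
    num_train_epochs=1,              # one full epoch
    per_device_train_batch_size=16,  # 16 x 16 accumulation = effective batch size 256
    gradient_accumulation_steps=16,
    lr_scheduler_type="cosine",      # assumed: linear warmup + cosine decay
    warmup_ratio=0.03,               # assumption; not given in the commit
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized,
    # mlm=False gives standard causal-LM (next-token) labels.
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```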

This model will be further pre-trained on 5 million cardiology records from the UMCU.
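A hedged usage sketch for the resulting checkpoint; the repository id below is hypothetical, since this commit does not name the model repo:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "UMCU/Llama-3.2-1B-Instruct-DutchMedical"  # hypothetical repo id
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

# Dutch: "The patient presents with chest pain and"
prompt = "De patiënt presenteert zich met pijn op de borst en"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```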

Files changed (1)
  1. README.md +12 -3
README.md CHANGED
@@ -1,3 +1,12 @@
- ---
- license: llama3.2
- ---
+ ---
+ license: llama3.2
+ datasets:
+ - UMCU/DutchMedicalText
+ language:
+ - nl
+ base_model:
+ - meta-llama/Llama-3.2-1B-Instruct
+ tags:
+ - medical
+ - cardiology
+ ---