Update README.md
README.md
CHANGED
@@ -683,6 +683,8 @@ This model was created using a specialized four-stage pipeline:
 
 The final model, **ModernBERT-small-v2**, was trained using a curated combination of four distinct datasets during the **MLM Pre-training** phase to ensure broad general knowledge acquisition before the final distillation tuning.
 
+GitHub: [semantic-search-models/ModernBERT-small-v2](https://github.com/Johnnyboycurtis/semantic-search-models/tree/main/ModernBERT-small-v2)
+
 The following datasets were integrated and processed:
 
 1. **MS MARCO Triplets** (`sentence-transformers/msmarco-msmarco-MiniLM-L6-v3`, "triplet" split)