johnnyboycurtis
/

ModernBERT-small-v2

Sentence Similarity

sentence-transformers

feature-extraction

Generated from Trainer

dataset_size:3375201

Eval Results (legacy)

text-embeddings-inference

Model card Files Files and versions

johnnyboycurtis commited on Feb 20

Commit

72a12a4

·

verified ·

1 Parent(s): a58ba5a

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -683,6 +683,8 @@ This model was created using a specialized four-stage pipeline:
 The final model, **ModernBERT-small-v2**, was trained using a curated combination of four distinct datasets during the **MLM Pre-training** phase to ensure broad general knowledge acquisition before the final distillation tuning.
 The following datasets were integrated and processed:
 1.  **MS MARCO Triplets** (`sentence-transformers/msmarco-msmarco-MiniLM-L6-v3`, "triplet" split)

 The final model, **ModernBERT-small-v2**, was trained using a curated combination of four distinct datasets during the **MLM Pre-training** phase to ensure broad general knowledge acquisition before the final distillation tuning.
+GitHub: [semantic-search-models/ModernBERT-small-v2](https://github.com/Johnnyboycurtis/semantic-search-models/tree/main/ModernBERT-small-v2)
 The following datasets were integrated and processed:
 1.  **MS MARCO Triplets** (`sentence-transformers/msmarco-msmarco-MiniLM-L6-v3`, "triplet" split)