cnmoro
/

custom-model2vec-tokenlearn-medium

sentence-transformers

static-embeddings

Model card Files Files and versions

cnmoro commited on Oct 27, 2025

Commit

a96e898

·

verified ·

1 Parent(s): fd951a6

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -12,6 +12,8 @@ language:
 ---
 A custom model2vec model, trained using a modified version of the [tokenlearn](https://github.com/MinishLab/tokenlearn) library.
 The output dimension is 256, and the vocabulary size is 249.999
 The training process used a mix of English (10%) and Portuguese (90%) texts.

 ---
 A custom model2vec model, trained using a modified version of the [tokenlearn](https://github.com/MinishLab/tokenlearn) library.
+Base model is nomic-ai/nomic-embed-text-v2-moe.
 The output dimension is 256, and the vocabulary size is 249.999
 The training process used a mix of English (10%) and Portuguese (90%) texts.