Update README.md
Browse files
README.md
CHANGED
|
@@ -9,7 +9,7 @@ datasets:
|
|
| 9 |
|
| 10 |
# Geez Word2Vec Skipgram Model
|
| 11 |
|
| 12 |
-
This repository contains a Word2Vec model trained on the TIGQA dataset using a custom tokenizer with SpaCy.
|
| 13 |
|
| 14 |
## Usage
|
| 15 |
Word2Vec static embeddings can be used to align or map the semantic relationships of pretrained model embeddings with the target language embeddings (specifically, embeddings generated by the Tigrinya tokenizer).
|
|
|
|
| 9 |
|
| 10 |
# Geez Word2Vec Skipgram Model
|
| 11 |
|
| 12 |
+
This repository contains a Word2Vec model trained on the TIGQA dataset using a custom tokenizer with SpaCy. and The model can be used as part of the language-adaptive pretraining process and for embedding initialization.
|
| 13 |
|
| 14 |
## Usage
|
| 15 |
Word2Vec static embeddings can be used to align or map the semantic relationships of pretrained model embeddings with the target language embeddings (specifically, embeddings generated by the Tigrinya tokenizer).
|