Create README.md
README.md
ADDED
@@ -0,0 +1,8 @@
## Knowledge encoding through examples of Word2Vec and LLM training
This repository contains weights for the following language models:

- word2vec.pt: embeddings trained on 150 million token pairs subsampled from the text8 dataset, using the skip-gram method with negative sampling as described in this [paper](https://arxiv.org/abs/1402.3722).
- mlp.pt: 2-layer MLP trained on the same dataset using the pretrained embeddings.
- mlp_norm.pt: a version of the MLP model that uses LayerNorm to better scale the distribution of learned features.

[Training code can be found on GitHub](https://github.com/RuslanPeresy/knowledge-encode).
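
For readers unfamiliar with skip-gram with negative sampling, the per-pair objective can be sketched without any framework. This is only an illustration, not the training code from the repository linked above: the function names (`log_sigmoid`, `dot`, `sgns_loss`) and the toy vectors are hypothetical, and the real training presumably operates on batched tensors.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def sgns_loss(center, context, negatives):
    """Negative-sampling loss for one (center, context) pair.

    center, context: embedding vectors (lists of floats)
    negatives: embedding vectors of sampled noise words
    """
    # Pull the true context word towards the center word ...
    loss = -log_sigmoid(dot(center, context))
    # ... and push each sampled noise word away from it.
    for neg in negatives:
        loss -= log_sigmoid(-dot(center, neg))
    return loss


# Toy vectors (hypothetical values, for illustration only).
center = [1.0, 0.5]
aligned = [1.0, 0.5]       # points the same way as the center vector
opposed = [-1.0, -0.5]     # points the opposite way

good = sgns_loss(center, aligned, [opposed])
bad = sgns_loss(center, opposed, [aligned])
print(good < bad)
```

The loss is lower when the true context vector aligns with the center vector and the noise vector does not, which is the property that training exploits.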
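
The difference between mlp.pt and mlp_norm.pt can be illustrated with a small framework-free sketch of layer normalization inside a 2-layer MLP. Everything here is an assumption made for illustration: the helper names (`layer_norm`, `matvec`, `mlp_forward`), the placement of the normalization before the ReLU, and the omission of biases and of the learnable gain and shift are not taken from the repository.

```python
import math


def layer_norm(x, eps=1e-5):
    """Normalize one feature vector to zero mean and (near) unit variance.

    Assumption: no learnable gain or bias, unlike a full LayerNorm layer.
    """
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def matvec(w, x):
    """Multiply a weight matrix (list of rows) by a vector."""
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]


def mlp_forward(x, w1, w2, use_norm=False):
    """Hypothetical 2-layer MLP: linear -> optional LayerNorm -> ReLU -> linear."""
    h = matvec(w1, x)
    if use_norm:
        h = layer_norm(h)
    h = [max(0.0, v) for v in h]
    return matvec(w2, h)


# Two inputs that differ only in scale give nearly identical normalized features.
small = layer_norm([1.0, 2.0, 3.0, 4.0])
large = layer_norm([10.0, 20.0, 30.0, 40.0])
print(all(abs(a - b) < 1e-4 for a, b in zip(small, large)))
```

Normalization removes the dependence on the overall scale of the hidden features, which is one common reading of "better scaling of the learned features distribution".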