ruslandev commited on
Commit
54ed95e
·
verified ·
1 Parent(s): 37e1687

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +8 -0
README.md ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ ## Knowledge encoding by examples of Word2Vec and LLM training
2
+ This repository contains weights for a list of language models:
3
+
4
+ - word2vec.pt: embedding trained on 150mil pairs of text tokens subsampled from text8 dataset. SkipGram method with negative sampling was used as described in the original [paper](https://arxiv.org/abs/1402.3722).
5
+ - mlp.pt: 2-layers MLP trained on the same dataset and using pretrained embeddings.
6
+ - mlp_norm.pt: Version of the MLP model utilizing LayerNorm for better scaling of the learned features distribution.
7
+
8
+ [Training code can be found on GitHub](https://github.com/RuslanPeresy/knowledge-encode).