MJerome committed on
Commit 0724e8a · 1 Parent(s): eadbc15

Update README.md

Files changed (1)
  1. README.md +10 -2
README.md CHANGED
@@ -53,10 +53,10 @@ Chess games from Lichess were converted from PGN into [xLAN](https://github.zhaw
 
  #### Training Hyperparameters
 
- - **Training regime:**
  - Batch Size: 24
  - Epochs: 4
  - Learning Rate: 0.00005
+ - With BOS-Token
 
  ## Evaluation
 
@@ -74,8 +74,16 @@ The model was tested on 3 metrics:
  ### Results
 
 
-
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64b81dff25b0493d515e317c/j0ZCCS0nIrkiyPJmynnsG.png)
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64b81dff25b0493d515e317c/NTvDR_XHnyKih9u_GExWo.png)
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64b81dff25b0493d515e317c/XGu5el3iztCFmPZiwTmOO.png)
 
  ## Model Architecture
 
+ GPT2 config with the following changes:
 
+ - VOCAB_SIZE = 76
+ - N_POSITION = 512
+ - PAD_TOKEN_ID = 0
+ - BOS_TOKEN_ID = 75
+ - EOS_TOKEN_ID = 74
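
The config changes listed above might be expressed roughly as follows. This is a minimal sketch, not the author's training code: it assumes the README's names map onto the `GPT2Config` keywords of the Hugging Face `transformers` library (in particular that `N_POSITION` corresponds to `n_positions`), and the names `GPT2_OVERRIDES`, `check_special_tokens` and `build_config` are hypothetical.

```python
# Overrides from the README, keyed by the GPT2Config keyword each one is
# ASSUMED to correspond to (the README does not show the actual code).
GPT2_OVERRIDES = {
    "vocab_size": 76,     # VOCAB_SIZE
    "n_positions": 512,   # N_POSITION (assumed mapping)
    "pad_token_id": 0,    # PAD_TOKEN_ID
    "bos_token_id": 75,   # BOS_TOKEN_ID
    "eos_token_id": 74,   # EOS_TOKEN_ID
}


def check_special_tokens(overrides):
    """Return True if the special token ids are distinct and inside the vocabulary."""
    ids = [overrides[k] for k in ("pad_token_id", "bos_token_id", "eos_token_id")]
    in_range = all(0 <= i < overrides["vocab_size"] for i in ids)
    return in_range and len(set(ids)) == len(ids)


def build_config(overrides=GPT2_OVERRIDES):
    """Build a GPT2Config from the overrides; needs `transformers` installed."""
    # Imported lazily so the dictionary above can be inspected without the library.
    from transformers import GPT2Config
    return GPT2Config(**overrides)
```

Calling `build_config()` leaves every other GPT-2 setting at the library default, which matches the README's wording of "GPT2 config with the following changes"; whether the original model also changed other defaults is not stated.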