Update README.md
README.md
CHANGED
@@ -53,10 +53,10 @@ Chess games from Lichess were converted from PGN into [xLAN](https://github.zhaw
 
 #### Training Hyperparameters
 
-- **Training regime:**
 - Batch Size: 24
 - Epochs: 4
 - Learning Rate: 0.00005
+- With BOS-Token
 
 ## Evaluation
 
@@ -74,8 +74,16 @@ The model was tested on 3 metrics:
 ### Results
 
 
-
+
+
+
 
 ## Model Architecture
 
+GPT2 config with the following changes:
 
+- VOCAB_SIZE = 76
+- N_POSITION = 512
+- PAD_TOKEN_ID = 0
+- BOS_TOKEN_ID = 75
+- EOS_TOKEN_ID = 74
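
The config changes added under "Model Architecture" can be sanity-checked with a small sketch like the one below. It is illustrative only: the dictionary keys (`vocab_size`, `n_positions`, and so on) are an assumed mapping from the README's names to GPT-2 config keyword names, and `check_special_tokens` is a hypothetical helper, not something from the repository.

```python
# Assumed mapping from the README's names to GPT-2 config keyword names.
GPT2_OVERRIDES = {
    "vocab_size": 76,     # VOCAB_SIZE
    "n_positions": 512,   # N_POSITION
    "pad_token_id": 0,    # PAD_TOKEN_ID
    "bos_token_id": 75,   # BOS_TOKEN_ID
    "eos_token_id": 74,   # EOS_TOKEN_ID
}


def check_special_tokens(cfg: dict) -> None:
    """Hypothetical helper: raise ValueError if a special-token id falls
    outside the vocabulary or if two special tokens share an id."""
    names = ("pad_token_id", "bos_token_id", "eos_token_id")
    ids = {name: cfg[name] for name in names}
    for name, token_id in ids.items():
        if not 0 <= token_id < cfg["vocab_size"]:
            raise ValueError(f"{name}={token_id} is outside the vocabulary")
    if len(set(ids.values())) != len(ids):
        raise ValueError("special-token ids must be distinct")


# The values listed in the README pass both checks.
check_special_tokens(GPT2_OVERRIDES)
```

With a vocabulary of 76 entries, valid ids run from 0 to 75, so the BOS token at 75 occupies the last slot. If the Hugging Face `transformers` library is in use, a dictionary like this could plausibly be passed as `GPT2Config(**GPT2_OVERRIDES)`, but that usage is an assumption and is not stated in the README.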