Update README.md
README.md (CHANGED)
@@ -7,6 +7,6 @@ Contains files for a Transformer model that answers 6-digit subtraction question
 This subtraction model has 3 layers, 4 attention heads, d-model = 510, d-head = 170.
 The subtraction model was initialised with a very-low-loss Addition model (2 layers, 3 attention heads, 9e-9 loss), before being trained for 45K epochs.
 
-The CoLab used to train the model is here: https://github.com/
+The CoLab used to train the model is here: https://github.com/apartresearch/Verified_addition/blob/main/assets/Accurate_Math_Train.ipynb
 
-The CoLab used to analyse the model is here: https://github.com/
+The CoLab used to analyse the model is here: https://github.com/apartresearch/Verified_addition/blob/main/assets/Accurate_Math_Analyse.ipynb
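The README line above fixes the model's key hyperparameters. A minimal sketch of that configuration follows (a hypothetical dataclass for illustration, not code from the repository). One detail worth noting: n_heads × d_head = 4 × 170 = 680, which exceeds d_model = 510; this is legal in a standard Transformer because the output projection maps the concatenated head outputs back down to d_model.

```python
from dataclasses import dataclass


@dataclass
class SubtractionModelConfig:
    """Hyperparameters as stated in the README (class and field names are illustrative)."""
    n_layers: int = 3
    n_heads: int = 4
    d_model: int = 510
    d_head: int = 170


cfg = SubtractionModelConfig()

# Per-layer attention weight count under the usual parameterisation:
# W_Q, W_K, W_V each map d_model -> n_heads * d_head, and W_O maps
# n_heads * d_head back to d_model (biases omitted).
attn_params_per_layer = 4 * cfg.d_model * cfg.n_heads * cfg.d_head
print(attn_params_per_layer)  # 1387200 attention weights per layer
```

Across the 3 layers this gives roughly 4.2M attention parameters, before counting embeddings and MLPs.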