Fixed some issues and bugs. Finished trial training successfully 0cac660 abhinavv3 committed on Jul 23, 2025
Changed the attention mechanism to GQA in knn_attention and xl_attention f58ea49 abhinavv3 committed on Jul 22, 2025
Repo before implementing concepts from the paper Memorizing Transformers f6d6286 abhinavv3 committed on Jul 14, 2025