Pretrained models from the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"
Zayd Muhammad Kawakibi Zuhri
zaydzuhri
AI & ML interests
I really like watching loss go down
Recent Activity
updated a dataset 3 days ago
zaydzuhri/single-recall
updated a dataset 6 days ago
zaydzuhri/multi-stack-ops
updated a dataset 6 days ago
zaydzuhri/counting
Organizations
None yet