GrandaddyShmax
first push
00f2826
# @package __global__
# overrides nothing because default is already transformer base (~ 60M params)