SAELens
Tom Lieberum
fold in scaling by sqrt(d_model) into params
9ff4e7b
download
history blame
75.5 MB
This file is stored with Xet . It is too big to display, but you can still download it.

Xet Pointer Details

( Raw pointer file )
Xet hash:
f5063bb4f748f2420b2aef4873c2e1184e0fe3402f6b62f85b7c9b2e4a9f34ac
Size of remote file:
75.5 MB
·
SHA256:
8c663031d10179005329025fdb572a49a16d50270a8e724829bb0628ff8a02d1

Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.