jsae shakespeare 64x4 max_steps=20k (mlpin -> mlpout)
Collection
A sweep of JSAE training with jacobian penalty from 1e-3 to 1e-2, trained for 20k steps. (added zero sparsity for baseline)
•
14 items
•
Updated
No model card