jln.shk_64x4: JSAE (resid_mid -> resid_out) 7.5k steps
j.mlp_layer.shk_64x4: JSAE (mlp_in -> mlp_out) 7.5k steps
David Quarel
davidquarel
AI & ML interests
None yet
Recent Activity
updated a model 20 days ago
davidquarel/arena-2.5-mcts-c4
updated a dataset 27 days ago
davidquarel/connect4-pons-eval
published a dataset 27 days ago
davidquarel/connect4-pons-eval
Organizations
None yet