AI & ML interests
None yet
Organizations
None yet
mtzig/reverse_add_replicate_eval17_small_1layer_d1_50
55 • Updated • 3
mtzig/reverse_add_replicate_eval17_small_1layer_d1_20
55 • Updated • 1
mtzig/reverse_add_replicate_eval17_small_1layer_d2
55 • Updated • 1
10.7M • Updated • 1
mtzig/maze_replicate_10_test
10.7M • Updated mtzig/reverse_add_replicate_eval17_small_1layer
1.11k • Updated • 1
mtzig/reverse_add_replicate_eval17_small
7.04k • Updated • 1
mtzig/reverse_add_replicate_eval17_corruptedfull
10.7M • Updated mtzig/reverse_add_replicate_eval17_corrupted
10.7M • Updated • 2
mtzig/reverse_add_replicate_eval17_SGD_largelr
10.7M • Updated mtzig/reverse_add_replicate_eval17_SGD
10.7M • Updated mtzig/reverse_add_replicate_eval18
10.7M • Updated mtzig/reverse_add_replicate_eval30
10.7M • Updated • 1
mtzig/reverse_add_replicate_eval20
10.7M • Updated • 1
mtzig/reverse_add_replicate
10.7M • Updated mtzig/gpt2_cfg_add_8_to_64_train_attn
Updated
mtzig/gpt2_cfg_add_8_to_32_train_mlp
Updated
10.7M • Updated • 1
7.04k • Updated 880 • Updated mtzig/mmlu_small_noaugs_llama_lora
Updated
Updated • 51
mtzig/prm800k_qwen_alt_lora
mtzig/prm800k_llama_joint_checkpoint8500
8B • Updated mtzig/prm800k_llama_joint_checkpoint4500
8B • Updated mtzig/prm800k_mistral_full_1203_re
Text Generation
• 7B • Updated