AI & ML interests
None yet
Organizations
None yet
mtzig/reverse_add_replicate_eval17_small_1layer_d1_50
55
•
Updated
•
1
mtzig/reverse_add_replicate_eval17_small_1layer_d1_20
55
•
Updated
•
1
mtzig/reverse_add_replicate_eval17_small_1layer_d2
55
•
Updated
10.7M
•
Updated
•
1
mtzig/maze_replicate_10_test
10.7M
•
Updated
•
1
mtzig/reverse_add_replicate_eval17_small_1layer
1.11k
•
Updated
•
1
mtzig/reverse_add_replicate_eval17_small
7.04k
•
Updated
•
1
mtzig/reverse_add_replicate_eval17_corruptedfull
10.7M
•
Updated
•
5
mtzig/reverse_add_replicate_eval17_corrupted
10.7M
•
Updated
mtzig/reverse_add_replicate_eval17_SGD_largelr
10.7M
•
Updated
mtzig/reverse_add_replicate_eval17_SGD
10.7M
•
Updated
mtzig/reverse_add_replicate_eval18
10.7M
•
Updated
mtzig/reverse_add_replicate_eval30
10.7M
•
Updated
•
1
mtzig/reverse_add_replicate_eval20
10.7M
•
Updated
mtzig/reverse_add_replicate
10.7M
•
Updated
•
1
mtzig/gpt2_cfg_add_8_to_64_train_attn
Updated
mtzig/gpt2_cfg_add_8_to_32_train_mlp
Updated
10.7M
•
Updated
7.04k
•
Updated
•
3
880
•
Updated
•
1
mtzig/mmlu_small_noaugs_llama_lora
Updated
Updated
•
147
mtzig/prm800k_qwen_alt_lora
Updated
mtzig/prm800k_llama_joint_checkpoint8500
8B
•
Updated
mtzig/prm800k_llama_joint_checkpoint4500
8B
•
Updated
mtzig/prm800k_mistral_full_1203_re
Text Generation
•
7B
•
Updated