BredForCompanionship/vanilla_24L2048_parity_hotstop_L12_mlp_in_matryoshka_batch_top_k_k48_500M Updated 10 days ago
BredForCompanionship/vanilla_24L2048_parity_hotstop_L12_mlp_in_matryoshka_batch_top_k_k48_500M Updated 10 days ago
BredForCompanionship/vanilla_24L2048_parity_hotstop_L15_mlp_in_jump_relu_k48_250M Updated 12 days ago
BredForCompanionship/vanilla_24L2048_parity_hotstop_L1_mlp_in_batch_top_k_k48_250M Updated 12 days ago