AI & ML interests
None yet
Organizations
Litzy0619/MIS0731_lr_3e3_scaling_0005_warmup_15_timescale_15_cyclic_4_ensemble_shared_emb_512
Litzy0619/MIS0731_lr_3e3_scaling_0005_warmup_15_timescale_15_cyclic_4_ensemble_shared_bs_1024_emb_512
Litzy0619/MIS0731_lr_3e3_scaling_0005_warmup_15_timescale_15_cyclic_4_ensemble_shared_bs_512_emb_512
Litzy0619/MIS0731_lr_3e3_scaling_0005_warmup_15_timescale_15_cyclic_4_ensemble_shared_bs_256_emb_512
Litzy0619/MIS0731_lr_3e3_scaling_0005_warmup_15_timescale_15_cyclic_4_shared_emb_512
Litzy0619/phi2_noempty_48
Litzy0619/phi2_noempty_24
Litzy0619/phi2_noempty_36
Litzy0619/phi3_noempty_48
Litzy0619/phi3_noempty_36
Litzy0619/phi3_noempty_24
Litzy0619/MIS0726_lr_3e3_scaling_0005_warmup_0_plaueau_10_decay_095_bs_96_ensemble_shared
Litzy0619/MIS0726_lr_3e3_scaling_0005_warmup_0_plaueau_10_decay_095_bs_96_ensemble
Litzy0619/MIS0726_lr_3e3_scaling_0005_warmup_0_plaueau_10_decay_095_bs_256_ensemble_shared
Litzy0619/MIS0726_lr_3e3_scaling_0005_warmup_0_plaueau_10_decay_095_bs_256_ensemble
Litzy0619/MIS0726_lr_3e3_scaling_0005_warmup_0_plaueau_10_decay_095_bs_512_ensemble_shared