·
AI & ML interests
None yet
Organizations
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm-T-1.0_math_test_8192_normal_K-1_T-0.5
Viewer
• Updated • 458 • 5
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm-T-1.0_gsm8k_8192_normal_K-1_T-0.5
Viewer
• Updated • 1k • 17
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm-T-1.0_math_test_8192_normal_K-1_T-1.0
Viewer
• Updated • 458 • 17
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm-T-1.0_gsm8k_8192_normal_K-1_T-1.0
Viewer
• Updated • 1k • 58
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm_math_test_8192_normal_K-1_T-0.5
Viewer
• Updated • 458 • 66
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm_gsm8k_8192_normal_K-1_T-0.5
Viewer
• Updated • 1k • 18
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm_math_test_8192_normal_K-1_T-0.5
Viewer
• Updated • 458 • 28
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm_gsm8k_8192_normal_K-1_T-0.5
Viewer
• Updated • 1k • 9
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm_math_test_8192_normal_K-1
Viewer
• Updated • 458 • 10
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm_gsm8k_8192_normal_K-1
Viewer
• Updated • 1k • 8
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-clipping-nonorm_math_test_8192_normal_K-1
Viewer
• Updated • 458 • 20
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-clipping-nonorm_gsm8k_8192_normal_K-1
Viewer
• Updated • 1k • 16
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm_math_test_8192_normal_K-1
Viewer
• Updated • 458 • 16
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm_gsm8k_8192_normal_K-1
Viewer
• Updated • 1k • 2
mengdili/Marco-train-K-16-alpha-2-k-8-type-linear-clipping-True-normalization-True
Viewer
• Updated • 40k • 3
mengdili/Marco-train-K-16-alpha-2-k-8-type-log-clipping-False-normalization-False
Viewer
• Updated • 40k • 4
mengdili/Marco-train-K-16-alpha-2-k-8-type-log-clipping-True-normalization-False
Viewer
• Updated • 40k • 45
mengdili/Marco-train-K-16-alpha-2-k-8-type-linear-clipping-True-normalization-False
Viewer
• Updated • 40k • 58
mengdili/Marco-train-K-16-alpha-2-k-8-type-linear-clipping-False-normalization-False
Viewer
• Updated • 40k • 26
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping_math_test_8192_normal_K-1
Viewer
• Updated • 458 • 2
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-clipping_math_test_8192_normal_K-1
Viewer
• Updated • 458 • 2
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-clipping_gsm8k_8192_normal_K-1
Viewer
• Updated • 1k • 2
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping_math_test_8192_normal_K-1
Viewer
• Updated • 458 • 2
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping_gsm8k_8192_normal_K-1
Viewer
• Updated • 1k • 2
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping_gsm8k_8192_normal_K-1
Viewer
• Updated • 1k • 2
mengdili/Marco-train-K-16-alpha-2-k-8-type-linear-clipping-False
Viewer
• Updated • 40k • 12
mengdili/Marco-train-K-16-alpha-2-k-8-type-log-clipping-True
Viewer
• Updated • 40k • 5
mengdili/Marco-train-K-16-alpha-2-k-8-type-log-clipping-False
Viewer
• Updated • 40k • 13
mengdili/Marco-Pruned-K16-k-8-epoch-1_math_test_8192_normal_K-1
Viewer
• Updated • 458 • 5
mengdili/Marco-Pruned-K16-k-8-epoch-1_gsm8k_8192_normal_K-1
Viewer
• Updated • 1k • 23