·
AI & ML interests
None yet
Organizations
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm-T-1.0_math_test_8192_normal_K-1_T-0.5
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm-T-1.0_gsm8k_8192_normal_K-1_T-0.5
Viewer
• Updated
• 1k • 3
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm-T-1.0_math_test_8192_normal_K-1_T-1.0
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm-T-1.0_gsm8k_8192_normal_K-1_T-1.0
Viewer
• Updated
• 1k • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm_math_test_8192_normal_K-1_T-0.5
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm_gsm8k_8192_normal_K-1_T-0.5
Viewer
• Updated
• 1k • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm_math_test_8192_normal_K-1_T-0.5
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm_gsm8k_8192_normal_K-1_T-0.5
Viewer
• Updated
• 1k • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm_math_test_8192_normal_K-1
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping-nonorm_gsm8k_8192_normal_K-1
Viewer
• Updated
• 1k • 5
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-clipping-nonorm_math_test_8192_normal_K-1
Viewer
• Updated
• 458 • 3
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-clipping-nonorm_gsm8k_8192_normal_K-1
Viewer
• Updated
• 1k • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm_math_test_8192_normal_K-1
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping-nonorm_gsm8k_8192_normal_K-1
Viewer
• Updated
• 1k • 4
mengdili/Marco-train-K-16-alpha-2-k-8-type-linear-clipping-True-normalization-True
Viewer
• Updated
• 40k • 2
mengdili/Marco-train-K-16-alpha-2-k-8-type-log-clipping-False-normalization-False
Viewer
• Updated
• 40k • 7
mengdili/Marco-train-K-16-alpha-2-k-8-type-log-clipping-True-normalization-False
Viewer
• Updated
• 40k • 4
mengdili/Marco-train-K-16-alpha-2-k-8-type-linear-clipping-True-normalization-False
Viewer
• Updated
• 40k • 4
mengdili/Marco-train-K-16-alpha-2-k-8-type-linear-clipping-False-normalization-False
Viewer
• Updated
• 40k • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping_math_test_8192_normal_K-1
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-clipping_math_test_8192_normal_K-1
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-clipping_gsm8k_8192_normal_K-1
Viewer
• Updated
• 1k • 5
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping_math_test_8192_normal_K-1
Viewer
• Updated
• 458 • 4
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-linear-noclipping_gsm8k_8192_normal_K-1
Viewer
• Updated
• 1k • 3
mengdili/Marco-7B-Pruned-K-16-k-8-epoch-1-log-noclipping_gsm8k_8192_normal_K-1
Viewer
• Updated
• 1k • 5
mengdili/Marco-train-K-16-alpha-2-k-8-type-linear-clipping-False
Viewer
• Updated
• 40k • 4
mengdili/Marco-train-K-16-alpha-2-k-8-type-log-clipping-True
Viewer
• Updated
• 40k • 4
mengdili/Marco-train-K-16-alpha-2-k-8-type-log-clipping-False
Viewer
• Updated
• 40k • 4
mengdili/Marco-Pruned-K16-k-8-epoch-1_math_test_8192_normal_K-1
Viewer
• Updated
• 458 • 4
mengdili/Marco-Pruned-K16-k-8-epoch-1_gsm8k_8192_normal_K-1
Viewer
• Updated
• 1k • 4