Organizations
MNC-LLM/batch1_epochs4_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu32 • Text Generation • 7B • Updated • 6
MNC-LLM/batch1_epochs1_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated • 9
MNC-LLM/Mistral-7B-NWS-u2k-Marcoroni-prompt-found-LaAdMoAl-ep4lr5 • Text Generation • Updated • 8
MNC-LLM/Mistral-7B-NWS-u2k-merge-Marcoroni-LaAdMoAl-ep4-lr5 • Text Generation • Updated • 12
MNC-LLM/batch1_epochs4_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated • 18
MNC-LLM/batch1_epochs2_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu32 • Updated
MNC-LLM/Mistral-7B-NWS-u2k-merge-Marcoroni • Text Generation • Updated • 8
MNC-LLM/Mistral-7B-LaAdMoAl-merge-Marcoroni • Text Generation • Updated • 6
MNC-LLM/tulu-2-dpo-7B-NWSCot-600-ep4lr5 • Text Generation • Updated • 22
MNC-LLM/Tulu-2-DPO-7B-NWSO-5k-4ep-lr5 • Text Generation • Updated • 25
MNC-LLM/batch1_epochs6_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated • 8
MNC-LLM/Mistral-7B-1st-NWSeCot-5k-lr6-ep2-2nd-La-Ad-Mo-lr5-ep4 • Text Generation • Updated • 8
MNC-LLM/Mistral-7B-1st-NWSeCot-5k-lr6-ep2-2nd-La-Ad-Mo-1k-lr5-ep4 • Text Generation • Updated • 10
MNC-LLM/Mistral-7B-1st-NWSeCot-5k-lr6-ep2-2nd-La-Ad-lr5-ep4 • Text Generation • Updated • 9
MNC-LLM/Mistral-7B-1st-NWSeCot-5k-lr6-ep2-2nd-Mo-lr5-ep4 • Text Generation • Updated • 10
MNC-LLM/Mistral-7B-1st-NWSeCot-5k-lr6-ep2-2nd-Mo-1k-lr5-ep4 • Text Generation • Updated • 9
MNC-LLM/law-hang-cot-2300-hang-data-all-batch1-epochs6-lr1e-05-length2048 • Text Generation • Updated • 9
MNC-LLM/batch1_epochs2_lr1e-06_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated • 7
MNC-LLM/batch1_epochs2_lr1e-05_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Updated
MNC-LLM/batch1_epochs4_lr1e-06_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated • 18
MNC-LLM/batch1_epochs4_lr0.0001_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated • 15
MNC-LLM/Mistral-7B-NWS-u2k-eng-cot-ep4-lr1e-05 • Text Generation • Updated • 11
MNC-LLM/batch1_epochs4_lr2e-06_paged_adamw_32bit_cosine_length2048_warmup_0.05_max_grad1.0_grad_accu16 • Text Generation • Updated • 10
MNC-LLM/batch1_epochs6_lr1e-06_paged_adamw_32bit_cosine_length4096_warmup_0.05_max_grad1.0_grad_accu10 • Updated
MNC-LLM/Mistral-7B-HANGANBU-epochs2-lr1e-06 • Text Generation • Updated • 8
MNC-LLM/Mistral-7B-Instruct-HANGANBU-epochs2-lr1e-06 • Text Generation • Updated • 11
MNC-LLM/Mistral-7B-Instruct-Foundation-CoT-5KEA-fixed-prompt-lr6-epoch2 • Text Generation • Updated • 12
MNC-LLM/Mistral-7B-Foundation-CoT-5KEA-fixed-prompt-lr6-epoch4 • Text Generation • Updated • 9
MNC-LLM/Mistral-7B-Foundation-CoT-5KEA-fixed-prompt-lr6-epoch2 • Text Generation • Updated • 8
MNC-LLM/Mistral-7B-Instruct-Foundation-CoT-5KEA-lr6-epoch2 • Text Generation • Updated • 10