·
AI & ML interests
None yet
Organizations
koreankiwi99/llama-3.1-8b-paraphrase-qlora-10000
Updated
koreankiwi99/0_predpo_lower_beta_balanced_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
2
koreankiwi99/1_predpo_tuned_balanced_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
2
koreankiwi99/2_predpo_base_balanced_plus_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
2
koreankiwi99/3_predpo_base_curriculum_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
2
koreankiwi99/4_dpo_curriculum_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
2
koreankiwi99/5_dpo_balanced_plus_lower_beta_mnlp_aggregate
0.6B
•
Updated
•
2
koreankiwi99/dpo_model_predpo_config_mnlp_aggregate
0.6B
•
Updated
•
2
koreankiwi99/sft_model_sft_base_mnlp_stem_curriculum
0.6B
•
Updated
•
2
koreankiwi99/sft_model_sft_base_mnlp_stem_balanced_plus
0.6B
•
Updated
•
2
koreankiwi99/6_predpo_base_lightweight_lower_beta_mnlp_aggregate
0.6B
•
Updated
koreankiwi99/MNLP_M3_dpo_model
0.6B
•
Updated
•
2
koreankiwi99/7_dpo_lightweight_lower_beta_mnlp_aggregate
0.6B
•
Updated
koreankiwi99/8_dpo_reasoning_lower_beta_mnlp_aggregate
0.6B
•
Updated
koreankiwi99/9_dpo_math_only_lower_beta_mnlp_aggregate
0.6B
•
Updated
koreankiwi99/sft_model_sft_base_mnlp_stem_balanced
0.6B
•
Updated
koreankiwi99/sft_model_sft_base_mnlp_stem_math_only
0.6B
•
Updated
koreankiwi99/sft_model_sft_base_mnlp_stem_reasoning
0.6B
•
Updated
koreankiwi99/10_dpo_base_mnlp_aggregate_with_math
0.6B
•
Updated
koreankiwi99/11_dpo_lower_beta_mnlp_aggregate_with_math
0.6B
•
Updated
koreankiwi99/sft_model_sft_base_mnlp_stem_lightweight
0.6B
•
Updated
koreankiwi99/12_dpo_slight_lower_beta_mnlp_aggregate
0.6B
•
Updated
koreankiwi99/13_dpo_model_HelpSteer3
Text Generation
•
0.6B
•
Updated
•
1
koreankiwi99/M2_dpo_model_base_Math-Step-DPO-10K
0.6B
•
Updated
•
1
koreankiwi99/15_dpo_lower_beta
Text Generation
•
0.6B
•
Updated
•
1
koreankiwi99/16_dpo_suggested
Text Generation
•
0.6B
•
Updated
•
1
Text Generation
•
0.6B
•
Updated
•
1
koreankiwi99/18_dpo_tuned_mnlp_aggregate
Text Generation
•
0.6B
•
Updated
•
1
koreankiwi99/M2_dpo_model_SHP
Text Generation
•
0.6B
•
Updated
•
1
koreankiwi99/19_baseline_dpo_model_mnlp_aggregate
Text Generation
•
0.6B
•
Updated
•
1