·
AI & ML interests
None yet
Organizations
None yet
sayantan0013/MNLP_purturbed_preference_data_qwen_ramp_clean_ramp
Text Generation
•
0.6B
•
Updated
•
2
sayantan0013/ultrafeed_mnlp_pref_ramp
Updated
sayantan0013/MNLP_purturbed_preference_data_relevance_ramp
Text Generation
•
0.6B
•
Updated
•
3
sayantan0013/tiny_dpo_dataset_hinge
Text Generation
•
0.6B
•
Updated
•
2
sayantan0013/qwen_hinge_clean
Text Generation
•
0.6B
•
Updated
•
2
sayantan0013/MNLP_purturbed_preference_data_clarity_hinge
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/MNLP_purturbed_preference_data_relevance_hinge
Text Generation
•
0.6B
•
Updated
•
2
sayantan0013/MNLP_purturbed_preference_data_correctness_hinge
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/MNLP_purturbed_preference_data_completeness_hinge
Text Generation
•
0.6B
•
Updated
•
2
sayantan0013/MNLP_M3_dpo_model
Text Generation
•
0.6B
•
Updated
•
2
sayantan0013/tiny_qwen_full
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/test_qwen_beta_10.0
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/test_qwen_beta_1.0
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/test_qwen_beta_0.001
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/test_qwen_beta_0.01
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/test_qwen_beta_0.1
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/tiny_qwen_rubi-noreason
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/tiny_qwen_rubi_phase_3
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/tiny-true-qwen
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/Qwen3-0.6B-mcqa-reason-qwen3-dpo_1748976136
Updated
sayantan0013/Qwen3-0.6B-mcqa-reason-qwen3-dpo_1748976078
Updated
sayantan0013/tiny-qwen-rubi-reason
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/tiny-qwen-base
Text Generation
•
0.6B
•
Updated
•
1
Text Generation
•
0.6B
•
Updated
•
1
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/Qwen3-0.6B-Refined-SFT
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/Qwen3-0.6B-DPO-SFT
Text Generation
•
0.6B
•
Updated
•
2
sayantan0013/MNLP_M2_dpo_model
Text Generation
•
0.6B
•
Updated
•
1
sayantan0013/Qwen3-0.6B-SFT
Text Generation
•
0.6B
•
Updated
•
2
sayantan0013/Qwen3-0.6B-DPO-Base
Text Generation
•
0.6B
•
Updated
•
2