Evangelinejy/Qwen3_0.6B_LanTokenizer_ctx2048_multiturn_no_verify_lr0.0003 • 0.4B • Updated 5 days ago • 208
Evangelinejy/Qwen3_0.6B_LanTokenizer_ctx2048_multiturn_with_verify_lr0.0003 • 0.4B • Updated 5 days ago • 120
Evangelinejy/Qwen3_0.6B_LanTokenizer_ctx2048_singleturn_no_verify_lr0.0003 • 0.4B • Updated 8 days ago • 126
Evangelinejy/Qwen3_0.6B_LanTokenizer_ctx2048_singleturn_with_verify_lr0.0003 • 0.4B • Updated 8 days ago • 118
Evangelinejy/llama_3b_nemontron_midtrain_le8192_bs4_epoch5.0_ga1_lr5e-05 • 175k • Updated 13 days ago • 13
Evangelinejy/llama_3b_nemontron_midtrain_le8192_bs4_epoch5.0_ga1_lr5e-05_solution_only • 175k • Updated 13 days ago • 25
Evangelinejy/Qwen3_0.6B_LanTokenizer_ctx2048_SFT_trajectory_sep_cot_minimax_60 • 0.4B • Updated 24 days ago • 118
Evangelinejy/Qwen3_0.6B_LanTokenizer_ctx2048_SFT_trajectory_sep_cot_400 • 0.4B • Updated 28 days ago • 536
Evangelinejy/Qwen25-1_5b-midtrain-openthoughts-nothink-8192-epoch3.0-bs4 • 2B • Updated 30 days ago • 8
Evangelinejy/llama-32-3b-instruct-openthoughts-nothink-bs4-epoch2.0-ctx8192-ga2-lr1e-05-wr0.1-n4 • 175k • Updated Jan 26
Evangelinejy/llama-32-3b-instruct-openthoughts-nothink-bs4-epoch1.0-ctx8192-ga2-lr1e-05-wr0.1-n4 • 175k • Updated Jan 26 • 1
Evangelinejy/llama3b_base_openthoughts_solution_only-bs4-epoch1.0-ctx8192-ga1-lr5e-05-wr0.1-n4 • Text Generation • 175k • Updated Jan 22 • 2
Evangelinejy/llama3b_midtrain_openthoughts_solution_only-bs4-epoch1.0-ctx8192-ga1-lr5e-05-wr0.1-n4 • Text Generation • 175k • Updated Jan 22 • 2
Evangelinejy/qwen25-7b-prm_demo-bs2-epoch3.0-ctx4096-ga2-lr1e-05-wr0.1-n4 • Text Generation • 333k • Updated Dec 20, 2025