ProgramTrace
non-profit
AI & ML interests
None defined yet.
models 8
PTPReasoning/Llama-3.1-8B-RL-Clean-V2
8B • Updated
PTPReasoning/Llama-3.1-8B-RL-Baseline-V2
8B • Updated
PTPReasoning/Llama-3.1-8B-SFT-Baseline
Text Generation • 8B • Updated
PTPReasoning/Llama-3.1-8B-SFT-Clean-V2
Text Generation • 8B • Updated
PTPReasoning/Qwen2.5-7B-Base-RL-Clean-V2
Text Generation • 8B • Updated
PTPReasoning/Qwen2.5-7B-Base-RL-Baseline
Text Generation • 8B • Updated
PTPReasoning/Qwen2.5-7B-Base-SFT-Clean-V2
Text Generation • 8B • Updated
PTPReasoning/Qwen2.5-7B-Base-SFT-Baseline-V2
Text Generation • 8B • Updated
datasets 12
PTPReasoning/finqa
Viewer
• Updated
• 1.15k • 65
PTPReasoning/hotpot_qa
Viewer
• Updated
• 500 • 43
PTPReasoning/PubMedQA
Viewer
• Updated
• 1.5k • 7
PTPReasoning/MedCalc-Bench-v1.0
Viewer
• Updated
• 22.5k • 11 • 2
PTPReasoning/PTP-RL-ITL-Final-Clean-V2
Viewer
• Updated
• 19k • 4
PTPReasoning/PTP-SFT-ITL-Final-Baseline-V2
Viewer
• Updated
• 4.12k • 8
PTPReasoning/PTP-SFT-ITL-Final-Clean-V2
Viewer
• Updated
• 4.21k • 7
PTPReasoning/PTP-RL-MedCalc-Bench
Viewer
• Updated
• 9.34k • 7
PTPReasoning/PTP-RL-DAPO-EN
Viewer
• Updated
• 14.1k • 6
PTPReasoning/mmlu_pro_biology
Viewer
• Updated
• 717 • 6