Datasets and Model Checkpoints for Paper "From SFT to RL: Demystifying the Post-Training Pipeline for LLM-based Vulnerability Detection"
Youpeng Li
Leopo1d
AI & ML interests
None yet
Recent Activity
updated a collection 12 days ago
OpenVul updated a model 15 days ago
Leopo1d/OpenVul-Qwen3-4B-GRPO published a model 15 days ago
Leopo1d/OpenVul-Qwen3-4B-GRPOOrganizations
None yet
models 6
Leopo1d/OpenVul-Qwen3-4B-GRPO
Text Generation • 196k • Updated • 219
Leopo1d/OpenVul-Qwen3-4B-DPO
Text Generation • 4B • Updated • 5
Leopo1d/OpenVul-Qwen3-4B-ORPO
Text Generation • 4B • Updated • 5
Leopo1d/OpenVul-Qwen3-4B-SFT-ep3
Text Generation • 196k • Updated • 164 •
Leopo1d/OpenVul-Qwen3-4B-SFT-ep1
Text Generation • 196k • Updated • 5
Leopo1d/OpenVul-Qwen3-4B-SFT-ep5
Text Generation • 196k • Updated • 10 •
datasets 10
Leopo1d/OpenVul
Updated • 34
Leopo1d/OpenVul_Sample_Specification_for_RL_Reward_Evaluation
Viewer • Updated • 15.6k • 39
Leopo1d/OpenVul_CWE_Hierarchical_Mapping
Viewer • Updated • 944 • 66 • 1
Leopo1d/OpenVul_Ground_Truth_Vulnerability_Information
Viewer • Updated • 9.77k • 22
Leopo1d/OpenVul_Vulnerability_Query_Dataset_for_RL
Viewer • Updated • 19.5k • 164
Leopo1d/OpenVul_Vulnerability_Preference_Dataset_for_DPO
Viewer • Updated • 7.24k • 714
Leopo1d/OpenVul_Vulnerability_Preference_Dataset_for_ORPO
Viewer • Updated • 7.05k • 677
Leopo1d/OpenVul_Rationalization_based_Vulnerability_Reasoning_Dataset_for_SFT
Viewer • Updated • 15.6k • 55
Leopo1d/OpenVul_Rejection_Sampling_based_Vulnerability_Reasoning_Dataset_for_SFT
Viewer • Updated • 6.28k • 73 • 1
Leopo1d/OpenVul_Distilled_Vulnerability_Reasoning_CoTs_from_DeepSeek-R1-0528
Viewer • Updated • 15.6k • 116