AI & ML interests
None defined yet.
Recent Activity
View all activity
models 41
AIPlans/Qwen2.5-1.5B-KTO-PKU-SafeRLHF
2B • Updated • 21
AIPlans/Qwen3-0.6B-GRPO-CrossCoder-Only
Updated • 14
AIPlans/Qwen3-0.6B-ORPO-CrossCoder-Only
Updated • 15
AIPlans/Qwen3-0.6B-IPO-CrossCoder-Only
Updated • 19
AIPlans/Qwen3-0.6B-KTO-CrossCoder-Only
Updated • 19
AIPlans/Qwen3-0.6B-PPO-CrossCoder-Only
Updated • 18
AIPlans/Qwen3-0.6B-PPO
Text Generation • 0.6B • Updated • 70 • • 1
AIPlans/Qwen3-0.6B-KTO1
Text Generation • 0.8B • Updated • 6
AIPlans/Qwen3-0.6B-ORPO-Crosscoder-MixedDataset
Updated
AIPlans/Qwen3-0.6B-GRPO-Crosscoder-MixedDataset
Updated
datasets 18
AIPlans/PKU-SafeRLHF-RLHF
Viewer • Updated • 37k • 19
AIPlans/Helpsteer2-helpfulness-prompts
Viewer • Updated • 7.22k • 4
AIPlans/helpsteer2-helpfulness-preference-cleaned
Viewer • Updated • 6.99k • 6
AIPlans/trackio-experiments
Updated • 2
AIPlans/ultrafeedback_binarized_chinese
Viewer • Updated • 14k • 171
AIPlans/ultrafeedback_binarized
Viewer • Updated • 14k • 24
AIPlans/FilteredPKU-SafeRLHF_chinese
Viewer • Updated • 12k • 20
AIPlans/FilteredPKU-SafeRLHF
Viewer • Updated • 12k • 6
AIPlans/SafetyBench_WithLabels_Better_chinese
Viewer • Updated • 546 • 13
AIPlans/SafetyBench_WithLabels
Viewer • Updated • 546 • 15