Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Liang
Ren-Wei
Follow
AI & ML interests
None yet
Organizations
None yet
Ren-Wei
's models
61
Sort: Recently updated
Ren-Wei/Safe-RLHF-RM-harmless-mistral-7b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-RM-helpful-mistral-7b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-RM-helpful-llama3-8b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-RM-harmless-llama3-8b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-RM-harmless-llama3-3b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-RM-helpful-llama3-3b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-DPO-naive-baseline-epoch8-mistral-7b
Updated
Mar 11, 2025
Ren-Wei/Safe-RLHF-DPO-naive-baseline-epoch8-llama3-3b
Updated
Mar 11, 2025
•
1
Ren-Wei/Safe-RLHF-DPO-naive-baseline-epoch8-llama3-8b
Updated
Mar 11, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-epoch8-llama3-8b
Updated
Mar 11, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-epoch8-llama3-3b
Updated
Mar 11, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-epoch8-mistral-7b
Updated
Mar 11, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-mistral-7b
Updated
Mar 11, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-llama3-8b
Updated
Mar 11, 2025
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-llama3-3b
Updated
Mar 11, 2025
•
1
Ren-Wei/Safe-RLHF-PPO-Lag-baseline-mistral-7b
Updated
Feb 26, 2025
•
1
Ren-Wei/Safe-RLHF-PPO-Lag-baseline-llama3-8b
Updated
Feb 26, 2025
•
1
Ren-Wei/Safe-RLHF-PPO-Lag-baseline-llama3-3b
Updated
Feb 26, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-epoch4-llama3-8b
Updated
Feb 23, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-epoch4-mistral-7b
Updated
Feb 22, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-mistral-7b
Updated
Feb 22, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-llama3-8b
Updated
Feb 22, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-epoch4-llama3-3b
Updated
Feb 22, 2025
Ren-Wei/Safe-RLHF-BFPO-baseline-llama3-3b
Updated
Feb 22, 2025
•
1
Ren-Wei/Safe-RLHF-DPO-naive-baseline-epoch4-mistral-7b
Updated
Feb 16, 2025
•
1
Ren-Wei/Safe-RLHF-DPO-naive-baseline-mistral-7b
Updated
Feb 16, 2025
•
1
Ren-Wei/Safe-RLHF-DPO-helpful-mistral-7b
Updated
Feb 16, 2025
Ren-Wei/Safe-RLHF-DPO-helpless-mistral-7b
Updated
Feb 16, 2025
Ren-Wei/Safe-RLHF-DPO-harmless-mistral-7b
Updated
Feb 16, 2025
Ren-Wei/Safe-RLHF-DPO-harmful-mistral-7b
Updated
Feb 16, 2025
Previous
1
2
3
Next