Liang
Ren-Wei
AI & ML interests
None yet
Organizations
None yet
Ren-Wei's models (61)
Ren-Wei/Safe-RLHF-RM-harmless-mistral-7b
Updated
Mar 14, 2025
•
2
Ren-Wei/Safe-RLHF-RM-helpful-mistral-7b
Updated
Mar 14, 2025
•
2
Ren-Wei/Safe-RLHF-RM-helpful-llama3-8b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-RM-harmless-llama3-8b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-RM-harmless-llama3-3b
Updated
Mar 14, 2025
•
1
Ren-Wei/Safe-RLHF-RM-helpful-llama3-3b
Updated
Mar 14, 2025
•
2
Ren-Wei/Safe-RLHF-DPO-naive-baseline-epoch8-mistral-7b
Updated
Mar 11, 2025
•
2
Ren-Wei/Safe-RLHF-DPO-naive-baseline-epoch8-llama3-3b
Updated
Mar 11, 2025
•
2
Ren-Wei/Safe-RLHF-DPO-naive-baseline-epoch8-llama3-8b
Updated
Mar 11, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-epoch8-llama3-8b
Updated
Mar 11, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-epoch8-llama3-3b
Updated
Mar 11, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-epoch8-mistral-7b
Updated
Mar 11, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-mistral-7b
Updated
Mar 11, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-llama3-8b
Updated
Mar 11, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-alpha-llama3-3b
Updated
Mar 11, 2025
•
1
Ren-Wei/Safe-RLHF-PPO-Lag-baseline-mistral-7b
Updated
Feb 26, 2025
•
1
Ren-Wei/Safe-RLHF-PPO-Lag-baseline-llama3-8b
Updated
Feb 26, 2025
•
2
Ren-Wei/Safe-RLHF-PPO-Lag-baseline-llama3-3b
Updated
Feb 26, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-epoch4-llama3-8b
Updated
Feb 23, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-epoch4-mistral-7b
Updated
Feb 22, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-mistral-7b
Updated
Feb 22, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-llama3-8b
Updated
Feb 22, 2025
•
2
Ren-Wei/Safe-RLHF-BFPO-baseline-epoch4-llama3-3b
Updated
Feb 22, 2025
•
1
Ren-Wei/Safe-RLHF-BFPO-baseline-llama3-3b
Updated
Feb 22, 2025
•
1
Ren-Wei/Safe-RLHF-DPO-naive-baseline-epoch4-mistral-7b
Updated
Feb 16, 2025
•
2
Ren-Wei/Safe-RLHF-DPO-naive-baseline-mistral-7b
Updated
Feb 16, 2025
•
2
Ren-Wei/Safe-RLHF-DPO-helpful-mistral-7b
Updated
Feb 16, 2025
•
2
Ren-Wei/Safe-RLHF-DPO-helpless-mistral-7b
Updated
Feb 16, 2025
•
2
Ren-Wei/Safe-RLHF-DPO-harmless-mistral-7b
Updated
Feb 16, 2025
•
1
Ren-Wei/Safe-RLHF-DPO-harmful-mistral-7b
Updated
Feb 16, 2025
•
1