Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
15
23
19
Wei Xiong
weqweasdas
Follow
Laihaoran's profile picture
linrongc's profile picture
Chenlu123's profile picture
20 followers
·
21 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Organizations
weqweasdas
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
about 1 year ago
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Text Generation
•
8B
•
Updated
May 10, 2025
•
3.5k
•
•
37
liked
a dataset
over 1 year ago
RLHFlow/RLHFlow-SFT-Dataset-ver2
Viewer
•
Updated
Nov 2, 2024
•
2.32M
•
23
•
5
liked
3 models
over 1 year ago
RLHFlow/Llama3.1-8B-PRM-Mistral-Data
Text Generation
•
8B
•
Updated
Nov 9, 2024
•
7
•
•
10
NCSOFT/Llama-3-OffsetBias-RM-8B
Text Classification
•
8B
•
Updated
Sep 6, 2024
•
122
•
24
RLHFlow/LLaMA3-SFT
Text Generation
•
8B
•
Updated
Nov 3, 2024
•
16
•
•
10
liked
9 models
almost 2 years ago
RLHFlow/LLaMA3-iterative-DPO-final
Text Generation
•
8B
•
Updated
Oct 14, 2024
•
33
•
•
41
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
Updated
Sep 23, 2024
•
18.9k
•
182
RLHFlow/pair-preference-model-LLaMA3-8B
Text Generation
•
8B
•
Updated
Oct 14, 2024
•
84
•
•
38
Salesforce/LLaMA-3-8B-SFR-RM-R
Text Classification
•
8B
•
Updated
Jan 21, 2025
•
3
•
11
Salesforce/LLaMA-3-8B-SFR-SFT-R
Text Generation
•
8B
•
Updated
Jan 21, 2025
•
12
•
8
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R
Text Generation
•
8B
•
Updated
Jan 21, 2025
•
39
•
78
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
8B
•
Updated
Oct 14, 2024
•
1.45k
•
60
sfairXC/FsfairX-Zephyr-Chat-v0.1
Text Generation
•
7B
•
Updated
Apr 24, 2024
•
5
•
8
weqweasdas/RM-Mistral-7B
Text Classification
•
7B
•
Updated
Mar 31, 2024
•
2.71k
•
25
liked
a Space
almost 2 years ago
Running
422
Reward Bench Leaderboard
📐
422
Explore RewardBench model rankings and scores
liked
2 models
about 2 years ago
weqweasdas/RM-Gemma-7B
Text Classification
•
9B
•
Updated
Mar 22, 2024
•
9
•
8
weqweasdas/RM-Gemma-2B
Text Classification
•
3B
•
Updated
Mar 22, 2024
•
182
•
25
liked
a model
over 2 years ago
weqweasdas/hh_rlhf_rm_open_llama_3b
Text Classification
•
Updated
Feb 25, 2024
•
193
•
17
liked
a Space
almost 3 years ago
Runtime error
Featured
66
Robin 7b
🔥
66