Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
15
24
19
Wei Xiong
weqweasdas
Follow
Chenlu123's profile picture
Trangle's profile picture
circulartext's profile picture
20 followers
·
21 following
https://weixiongust.github.io/WeiXiongUST/index.html
AI & ML interests
Machine learning, RLHF
Recent Activity
upvoted
a
paper
1 day ago
Stream-R1: Reliability-Perplexity Aware Reward Distillation for Streaming Video Generation
upvoted
a
paper
6 months ago
Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning
updated
a dataset
6 months ago
weqweasdas/qwen15b_train_simple_subset5k_for_difficulty_transition
View all activity
Organizations
weqweasdas
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
over 1 year ago
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Text Generation
•
8B
•
Updated
May 10, 2025
•
2.21k
•
•
38
liked
a dataset
over 1 year ago
RLHFlow/RLHFlow-SFT-Dataset-ver2
Viewer
•
Updated
Nov 2, 2024
•
2.32M
•
46
•
5
liked
3 models
over 1 year ago
RLHFlow/Llama3.1-8B-PRM-Mistral-Data
Text Generation
•
8B
•
Updated
Nov 9, 2024
•
263
•
•
10
NCSOFT/Llama-3-OffsetBias-RM-8B
Text Classification
•
8B
•
Updated
Sep 6, 2024
•
122
•
25
RLHFlow/LLaMA3-SFT
Text Generation
•
8B
•
Updated
Nov 3, 2024
•
107
•
•
10
liked
6 models
almost 2 years ago
RLHFlow/LLaMA3-iterative-DPO-final
Text Generation
•
8B
•
Updated
Oct 14, 2024
•
94
•
•
41
RLHFlow/ArmoRM-Llama3-8B-v0.1
Text Classification
•
Updated
Sep 23, 2024
•
16.1k
•
184
RLHFlow/pair-preference-model-LLaMA3-8B
Text Generation
•
8B
•
Updated
Oct 14, 2024
•
224
•
•
38
Salesforce/LLaMA-3-8B-SFR-RM-R
Text Classification
•
8B
•
Updated
Jan 21, 2025
•
11
•
11
Salesforce/LLaMA-3-8B-SFR-SFT-R
Text Generation
•
8B
•
Updated
Jan 21, 2025
•
97
•
8
Salesforce/LLaMA-3-8B-SFR-Iterative-DPO-R
Text Generation
•
8B
•
Updated
Jan 21, 2025
•
102
•
•
78
liked
3 models
about 2 years ago
sfairXC/FsfairX-LLaMA3-RM-v0.1
Text Classification
•
8B
•
Updated
Oct 14, 2024
•
3.1k
•
60
sfairXC/FsfairX-Zephyr-Chat-v0.1
Text Generation
•
7B
•
Updated
Apr 24, 2024
•
12
•
8
weqweasdas/RM-Mistral-7B
Text Classification
•
7B
•
Updated
Mar 31, 2024
•
3.4k
•
25
liked
a Space
about 2 years ago
Running
Agents
428
Reward Bench Leaderboard
📐
428
Explore RewardBench model rankings and scores
liked
2 models
about 2 years ago
weqweasdas/RM-Gemma-7B
Text Classification
•
9B
•
Updated
Mar 22, 2024
•
41.5k
•
8
weqweasdas/RM-Gemma-2B
Text Classification
•
3B
•
Updated
Mar 22, 2024
•
47.5k
•
25
liked
a model
almost 3 years ago
weqweasdas/hh_rlhf_rm_open_llama_3b
Text Classification
•
Updated
Feb 25, 2024
•
57
•
17
liked
a Space
about 3 years ago
Runtime error
Agents
Featured
66
Robin 7b
🔥
66