Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
14
YigeYuan
1t4chi
Follow
TTTXXX01's profile picture
1 follower
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
about 1 month ago
1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0
published
a model
about 1 month ago
1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0
updated
a model
about 1 month ago
1t4chi/zhs-Qwen2.5-7B-AS-step-260-discount-1p0
View all activity
Organizations
None yet
1t4chi
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a dataset
9 months ago
open-r1/DAPO-Math-17k-Processed
Viewer
•
Updated
Nov 10, 2025
•
34.8k
•
5.3k
•
62
liked
a Space
over 1 year ago
Running
423
Reward Bench Leaderboard
📐
423
Explore RewardBench model rankings and scores
liked
4 models
over 1 year ago
RLHFlow/RewardModel-Mistral-7B-for-DPA-v1
Text Classification
•
7B
•
Updated
May 23, 2024
•
53
•
4
allenai/tulu-v2.5-dpo-13b-hh-rlhf
Text Generation
•
13B
•
Updated
Jun 14, 2024
•
26
•
1
allenai/tulu-2-dpo-13b
Text Generation
•
Updated
May 17, 2024
•
1.62k
•
•
21
PKU-Alignment/beaver-7b-v1.0
Reinforcement Learning
•
7B
•
Updated
May 9, 2024
•
18
•
13
liked
3 datasets
over 1 year ago
PKU-Alignment/PKU-SafeRLHF
Viewer
•
Updated
Oct 18, 2024
•
164k
•
14.8k
•
178
PKU-Alignment/PKU-SafeRLHF-10K
Viewer
•
Updated
Jul 20, 2023
•
10k
•
726
•
62
unalignment/toxic-dpo-v0.2
Viewer
•
Updated
Jan 9, 2024
•
541
•
429
•
138
liked
a model
over 1 year ago
HelpingAI/HelpingAI-9B
Text Generation
•
9B
•
Updated
Oct 31, 2024
•
67
•
26
liked
2 datasets
over 1 year ago
rngusry/UltraFeedback-honesty-preferences
Viewer
•
Updated
Aug 3, 2024
•
251k
•
9
•
1
rngusry/UltraFeedback-truthfulness-preferences
Viewer
•
Updated
Jul 25, 2024
•
217k
•
14
•
1
liked
2 models
over 1 year ago
jointpreferences/mistral_7b_sft_helpful
Text Generation
•
7B
•
Updated
Apr 2, 2024
•
6
•
1
GraySwanAI/Mistral-7B-Instruct-RR
Text Generation
•
7B
•
Updated
Jul 9, 2024
•
840
•
5