Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
14
YigeYuan
1t4chi
Follow
TTTXXX01's profile picture
1 follower
·
1 following
AI & ML interests
None yet
Recent Activity
updated
a model
30 days ago
1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0
published
a model
30 days ago
1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0
updated
a model
about 1 month ago
1t4chi/zhs-Qwen2.5-7B-AS-step-260-discount-1p0
View all activity
Organizations
None yet
1t4chi
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
updated
a model
30 days ago
1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0
8B
•
Updated
30 days ago
published
a model
30 days ago
1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0
8B
•
Updated
30 days ago
updated
a model
about 1 month ago
1t4chi/zhs-Qwen2.5-7B-AS-step-260-discount-1p0
8B
•
Updated
Feb 24
•
249
published
a model
about 1 month ago
1t4chi/zhs-Qwen2.5-7B-AS-step-260-discount-1p0
8B
•
Updated
Feb 24
•
249
updated
a model
about 1 month ago
1t4chi/zhs-Qwen2.5-7B-NQ-step-400-discount-1p0
8B
•
Updated
Feb 24
•
163
published
a model
about 1 month ago
1t4chi/zhs-Qwen2.5-7B-NQ-step-400-discount-1p0
8B
•
Updated
Feb 24
•
163
liked
a dataset
9 months ago
open-r1/DAPO-Math-17k-Processed
Viewer
•
Updated
Nov 10, 2025
•
34.8k
•
5.3k
•
62
updated
a model
12 months ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT
8B
•
Updated
Mar 30, 2025
•
1
published
a model
12 months ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT
8B
•
Updated
Mar 30, 2025
•
1
updated
a model
about 1 year ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT
Text Generation
•
8B
•
Updated
Mar 15, 2025
published
2 models
about 1 year ago
1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT
Text Generation
•
8B
•
Updated
Mar 15, 2025
1t4chi/Qwen2.5-Math-7B-QwQMath6K-SFT
Updated
Mar 14, 2025
updated
a model
about 1 year ago
1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532
8B
•
Updated
Mar 11, 2025
•
1
published
4 models
about 1 year ago
1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532
8B
•
Updated
Mar 11, 2025
•
1
1t4chi/Qwen2.5-Math-7B-4GPU-Nothink-KL0.00005
Updated
Mar 9, 2025
1t4chi/Qwen2.5-Math-7B-HJX8k-4GPU-Nothink-KL0.0-FindData
Updated
Mar 8, 2025
1t4chi/mistral-7b-base-simper
Updated
Feb 21, 2025
liked
a Space
over 1 year ago
Running
423
Reward Bench Leaderboard
📐
423
Explore RewardBench model rankings and scores
liked
2 models
over 1 year ago
RLHFlow/RewardModel-Mistral-7B-for-DPA-v1
Text Classification
•
7B
•
Updated
May 23, 2024
•
53
•
4
allenai/tulu-v2.5-dpo-13b-hh-rlhf
Text Generation
•
13B
•
Updated
Jun 14, 2024
•
26
•
1
Load more