YigeYuan

1t4chi

AI & ML interests

None yet

Recent Activity

updated a model 30 days ago

1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0

published a model 30 days ago

1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0

updated a model about 1 month ago

1t4chi/zhs-Qwen2.5-7B-AS-step-260-discount-1p0

View all activity

Organizations

None yet

updated a model 30 days ago

1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0

8B • Updated 30 days ago

published a model 30 days ago

1t4chi/zhs-Qwen2.5-7B-GRPO-NQ-step-200-discount-1p0

8B • Updated 30 days ago

updated a model about 1 month ago

1t4chi/zhs-Qwen2.5-7B-AS-step-260-discount-1p0

8B • Updated Feb 24 • 249

published a model about 1 month ago

1t4chi/zhs-Qwen2.5-7B-AS-step-260-discount-1p0

8B • Updated Feb 24 • 249

updated a model about 1 month ago

1t4chi/zhs-Qwen2.5-7B-NQ-step-400-discount-1p0

8B • Updated Feb 24 • 163

published a model about 1 month ago

1t4chi/zhs-Qwen2.5-7B-NQ-step-400-discount-1p0

8B • Updated Feb 24 • 163

liked a dataset 9 months ago

open-r1/DAPO-Math-17k-Processed

Viewer • Updated Nov 10, 2025 • 34.8k • 5.3k • 62

updated a model 12 months ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT

8B • Updated Mar 30, 2025 • 1

published a model 12 months ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K4096-BS128-LR1e-5-SFT

8B • Updated Mar 30, 2025 • 1

updated a model about 1 year ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT

Text Generation • 8B • Updated Mar 15, 2025

published 2 models about 1 year ago

1t4chi/Qwen2.5-Math-7B-QwQMath8K-SFT

Text Generation • 8B • Updated Mar 15, 2025

1t4chi/Qwen2.5-Math-7B-QwQMath6K-SFT

Updated Mar 14, 2025

updated a model about 1 year ago

1t4chi/Qwen2.5-Math-7B-KL0-1.0e-07-checkpoint-532

8B • Updated Mar 11, 2025 • 1

published 4 models about 1 year ago

liked a Space over 1 year ago

Reward Bench Leaderboard

📐

423

Explore RewardBench model rankings and scores

liked 2 models over 1 year ago

RLHFlow/RewardModel-Mistral-7B-for-DPA-v1

Text Classification • 7B • Updated May 23, 2024 • 53 • 4

allenai/tulu-v2.5-dpo-13b-hh-rlhf

Text Generation • 13B • Updated Jun 14, 2024 • 26 • 1

YigeYuan

AI & ML interests

Recent Activity

Organizations

1t4chi's activity

Reward Bench Leaderboard