Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
19
16
8
Xirui Li
PRO
AIcell
Follow
lucazsh's profile picture
Dolphin42's profile picture
Gargaz's profile picture
5 followers
·
14 following
https://xirui-li.github.io/
xiruili7_li
xirui-li
AI & ML interests
Foundation LLM and VLM
Recent Activity
liked
a model
4 days ago
KangLiao/Puffin
updated
a model
10 days ago
AIcell/InternVL2_5-1B-SAT-RL-6000
published
a model
10 days ago
AIcell/InternVL2_5-1B-SAT-RL-6000
View all activity
Organizations
AIcell
's models
31
Sort: Recently updated
AIcell/InternVL2_5-1B-SAT-RL-6000
0.9B
•
Updated
10 days ago
•
11
AIcell/InternVL2_5-1B-SAT-RL-4800
0.9B
•
Updated
10 days ago
•
13
AIcell/InternVL2_5-1B-SAT-RL-3600
0.9B
•
Updated
10 days ago
•
12
AIcell/InternVL2_5-1B-SAT-RL-2400
0.9B
•
Updated
10 days ago
•
10
AIcell/InternVL2_5-1B-SAT-RL-1200
0.9B
•
Updated
10 days ago
•
12
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-Majority
2B
•
Updated
Nov 3
•
4
AIcell/Qwen-1.5B-Instruct-GRPO-Majority
2B
•
Updated
Oct 31
•
3
AIcell/Qwen-1.5B-Instruct-GRPO-Random
2B
•
Updated
Oct 31
•
3
AIcell/Qwen-1.5B-Instruct-GRPO
2B
•
Updated
Oct 31
•
4
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-non-reasoning
2B
•
Updated
Oct 24
•
3
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-opposite
2B
•
Updated
Oct 24
•
6
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-random
2B
•
Updated
Oct 21
•
4
AIcell/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
2B
•
Updated
Oct 17
•
3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k
Text Generation
•
2B
•
Updated
Oct 13
•
10
AIcell/Qwen2.5-0.5B-Instruct-GRPO-gsm8k
Text Generation
•
0.5B
•
Updated
Oct 13
•
9
AIcell/Qwen2.5-3B-Instruct-GRPO-gsm8k
Updated
Oct 10
AIcell/Qwen2.5-1.5B-Instruct-GRPO-DAPO17k-thinking
2B
•
Updated
Oct 6
•
2
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math220k-thinking
Text Generation
•
2B
•
Updated
Oct 5
•
10
AIcell/Qwen2.5-1.5B-Math-Instruct-GRPO-gsm8k
Text Generation
•
2B
•
Updated
Sep 29
•
10
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-random-reward
Text Generation
•
2B
•
Updated
Sep 26
•
15
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-no-thinking
2B
•
Updated
Sep 26
•
21
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-monitor
Text Generation
•
2B
•
Updated
Sep 12
•
6
AIcell/Qwen2.5-1.5B-Instruct-GRPO-gsm8k-plain
Text Generation
•
2B
•
Updated
Sep 12
•
7
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-GPQA-Diamond-thinking
Updated
Aug 21
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-MATH-500-thinking
Updated
Aug 21
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-thinking
Updated
Aug 20
AIcell/Qwen2.5-1.5B-Base-GRPO-Math12k
Updated
Jul 3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k-no-thinkng
Updated
Jul 3
AIcell/Qwen2.5-1.5B-Instruct-GRPO-Math12k
Updated
Jul 1
AIcell/Qwen2.5-1.5B-Instruct-GRPO
Updated
Jul 1
Previous
1
2
Next