Hugging Face
liang
CharlesLi
1 follower · 0 following
AI & ML interests
Trustworthy Machine Learning
Recent Activity
liked a Space, 5 days ago: aminediroHF/trainer-generator-bf16-mismatch
new activity, 5 months ago: deepcs233/Visual-CoT: Not compatible with HF Datasets
upvoted a paper, 7 months ago: LANPO: Bootstrapping Language and Numerical Feedback for Reinforcement Learning in LLMs
Organizations
None yet
CharlesLi's models (515)
Sort: Recently updated
CharlesLi/grpo_5_epoch_graph_task_ins_bsz8_8192_entropy_0005_200
3B
•
Updated
May 3, 2025
•
3
CharlesLi/dapo_5_epoch_graph_task_3B_9216_200
3B
•
Updated
May 3, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_bsz16_4096_entropy_0001_350
3B
•
Updated
May 3, 2025
•
2
CharlesLi/dapo_5_epoch_graph_task_3B_800
3B
•
Updated
May 2, 2025
•
1
CharlesLi/dapo_5_epoch_graph_task_3B_1150
3B
•
Updated
May 2, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_bsz16_4096_entropy_0001_400
3B
•
Updated
May 2, 2025
•
4
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_1000
3B
•
Updated
May 2, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_1150
3B
•
Updated
May 2, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_700
3B
•
Updated
May 1, 2025
•
3
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_600
3B
•
Updated
May 1, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_500
3B
•
Updated
May 1, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_400
3B
•
Updated
May 1, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_large_entropy_300
3B
•
Updated
May 1, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_large_400
3B
•
Updated
Apr 29, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_400
3B
•
Updated
Apr 29, 2025
•
2
CharlesLi/grpo_5_epoch_graph_task_ins_large_300
3B
•
Updated
Apr 29, 2025
•
1
CharlesLi/grpo_5_epoch_graph_task_ins_300
3B
•
Updated
Apr 29, 2025
•
2
CharlesLi/grpo_5_epoch_graph_hard_ins_gpu_800
3B
•
Updated
Apr 28, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_base_gpu_800
3B
•
Updated
Apr 28, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_ins_start_400
3B
•
Updated
Apr 28, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_base_start_400
3B
•
Updated
Apr 28, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_ins_gpu_700
3B
•
Updated
Apr 28, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_base_gpu_700
3B
•
Updated
Apr 28, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_ins_gpu_400
3B
•
Updated
Apr 27, 2025
CharlesLi/grpo_5_epoch_graph_hard_base_gpu_400
3B
•
Updated
Apr 27, 2025
•
2
CharlesLi/grpo_5_epoch_graph_hard_ins_start_200
3B
•
Updated
Apr 27, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_ins_gpu_200
3B
•
Updated
Apr 27, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_base_start_200
3B
•
Updated
Apr 27, 2025
•
1
CharlesLi/grpo_5_epoch_graph_hard_base_gpu_200
3B
•
Updated
Apr 27, 2025
•
1
CharlesLi/graph_grpo_40
3B
•
Updated
Apr 24, 2025
•
1