Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
datasets-and-models
non-profit
guanning03
Activity Feed
Follow
6
AI & ML interests
None defined yet.
Recent Activity
guanning
updated
a model
17 days ago
guanning-ai/Qwen2-3M_MaxRL_Maze17_bz256_ns64
guanning
published
a model
17 days ago
guanning-ai/Qwen2-3M_MaxRL_Maze17_bz256_ns64
guanning
updated
a model
17 days ago
guanning-ai/smollm2_0.3B_MaxRL_gsm8k_1000_steps
View all activity
Team members
5
guanning-ai
's models
113
Sort: Recently updated
guanning-ai/Smollm001
Updated
Jan 23
guanning-ai/0122_p_normalization_32rollouts_1500
0.4B
•
Updated
Jan 22
guanning-ai/0122_pkpo_T16_1500
0.4B
•
Updated
Jan 22
guanning-ai/grpo_64rollouts_1200
0.4B
•
Updated
Jan 22
guanning-ai/grpo_64rollouts_900
0.4B
•
Updated
Jan 22
guanning-ai/grpo_64rollouts_600
0.4B
•
Updated
Jan 22
guanning-ai/grpo_64rollouts_300
0.4B
•
Updated
Jan 22
guanning-ai/grpo_64rollouts_1500
0.4B
•
Updated
Jan 22
guanning-ai/grpo_16rollouts_step1500
Updated
Jan 21
guanning-ai/p_normalization_16rollouts
Updated
Jan 20
guanning-ai/clipped_pnorm
Updated
Jan 20
guanning-ai/grpo_Entropy0.001_step1500
Updated
Jan 20
guanning-ai/p_normalization_0118
Updated
Jan 17
guanning-ai/rloo_0118
Updated
Jan 17
guanning-ai/grpo_0118
Updated
Jan 17
guanning-ai/SmolLM-360M-RLOO-Math-Step1100
Updated
Jan 10
guanning-ai/SmolLM-360M-GRPO-Math-Step1100
Updated
Jan 10
guanning-ai/20260102-p_normalization_step4000
0.4B
•
Updated
Jan 1
guanning-ai/20260102-grpo_step4000
0.4B
•
Updated
Jan 1
guanning-ai/smollm-gsm8k-pnorm-ckpt4900
0.4B
•
Updated
Jan 1
•
5
guanning-ai/smollm-gsm8k-grpo-ckpt3900
0.4B
•
Updated
Jan 1
guanning-ai/smollm-gsm8k-grpo-ckpt1000
0.4B
•
Updated
Jan 1
guanning-ai/maze_sft_weights_1207
Updated
Dec 6, 2025
guanning-ai/Gai
Updated
Nov 22, 2025
guanning-ai/1027-math4b-bz1024-pposz128-rollout4-seed20
Updated
Oct 29, 2025
guanning-ai/1024-1.5b-knk23-debug1004
Updated
Oct 24, 2025
guanning-ai/1024-jspo-4b-lr1e-6-bz64-pposz32-rollout4-seed6
Updated
Oct 24, 2025
guanning-ai/significance-test-1016
Updated
Oct 22, 2025
guanning-ai/Gai0
Updated
Oct 19, 2025
guanning-ai/jspo-0921
Updated
Sep 21, 2025
Previous
1
2
3
4
Next