Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
Yikun Jiang
code1phoenix
Follow
AI & ML interests
None yet
Recent Activity
published
a model
18 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
published
a model
18 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
updated
a model
21 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
View all activity
Organizations
None yet
code1phoenix
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
published
2 models
18 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
Updated
22 days ago
•
11
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Updated
21 days ago
•
10
updated
a model
21 days ago
code1phoenix/zamba2-2.7b-dpo-v3-length-gsm8k
Updated
21 days ago
•
10
updated
a model
22 days ago
code1phoenix/zamba2-2.7b-grpo-v2-length-gsm8k
Updated
22 days ago
•
11
updated
a model
25 days ago
code1phoenix/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
25 days ago
published
a model
25 days ago
code1phoenix/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
25 days ago
updated
2 models
25 days ago
code1phoenix/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
25 days ago
•
50
code1phoenix/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
25 days ago
•
25
published
a model
25 days ago
code1phoenix/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
25 days ago
•
25
updated
a model
26 days ago
code1phoenix/ppo-pyramid
Reinforcement Learning
•
Updated
26 days ago
•
28
published
a model
26 days ago
code1phoenix/ppo-pyramid
Reinforcement Learning
•
Updated
26 days ago
•
28
updated
a model
26 days ago
code1phoenix/ppo-SnowballTarget
Reinforcement Learning
•
Updated
26 days ago
•
285
published
a model
26 days ago
code1phoenix/ppo-SnowballTarget
Reinforcement Learning
•
Updated
26 days ago
•
285
updated
a model
26 days ago
code1phoenix/pixelcopter
Reinforcement Learning
•
Updated
25 days ago
published
a model
26 days ago
code1phoenix/pixelcopter
Reinforcement Learning
•
Updated
25 days ago
updated
2 models
26 days ago
code1phoenix/cartpole-1
Reinforcement Learning
•
Updated
26 days ago
code1phoenix/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
26 days ago
•
43
published
2 models
26 days ago
code1phoenix/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
26 days ago
•
43
code1phoenix/cartpole-1
Reinforcement Learning
•
Updated
26 days ago
updated
a model
26 days ago
code1phoenix/Taxi-v3
Reinforcement Learning
•
Updated
26 days ago
Load more