Levin Zheng
LevinZheng
0 followers · 7 following
AI & ML interests
None yet
Recent Activity
liked a dataset 17 days ago: ByteDance-Seed/BeyondAIME
updated a model 2 months ago: LevinZheng/32b_recall_1015_fold_3_gptq4
updated a model 2 months ago: LevinZheng/32b_recall_1015_fold_2_gptq4
Organizations
None yet
LevinZheng's models (82)
Sort: Recently updated
LevinZheng/gpt-oss-20b-lora-adapter • Updated Aug 14
LevinZheng/rlhf_ppo_full • Text Generation • 8B • Updated Jul 20 • 1
LevinZheng/aha-moment-3B-v2 • 3B • Updated Jul 19 • 5
LevinZheng/edu_ppo_full_30k_150steps • 8B • Updated Jul 19 • 6
LevinZheng/edu_ppo_full_30k • 8B • Updated Jul 18 • 6
LevinZheng/edu_ppo_full_10k • 8B • Updated Jul 18 • 8
LevinZheng/edu_sft_full • Updated Jul 17 • 1
LevinZheng/edu_dpo_full • Updated Jul 17
LevinZheng/rlhf_ppo_full-Q8_0-GGUF • 8B • Updated Jul 10 • 2
LevinZheng/ppo-lunarlander-from0 • Updated Jun 20
LevinZheng/poca-SoccerTwos • Reinforcement Learning • Updated May 28 • 11
LevinZheng/a2c-PandaReachDense-v3 • Reinforcement Learning • Updated May 28 • 2
LevinZheng/ppo-Pyramids • Reinforcement Learning • Updated May 28 • 6
LevinZheng/ppo-SnowballTarget • Reinforcement Learning • Updated May 28 • 19
LevinZheng/Reinforce-Pixelcopter-PLE-v0 • Reinforcement Learning • Updated May 27
LevinZheng/Reinforce-Cartpole-v1 • Reinforcement Learning • Updated May 27
LevinZheng/dqn-SpaceInvadersNoFrameskip-v4 • Reinforcement Learning • Updated May 27 • 7
LevinZheng/q-Taxi-v3 • Reinforcement Learning • Updated May 27
LevinZheng/ppo-LunarLander-v2 • Reinforcement Learning • Updated May 27 • 3
LevinZheng/aha-moment-3B • Text Generation • 3B • Updated May 13 • 9
LevinZheng/rlhf_dpo_full • Text Generation • 8B • Updated May 13
LevinZheng/rlhf_sft_full • Text Generation • 8B • Updated May 13 • 1