·
AI & ML interests
None yet
Organizations
None yet
LevinZheng/gpt-oss-20b-lora-adapter
Updated
Text Generation
•
8B
•
Updated
LevinZheng/aha-moment-3B-v2
3B
•
Updated
LevinZheng/edu_ppo_full_30k_150steps
8B
•
Updated
LevinZheng/edu_ppo_full_30k
8B
•
Updated
LevinZheng/edu_ppo_full_10k
8B
•
Updated
LevinZheng/rlhf_ppo_full-Q8_0-GGUF
8B
•
Updated
•
3
LevinZheng/ppo-lunarlander-from0
Updated
LevinZheng/poca-SoccerTwos
Reinforcement Learning
•
Updated
•
1
LevinZheng/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
LevinZheng/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
1
LevinZheng/Reinforce-Pixelcopter-PLE-v0
Reinforcement Learning
•
Updated
LevinZheng/Reinforce-Cartpole-v1
Reinforcement Learning
•
Updated
LevinZheng/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
1
Reinforcement Learning
•
Updated
LevinZheng/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Text Generation
•
3B
•
Updated
•
1
Text Generation
•
8B
•
Updated
Text Generation
•
8B
•
Updated