AI & ML interests
None yet
Organizations
None yet
makwingchi/llama-8b-r128-a4
8B
•
Updated
makwingchi/llama-8b-r64-a4
8B
•
Updated
makwingchi/llama-8b-r32-a4
8B
•
Updated
makwingchi/llama-8b-r8-a4
8B
•
Updated
makwingchi/llama-8b-r4-a4
8B
•
Updated
makwingchi/llama-8b-r2-a4
8B
•
Updated
makwingchi/qwen3-8b-1-07-final
8B
•
Updated
makwingchi/qwen3-8b-1-07-step75
8B
•
Updated
makwingchi/llama-8b-r32-a16
8B
•
Updated
makwingchi/llama-8b-r16-a64
8B
•
Updated
makwingchi/llama-8b-r16-a32
8B
•
Updated
makwingchi/llama-8b-r16-a2
8B
•
Updated
makwingchi/llama-8b-r16-a8
8B
•
Updated
makwingchi/llama-8b-r16-a4
8B
•
Updated
Reinforcement Learning
•
Updated
makwingchi/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
makwingchi/Reinforce-Pixelcopter-v1
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
makwingchi/ppo-SnowballTarget
Reinforcement Learning
•
Updated
makwingchi/cartpole-reinforce-v1
Reinforcement Learning
•
Updated
makwingchi/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
makwingchi/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Reinforcement Learning
•
Updated
makwingchi/PPO-LunarLander-v2
Reinforcement Learning
•
Updated