Reinforcement Learning
Marcus2112/ppo-Huggy Reinforcement Learning • Updated Dec 20, 2023 • 22
Marcus2112/q-FrozenLake-v1-4x4-noSlippery Reinforcement Learning • Updated Jan 17, 2024
Marcus2112/dqn-SpaceInvadersNoFrameskip-v4 Reinforcement Learning • Updated Jan 17, 2024 • 17
Marcus2112/q-Taxi-v3 Reinforcement Learning • Updated Jan 17, 2024
MiniCorpus
github.com/MK2112/minicorpus
Marcus2112/pythia-160m-minipile 0.1B • Updated Jan 17, 2025 • 3
Marcus2112/pythia-160m-minipile_reproduction 0.2B • Updated Jan 17, 2025 • 3
Marcus2112/pythia-160m-minipile_cluster-proportioned 0.2B • Updated Jan 17, 2025 • 3
Marcus2112/pythia-160m-minipile_loss-sampled 0.2B • Updated Jan 17, 2025 • 3
Zero to Hero
github.com/MK2112/nn-zero-to-hero-notes
Marcus2112/nanogpt_base Updated Apr 15, 2025
Marcus2112/nanogpt_glu_base Updated Apr 15, 2025
Marcus2112/nanogpt_shakespeare Updated Apr 15, 2025
Marcus2112/nanogpt_glu_shakespeare Updated Apr 15, 2025