Solved classic RL environments
Nitish Pandey
nitishpandey04
AI & ML interests
LLMs, Translation
Recent Activity
upvoted an article 29 days ago: Deriving the PPO Loss from First Principles
updated a collection about 2 months ago: Classic Reinforcement Learning
updated a model about 2 months ago: nitishpandey04/CarRacing-v3