Deep RL models from the Hugging Face Deep RL Course: Q-learning, REINFORCE, PPO, and A2C agents across various environments.