Teen-Different
/

squiral_maze

Reinforcement Learning

ReinforcementLearning

Model card Files Files and versions

Teen-Different commited on Mar 30, 2025

Commit

28e70d5

·

verified ·

1 Parent(s): b429cd2

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -4,11 +4,12 @@ language:
 - en
 pipeline_tag: reinforcement-learning
 tags:
-- RL,
 - ReinforcementLearning
 - Q
 - DoubleQ
 - Reward
 ---
 This report provides insights into the first assignment, which focuses on defining and solving reinforcement learning (RL) environments. In this assignment, I outline the RL agents, examine the logic and functionality of our environment and agent, and employ tabular methods such as SARSA and Q-learning to train our agent to maximize rewards. This document is an initial submission for the checkpoint of assignment 1, detailing the tasks outlined in sections one and two.
@@ -467,4 +468,4 @@ Here's a neatly organized overview of the results obtained from various tabular
 - *Double Q-learning*. (n.d.). Neurips.Cc. Retrieved February 8, 2024, from https://proceedings.neurips.cc/paper_files/paper/2010/file/091d584fced301b442654dd8c23b3fc9-Paper.pdf
 - Hasselt, H. V., Guez, A., & Silver, D. (2015). Deep reinforcement learning with Double Q-learning. *AAAI Conference on Artificial Intelligence*, 2094–2100. https://doi.org/10.1609/aaai.v30i1.10295
 - *SARSA vs Q - learning*. (n.d.). Github.Io. Retrieved February 8, 2024, from https://tcnguyen.github.io/reinforcement_learning/sarsa_vs_q_learning.html
-- *What is the difference between Q-learning and SARSA?* (n.d.). Stack Overflow. Retrieved February 8, 2024, from https://stackoverflow.com/questions/6848828/what-is-the-difference-between-q-learning-and-sarsa

 - en
 pipeline_tag: reinforcement-learning
 tags:
+- RL
 - ReinforcementLearning
 - Q
 - DoubleQ
 - Reward
+- Tabular
 ---
 This report provides insights into the first assignment, which focuses on defining and solving reinforcement learning (RL) environments. In this assignment, I outline the RL agents, examine the logic and functionality of our environment and agent, and employ tabular methods such as SARSA and Q-learning to train our agent to maximize rewards. This document is an initial submission for the checkpoint of assignment 1, detailing the tasks outlined in sections one and two.
 - *Double Q-learning*. (n.d.). Neurips.Cc. Retrieved February 8, 2024, from https://proceedings.neurips.cc/paper_files/paper/2010/file/091d584fced301b442654dd8c23b3fc9-Paper.pdf
 - Hasselt, H. V., Guez, A., & Silver, D. (2015). Deep reinforcement learning with Double Q-learning. *AAAI Conference on Artificial Intelligence*, 2094–2100. https://doi.org/10.1609/aaai.v30i1.10295
 - *SARSA vs Q - learning*. (n.d.). Github.Io. Retrieved February 8, 2024, from https://tcnguyen.github.io/reinforcement_learning/sarsa_vs_q_learning.html
+- *What is the difference between Q-learning and SARSA?* (n.d.). Stack Overflow. Retrieved February 8, 2024, from https://stackoverflow.com/questions/6848828/what-is-the-difference-between-q-learning-and-sarsa