Update README.md
Browse files
README.md
CHANGED
|
@@ -4,11 +4,12 @@ language:
|
|
| 4 |
- en
|
| 5 |
pipeline_tag: reinforcement-learning
|
| 6 |
tags:
|
| 7 |
-
- RL
|
| 8 |
- ReinforcementLearning
|
| 9 |
- Q
|
| 10 |
- DoubleQ
|
| 11 |
- Reward
|
|
|
|
| 12 |
---
|
| 13 |
|
| 14 |
This report provides insights into the first assignment, which focuses on defining and solving reinforcement learning (RL) environments. In this assignment, I outline the RL agents, examine the logic and functionality of our environment and agent, and employ tabular methods such as SARSA and Q-learning to train our agent to maximize rewards. This document is an initial submission for the checkpoint of assignment 1, detailing the tasks outlined in sections one and two.
|
|
@@ -467,4 +468,4 @@ Here's a neatly organized overview of the results obtained from various tabular
|
|
| 467 |
- *Double Q-learning*. (n.d.). Neurips.Cc. Retrieved February 8, 2024, from https://proceedings.neurips.cc/paper_files/paper/2010/file/091d584fced301b442654dd8c23b3fc9-Paper.pdf
|
| 468 |
- Hasselt, H. V., Guez, A., & Silver, D. (2015). Deep reinforcement learning with Double Q-learning. *AAAI Conference on Artificial Intelligence*, 2094–2100. https://doi.org/10.1609/aaai.v30i1.10295
|
| 469 |
- *SARSA vs Q - learning*. (n.d.). Github.Io. Retrieved February 8, 2024, from https://tcnguyen.github.io/reinforcement_learning/sarsa_vs_q_learning.html
|
| 470 |
-
- *What is the difference between Q-learning and SARSA?* (n.d.). Stack Overflow. Retrieved February 8, 2024, from https://stackoverflow.com/questions/6848828/what-is-the-difference-between-q-learning-and-sarsa
|
|
|
|
| 4 |
- en
|
| 5 |
pipeline_tag: reinforcement-learning
|
| 6 |
tags:
|
| 7 |
+
- RL
|
| 8 |
- ReinforcementLearning
|
| 9 |
- Q
|
| 10 |
- DoubleQ
|
| 11 |
- Reward
|
| 12 |
+
- Tabular
|
| 13 |
---
|
| 14 |
|
| 15 |
This report provides insights into the first assignment, which focuses on defining and solving reinforcement learning (RL) environments. In this assignment, I outline the RL agents, examine the logic and functionality of our environment and agent, and employ tabular methods such as SARSA and Q-learning to train our agent to maximize rewards. This document is an initial submission for the checkpoint of assignment 1, detailing the tasks outlined in sections one and two.
|
|
|
|
| 468 |
- *Double Q-learning*. (n.d.). Neurips.Cc. Retrieved February 8, 2024, from https://proceedings.neurips.cc/paper_files/paper/2010/file/091d584fced301b442654dd8c23b3fc9-Paper.pdf
|
| 469 |
- Hasselt, H. V., Guez, A., & Silver, D. (2015). Deep reinforcement learning with Double Q-learning. *AAAI Conference on Artificial Intelligence*, 2094–2100. https://doi.org/10.1609/aaai.v30i1.10295
|
| 470 |
- *SARSA vs Q - learning*. (n.d.). Github.Io. Retrieved February 8, 2024, from https://tcnguyen.github.io/reinforcement_learning/sarsa_vs_q_learning.html
|
| 471 |
+
- *What is the difference between Q-learning and SARSA?* (n.d.). Stack Overflow. Retrieved February 8, 2024, from https://stackoverflow.com/questions/6848828/what-is-the-difference-between-q-learning-and-sarsa
|