Teen-Different commited on
Commit
28e70d5
·
verified ·
1 Parent(s): b429cd2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -4,11 +4,12 @@ language:
4
  - en
5
  pipeline_tag: reinforcement-learning
6
  tags:
7
- - RL,
8
  - ReinforcementLearning
9
  - Q
10
  - DoubleQ
11
  - Reward
 
12
  ---
13
 
14
  This report provides insights into the first assignment, which focuses on defining and solving reinforcement learning (RL) environments. In this assignment, I outline the RL agents, examine the logic and functionality of our environment and agent, and employ tabular methods such as SARSA and Q-learning to train our agent to maximize rewards. This document is an initial submission for the checkpoint of assignment 1, detailing the tasks outlined in sections one and two.
@@ -467,4 +468,4 @@ Here's a neatly organized overview of the results obtained from various tabular
467
  - *Double Q-learning*. (n.d.). Neurips.Cc. Retrieved February 8, 2024, from https://proceedings.neurips.cc/paper_files/paper/2010/file/091d584fced301b442654dd8c23b3fc9-Paper.pdf
468
  - Hasselt, H. V., Guez, A., & Silver, D. (2015). Deep reinforcement learning with Double Q-learning. *AAAI Conference on Artificial Intelligence*, 2094–2100. https://doi.org/10.1609/aaai.v30i1.10295
469
  - *SARSA vs Q - learning*. (n.d.). Github.Io. Retrieved February 8, 2024, from https://tcnguyen.github.io/reinforcement_learning/sarsa_vs_q_learning.html
470
- - *What is the difference between Q-learning and SARSA?* (n.d.). Stack Overflow. Retrieved February 8, 2024, from https://stackoverflow.com/questions/6848828/what-is-the-difference-between-q-learning-and-sarsa
 
4
  - en
5
  pipeline_tag: reinforcement-learning
6
  tags:
7
+ - RL
8
  - ReinforcementLearning
9
  - Q
10
  - DoubleQ
11
  - Reward
12
+ - Tabular
13
  ---
14
 
15
  This report provides insights into the first assignment, which focuses on defining and solving reinforcement learning (RL) environments. In this assignment, I outline the RL agents, examine the logic and functionality of our environment and agent, and employ tabular methods such as SARSA and Q-learning to train our agent to maximize rewards. This document is an initial submission for the checkpoint of assignment 1, detailing the tasks outlined in sections one and two.
 
468
  - *Double Q-learning*. (n.d.). Neurips.Cc. Retrieved February 8, 2024, from https://proceedings.neurips.cc/paper_files/paper/2010/file/091d584fced301b442654dd8c23b3fc9-Paper.pdf
469
  - Hasselt, H. V., Guez, A., & Silver, D. (2015). Deep reinforcement learning with Double Q-learning. *AAAI Conference on Artificial Intelligence*, 2094–2100. https://doi.org/10.1609/aaai.v30i1.10295
470
  - *SARSA vs Q - learning*. (n.d.). Github.Io. Retrieved February 8, 2024, from https://tcnguyen.github.io/reinforcement_learning/sarsa_vs_q_learning.html
471
+ - *What is the difference between Q-learning and SARSA?* (n.d.). Stack Overflow. Retrieved February 8, 2024, from https://stackoverflow.com/questions/6848828/what-is-the-difference-between-q-learning-and-sarsa