RinKana commited on
Commit
fa6550b
·
verified ·
1 Parent(s): 90cb2cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +50 -3
README.md CHANGED
@@ -1,3 +1,50 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ ---
6
+
7
+ # ConnectZero-Nakalipithecus
8
+
9
+ An AlphaZero-based Reinforcement Learning agent for Connect 4 game.
10
+
11
+ **Architecture:** ResNet (5 Residual Blocks) + Dual Head (Policy & Value).
12
+
13
+ **Framework:** PyTorch.
14
+
15
+ **Training Platform:** Kaggle T4 GPU.
16
+
17
+ **Author: Chakrabhuana Vishnu Deva.**
18
+
19
+ # Training result
20
+
21
+ ```
22
+ Total Parameter of the Model: 1,497,742
23
+ Starting Training for 5 Iterations...
24
+
25
+ --- Iteration 1 ---
26
+ Self-Playing 100 games...
27
+ Data Collected: 1359 samples
28
+ Avg Loss: 2.9339
29
+
30
+ --- Iteration 2 ---
31
+ Self-Playing 100 games...
32
+ Data Collected: 1644 samples
33
+ Avg Loss: 2.6747
34
+
35
+ --- Iteration 3 ---
36
+ Self-Playing 100 games...
37
+ Data Collected: 1739 samples
38
+ Avg Loss: 2.4139
39
+
40
+ --- Iteration 4 ---
41
+ Self-Playing 100 games...
42
+ Data Collected: 1678 samples
43
+ Avg Loss: 2.3377
44
+
45
+ --- Iteration 5 ---
46
+ Self-Playing 100 games...
47
+ Data Collected: 2370 samples
48
+ Avg Loss: 2.1712
49
+ Model Saved!
50
+ ```