Update README.md
README.md
CHANGED
@@ -104,9 +104,7 @@ This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing
 
 #### Summary Metrics Comparison
 
-
-
-| Metric | Lyte/QuadConnect2.5-0.5B-v0.0.6b | Lyte/QuadConnect2.5-0.5B-v0.0.8b | Lyte/QuadConnect2.5-0.0.9b (Temp 0.6) | Lyte/QuadConnect2.5-0.0.9b (Temp 0.8) |
+| Metric | Lyte/QuadConnect2.5-0.5B-v0.0.6b (Temp 0.6) | Lyte/QuadConnect2.5-0.5B-v0.0.8b (Temp 0.6) | Lyte/QuadConnect2.5-0.0.9b (Temp 0.6) | Lyte/QuadConnect2.5-0.0.9b (Temp 0.8) |
 |-----------------------|--------------------------------|--------------------------------|--------------------------------|--------------------------------|
 | Total games evaluated | 5082 | 5082 | 5082 | 5082 |
 | Correct predictions | 518 | 394 | 516 | **713** |
@@ -118,7 +116,7 @@ This model was trained with GRPO, a method introduced in [DeepSeekMath: Pushing
 
 #### Move Distribution Comparison
 
-| Column | Lyte/QuadConnect2.5-0.5B-v0.0.6b (Count, %) | Lyte/QuadConnect2.5-0.5B-v0.0.8b (Count, %) | Lyte/QuadConnect2.5-0.0.9b (Temp 0.6) (Count, %) | Lyte/QuadConnect2.5-0.0.9b (Temp 0.8) (Count, %) |
+| Column | Lyte/QuadConnect2.5-0.5B-v0.0.6b (Temp 0.6) (Count, %) | Lyte/QuadConnect2.5-0.5B-v0.0.8b (Temp 0.6) (Count, %) | Lyte/QuadConnect2.5-0.0.9b (Temp 0.6) (Count, %) | Lyte/QuadConnect2.5-0.0.9b (Temp 0.8) (Count, %) |
 |--------|-----------------------------------|-----------------------------------|------------------------------|------------------------------|
 | a | 603 (19.02%) | 3 (0.12%) | 1447 (38.72%) | 1547 (31.01%) |
 | b | 111 (3.50%) | 4 (0.16%) | 644 (17.23%) | 924 (18.52%) |