Alex-GSL committed on
Commit f296ef8 · verified · 1 Parent(s): 207682d

Upload human_games/game_20260325_142116_4.json with huggingface_hub

human_games/game_20260325_142116_4.json ADDED
@@ -0,0 +1,2550 @@
1
+ {
2
+ "game_number": 4,
3
+ "timestamp": "2026-03-25T14:21:16.903291+00:00",
4
+ "checkpoint": "checkpoints/r42_1500M.pkl",
5
+ "architecture": "simba_aux",
6
+ "model_checkpoint": "r42_1500M.pkl",
7
+ "model_architecture": "simba_aux",
8
+ "result": "human_win",
9
+ "final_scores": [
10
+ 8,
11
+ 0
12
+ ],
13
+ "total_turns": 18,
14
+ "actions": [
15
+ {
16
+ "turn": 0,
17
+ "player": -1,
18
+ "phase": "draw",
19
+ "action": -1,
20
+ "action_desc": "Initial deal",
21
+ "card_drawn": null,
22
+ "hands": [
23
+ [
24
+ 5,
25
+ 2,
26
+ 6,
27
+ 33,
28
+ 17,
29
+ 37,
30
+ 51,
31
+ 18,
32
+ 25,
33
+ 23
34
+ ],
35
+ [
36
+ 21,
37
+ 19,
38
+ 38,
39
+ 47,
40
+ 45,
41
+ 43,
42
+ 7,
43
+ 29,
44
+ 14,
45
+ 0
46
+ ]
47
+ ],
48
+ "hand_sizes": [
49
+ 10,
50
+ 10
51
+ ],
52
+ "discard_pile": [
53
+ 44
54
+ ],
55
+ "discard_size": 1,
56
+ "stock_remaining": 31,
57
+ "deadwood": [
58
+ 75,
59
+ 62
60
+ ],
61
+ "model_logits": null,
62
+ "model_value": null,
63
+ "model_checkpoint": "r42_1500M.pkl",
64
+ "model_architecture": "simba_aux"
65
+ },
66
+ {
67
+ "turn": 1,
68
+ "player": 1,
69
+ "phase": "draw",
70
+ "action": 0,
71
+ "action_desc": "Model drew from stock",
72
+ "card_drawn": 4,
73
+ "hands": [
74
+ [
75
+ 5,
76
+ 2,
77
+ 6,
78
+ 33,
79
+ 17,
80
+ 37,
81
+ 51,
82
+ 18,
83
+ 25,
84
+ 23
85
+ ],
86
+ [
87
+ 21,
88
+ 19,
89
+ 38,
90
+ 47,
91
+ 45,
92
+ 43,
93
+ 7,
94
+ 29,
95
+ 14,
96
+ 0
97
+ ]
98
+ ],
99
+ "hand_sizes": [
100
+ 10,
101
+ 10
102
+ ],
103
+ "discard_pile": [
104
+ 44
105
+ ],
106
+ "discard_size": 1,
107
+ "stock_remaining": 31,
108
+ "deadwood": [
109
+ 75,
110
+ 62
111
+ ],
112
+ "model_logits": [
113
+ 0.09325987845659256,
114
+ -0.087431900203228,
115
+ -0.5688839554786682,
116
+ -0.06865032762289047,
117
+ -2.5005149841308594,
118
+ -4.514980316162109,
119
+ -0.9133586883544922,
120
+ 1.1016803979873657,
121
+ 1.1803399324417114,
122
+ 2.8836007118225098,
123
+ 3.5052731037139893,
124
+ 0.17142173647880554,
125
+ -4.690744400024414,
126
+ 1.8572735786437988,
127
+ -3.2495574951171875,
128
+ 14.289706230163574
129
+ ],
130
+ "model_value": 0.32534098625183105,
131
+ "model_checkpoint": "r42_1500M.pkl",
132
+ "model_architecture": "simba_aux"
133
+ },
134
+ {
135
+ "turn": 2,
136
+ "player": 1,
137
+ "phase": "discard",
138
+ "action": 12,
139
+ "action_desc": "Model discarded 5\u2660",
140
+ "card_drawn": null,
141
+ "hands": [
142
+ [
143
+ 5,
144
+ 2,
145
+ 6,
146
+ 33,
147
+ 17,
148
+ 37,
149
+ 51,
150
+ 18,
151
+ 25,
152
+ 23
153
+ ],
154
+ [
155
+ 21,
156
+ 19,
157
+ 38,
158
+ 47,
159
+ 45,
160
+ 43,
161
+ 7,
162
+ 29,
163
+ 14,
164
+ 0,
165
+ 4
166
+ ]
167
+ ],
168
+ "hand_sizes": [
169
+ 10,
170
+ 11
171
+ ],
172
+ "discard_pile": [
173
+ 44
174
+ ],
175
+ "discard_size": 1,
176
+ "stock_remaining": 30,
177
+ "deadwood": [
178
+ 75,
179
+ 67
180
+ ],
181
+ "model_logits": [
182
+ 3.417013168334961,
183
+ -3.4077160358428955,
184
+ -5.001515865325928,
185
+ -0.009092236869037151,
186
+ -2.6918270587921143,
187
+ -3.870602607727051,
188
+ -3.5438337326049805,
189
+ -2.014331579208374,
190
+ 1.7961392402648926,
191
+ 4.180337429046631,
192
+ 4.018463134765625,
193
+ -1.7526929378509521,
194
+ 4.605531215667725,
195
+ 2.3439955711364746,
196
+ -3.8578248023986816,
197
+ 13.765121459960938
198
+ ],
199
+ "model_value": 0.3130495548248291,
200
+ "model_checkpoint": "r42_1500M.pkl",
201
+ "model_architecture": "simba_aux"
202
+ },
203
+ {
204
+ "turn": 3,
205
+ "player": 0,
206
+ "phase": "draw",
207
+ "action": 1,
208
+ "action_desc": "Human drew 5\u2660 from discard",
209
+ "card_drawn": 4,
210
+ "hands": [
211
+ [
212
+ 5,
213
+ 2,
214
+ 6,
215
+ 33,
216
+ 17,
217
+ 37,
218
+ 51,
219
+ 18,
220
+ 25,
221
+ 23
222
+ ],
223
+ [
224
+ 21,
225
+ 19,
226
+ 38,
227
+ 47,
228
+ 45,
229
+ 43,
230
+ 7,
231
+ 29,
232
+ 14,
233
+ 0
234
+ ]
235
+ ],
236
+ "hand_sizes": [
237
+ 10,
238
+ 10
239
+ ],
240
+ "discard_pile": [
241
+ 44,
242
+ 4
243
+ ],
244
+ "discard_size": 2,
245
+ "stock_remaining": 30,
246
+ "deadwood": [
247
+ 75,
248
+ 62
249
+ ],
250
+ "model_logits": null,
251
+ "model_value": null,
252
+ "model_checkpoint": "r42_1500M.pkl",
253
+ "model_architecture": "simba_aux"
254
+ },
255
+ {
256
+ "turn": 4,
257
+ "player": 0,
258
+ "phase": "discard",
259
+ "action": 8,
260
+ "action_desc": "Human discarded K\u2663",
261
+ "card_drawn": null,
262
+ "hands": [
263
+ [
264
+ 5,
265
+ 2,
266
+ 6,
267
+ 33,
268
+ 17,
269
+ 37,
270
+ 51,
271
+ 18,
272
+ 25,
273
+ 23,
274
+ 4
275
+ ],
276
+ [
277
+ 21,
278
+ 19,
279
+ 38,
280
+ 47,
281
+ 45,
282
+ 43,
283
+ 7,
284
+ 29,
285
+ 14,
286
+ 0
287
+ ]
288
+ ],
289
+ "hand_sizes": [
290
+ 11,
291
+ 10
292
+ ],
293
+ "discard_pile": [
294
+ 44
295
+ ],
296
+ "discard_size": 1,
297
+ "stock_remaining": 30,
298
+ "deadwood": [
299
+ 62,
300
+ 62
301
+ ],
302
+ "model_logits": null,
303
+ "model_value": null,
304
+ "model_checkpoint": "r42_1500M.pkl",
305
+ "model_architecture": "simba_aux"
306
+ },
307
+ {
308
+ "turn": 5,
309
+ "player": 1,
310
+ "phase": "draw",
311
+ "action": 0,
312
+ "action_desc": "Model drew from stock",
313
+ "card_drawn": 36,
314
+ "hands": [
315
+ [
316
+ 5,
317
+ 2,
318
+ 6,
319
+ 33,
320
+ 17,
321
+ 37,
322
+ 4,
323
+ 18,
324
+ 25,
325
+ 23
326
+ ],
327
+ [
328
+ 21,
329
+ 19,
330
+ 38,
331
+ 47,
332
+ 45,
333
+ 43,
334
+ 7,
335
+ 29,
336
+ 14,
337
+ 0
338
+ ]
339
+ ],
340
+ "hand_sizes": [
341
+ 10,
342
+ 10
343
+ ],
344
+ "discard_pile": [
345
+ 44,
346
+ 51
347
+ ],
348
+ "discard_size": 2,
349
+ "stock_remaining": 30,
350
+ "deadwood": [
351
+ 52,
352
+ 62
353
+ ],
354
+ "model_logits": [
355
+ 5.197052478790283,
356
+ -5.18112325668335,
357
+ -2.769796371459961,
358
+ -0.8118061423301697,
359
+ -0.5535638332366943,
360
+ -7.104782581329346,
361
+ -2.2658491134643555,
362
+ 0.2758958339691162,
363
+ 2.0152428150177,
364
+ 3.033750057220459,
365
+ 2.8243067264556885,
366
+ 0.27645158767700195,
367
+ 3.9991893768310547,
368
+ 0.7301002740859985,
369
+ -2.4215526580810547,
370
+ 15.506138801574707
371
+ ],
372
+ "model_value": 0.165588840842247,
373
+ "model_checkpoint": "r42_1500M.pkl",
374
+ "model_architecture": "simba_aux"
375
+ },
376
+ {
377
+ "turn": 6,
378
+ "player": 1,
379
+ "phase": "discard",
380
+ "action": 8,
381
+ "action_desc": "Model discarded 8\u2660",
382
+ "card_drawn": null,
383
+ "hands": [
384
+ [
385
+ 5,
386
+ 2,
387
+ 6,
388
+ 33,
389
+ 17,
390
+ 37,
391
+ 4,
392
+ 18,
393
+ 25,
394
+ 23
395
+ ],
396
+ [
397
+ 21,
398
+ 19,
399
+ 38,
400
+ 47,
401
+ 45,
402
+ 43,
403
+ 7,
404
+ 29,
405
+ 14,
406
+ 0,
407
+ 36
408
+ ]
409
+ ],
410
+ "hand_sizes": [
411
+ 10,
412
+ 11
413
+ ],
414
+ "discard_pile": [
415
+ 44,
416
+ 51
417
+ ],
418
+ "discard_size": 2,
419
+ "stock_remaining": 29,
420
+ "deadwood": [
421
+ 52,
422
+ 72
423
+ ],
424
+ "model_logits": [
425
+ -0.32122141122817993,
426
+ 0.3406141698360443,
427
+ -5.724595546722412,
428
+ -4.194174289703369,
429
+ -3.1780011653900146,
430
+ -2.937650680541992,
431
+ -0.8163458704948425,
432
+ 1.94199538230896,
433
+ 3.5510947704315186,
434
+ 4.117539882659912,
435
+ 1.6936264038085938,
436
+ -1.9613864421844482,
437
+ 1.4538295269012451,
438
+ 5.999101638793945,
439
+ -7.209649085998535,
440
+ 11.982159614562988
441
+ ],
442
+ "model_value": 0.3117015063762665,
443
+ "model_checkpoint": "r42_1500M.pkl",
444
+ "model_architecture": "simba_aux"
445
+ },
446
+ {
447
+ "turn": 7,
448
+ "player": 0,
449
+ "phase": "draw",
450
+ "action": 1,
451
+ "action_desc": "Human drew 8\u2660 from discard",
452
+ "card_drawn": 7,
453
+ "hands": [
454
+ [
455
+ 5,
456
+ 2,
457
+ 6,
458
+ 33,
459
+ 17,
460
+ 37,
461
+ 4,
462
+ 18,
463
+ 25,
464
+ 23
465
+ ],
466
+ [
467
+ 21,
468
+ 19,
469
+ 38,
470
+ 47,
471
+ 45,
472
+ 43,
473
+ 36,
474
+ 29,
475
+ 14,
476
+ 0
477
+ ]
478
+ ],
479
+ "hand_sizes": [
480
+ 10,
481
+ 10
482
+ ],
483
+ "discard_pile": [
484
+ 44,
485
+ 51,
486
+ 7
487
+ ],
488
+ "discard_size": 3,
489
+ "stock_remaining": 29,
490
+ "deadwood": [
491
+ 52,
492
+ 64
493
+ ],
494
+ "model_logits": null,
495
+ "model_value": null,
496
+ "model_checkpoint": "r42_1500M.pkl",
497
+ "model_architecture": "simba_aux"
498
+ },
499
+ {
500
+ "turn": 8,
501
+ "player": 0,
502
+ "phase": "discard",
503
+ "action": 7,
504
+ "action_desc": "Human discarded Q\u2666",
505
+ "card_drawn": null,
506
+ "hands": [
507
+ [
508
+ 5,
509
+ 2,
510
+ 6,
511
+ 33,
512
+ 17,
513
+ 37,
514
+ 4,
515
+ 18,
516
+ 25,
517
+ 23,
518
+ 7
519
+ ],
520
+ [
521
+ 21,
522
+ 19,
523
+ 38,
524
+ 47,
525
+ 45,
526
+ 43,
527
+ 36,
528
+ 29,
529
+ 14,
530
+ 0
531
+ ]
532
+ ],
533
+ "hand_sizes": [
534
+ 11,
535
+ 10
536
+ ],
537
+ "discard_pile": [
538
+ 44,
539
+ 51
540
+ ],
541
+ "discard_size": 2,
542
+ "stock_remaining": 29,
543
+ "deadwood": [
544
+ 52,
545
+ 64
546
+ ],
547
+ "model_logits": null,
548
+ "model_value": null,
549
+ "model_checkpoint": "r42_1500M.pkl",
550
+ "model_architecture": "simba_aux"
551
+ },
552
+ {
553
+ "turn": 9,
554
+ "player": 1,
555
+ "phase": "draw",
556
+ "action": 1,
557
+ "action_desc": "Model drew Q\u2666 from discard",
558
+ "card_drawn": 37,
559
+ "hands": [
560
+ [
561
+ 5,
562
+ 2,
563
+ 6,
564
+ 33,
565
+ 17,
566
+ 7,
567
+ 4,
568
+ 18,
569
+ 25,
570
+ 23
571
+ ],
572
+ [
573
+ 21,
574
+ 19,
575
+ 38,
576
+ 47,
577
+ 45,
578
+ 43,
579
+ 36,
580
+ 29,
581
+ 14,
582
+ 0
583
+ ]
584
+ ],
585
+ "hand_sizes": [
586
+ 10,
587
+ 10
588
+ ],
589
+ "discard_pile": [
590
+ 44,
591
+ 51,
592
+ 37
593
+ ],
594
+ "discard_size": 3,
595
+ "stock_remaining": 29,
596
+ "deadwood": [
597
+ 42,
598
+ 64
599
+ ],
600
+ "model_logits": [
601
+ -24.1959228515625,
602
+ 24.210447311401367,
603
+ -1.0009433031082153,
604
+ -2.5308589935302734,
605
+ 1.8285845518112183,
606
+ -2.4441871643066406,
607
+ -0.4872193932533264,
608
+ -2.541861057281494,
609
+ 1.9330122470855713,
610
+ -0.7277585864067078,
611
+ 0.5597188472747803,
612
+ 0.2730743885040283,
613
+ -9.375678062438965,
614
+ -1.695233702659607,
615
+ 2.0203540325164795,
616
+ 2.025934934616089
617
+ ],
618
+ "model_value": 0.186870738863945,
619
+ "model_checkpoint": "r42_1500M.pkl",
620
+ "model_architecture": "simba_aux"
621
+ },
622
+ {
623
+ "turn": 10,
624
+ "player": 1,
625
+ "phase": "discard",
626
+ "action": 9,
627
+ "action_desc": "Model discarded 4\u2666",
628
+ "card_drawn": null,
629
+ "hands": [
630
+ [
631
+ 5,
632
+ 2,
633
+ 6,
634
+ 33,
635
+ 17,
636
+ 7,
637
+ 4,
638
+ 18,
639
+ 25,
640
+ 23
641
+ ],
642
+ [
643
+ 21,
644
+ 19,
645
+ 38,
646
+ 47,
647
+ 45,
648
+ 43,
649
+ 36,
650
+ 29,
651
+ 14,
652
+ 0,
653
+ 37
654
+ ]
655
+ ],
656
+ "hand_sizes": [
657
+ 10,
658
+ 11
659
+ ],
660
+ "discard_pile": [
661
+ 44,
662
+ 51
663
+ ],
664
+ "discard_size": 2,
665
+ "stock_remaining": 29,
666
+ "deadwood": [
667
+ 42,
668
+ 44
669
+ ],
670
+ "model_logits": [
671
+ -3.7748053073883057,
672
+ 3.7913267612457275,
673
+ 2.5602798461914062,
674
+ 3.7133896350860596,
675
+ -12.48697566986084,
676
+ 3.3832638263702393,
677
+ 3.0778608322143555,
678
+ 4.574680328369141,
679
+ -17.03318214416504,
680
+ 6.318802356719971,
681
+ 7.165841102600098,
682
+ 2.709026575088501,
683
+ -11.156314849853516,
684
+ 1.7774165868759155,
685
+ -2.1440062522888184,
686
+ 6.494228363037109
687
+ ],
688
+ "model_value": 0.31523439288139343,
689
+ "model_checkpoint": "r42_1500M.pkl",
690
+ "model_architecture": "simba_aux"
691
+ },
692
+ {
693
+ "turn": 11,
694
+ "player": 0,
695
+ "phase": "draw",
696
+ "action": 0,
697
+ "action_desc": "Human drew from stock",
698
+ "card_drawn": 49,
699
+ "hands": [
700
+ [
701
+ 5,
702
+ 2,
703
+ 6,
704
+ 33,
705
+ 17,
706
+ 7,
707
+ 4,
708
+ 18,
709
+ 25,
710
+ 23
711
+ ],
712
+ [
713
+ 21,
714
+ 19,
715
+ 38,
716
+ 47,
717
+ 45,
718
+ 43,
719
+ 36,
720
+ 37,
721
+ 14,
722
+ 0
723
+ ]
724
+ ],
725
+ "hand_sizes": [
726
+ 10,
727
+ 10
728
+ ],
729
+ "discard_pile": [
730
+ 44,
731
+ 51,
732
+ 29
733
+ ],
734
+ "discard_size": 3,
735
+ "stock_remaining": 29,
736
+ "deadwood": [
737
+ 42,
738
+ 40
739
+ ],
740
+ "model_logits": null,
741
+ "model_value": null,
742
+ "model_checkpoint": "r42_1500M.pkl",
743
+ "model_architecture": "simba_aux"
744
+ },
745
+ {
746
+ "turn": 12,
747
+ "player": 0,
748
+ "phase": "discard",
749
+ "action": 12,
750
+ "action_desc": "Human discarded J\u2663",
751
+ "card_drawn": null,
752
+ "hands": [
753
+ [
754
+ 5,
755
+ 2,
756
+ 6,
757
+ 33,
758
+ 17,
759
+ 7,
760
+ 4,
761
+ 18,
762
+ 25,
763
+ 23,
764
+ 49
765
+ ],
766
+ [
767
+ 21,
768
+ 19,
769
+ 38,
770
+ 47,
771
+ 45,
772
+ 43,
773
+ 36,
774
+ 37,
775
+ 14,
776
+ 0
777
+ ]
778
+ ],
779
+ "hand_sizes": [
780
+ 11,
781
+ 10
782
+ ],
783
+ "discard_pile": [
784
+ 44,
785
+ 51,
786
+ 29
787
+ ],
788
+ "discard_size": 3,
789
+ "stock_remaining": 28,
790
+ "deadwood": [
791
+ 52,
792
+ 40
793
+ ],
794
+ "model_logits": null,
795
+ "model_value": null,
796
+ "model_checkpoint": "r42_1500M.pkl",
797
+ "model_architecture": "simba_aux"
798
+ },
799
+ {
800
+ "turn": 13,
801
+ "player": 1,
802
+ "phase": "draw",
803
+ "action": 0,
804
+ "action_desc": "Model drew from stock",
805
+ "card_drawn": 3,
806
+ "hands": [
807
+ [
808
+ 5,
809
+ 2,
810
+ 6,
811
+ 33,
812
+ 17,
813
+ 7,
814
+ 4,
815
+ 18,
816
+ 25,
817
+ 23
818
+ ],
819
+ [
820
+ 21,
821
+ 19,
822
+ 38,
823
+ 47,
824
+ 45,
825
+ 43,
826
+ 36,
827
+ 37,
828
+ 14,
829
+ 0
830
+ ]
831
+ ],
832
+ "hand_sizes": [
833
+ 10,
834
+ 10
835
+ ],
836
+ "discard_pile": [
837
+ 44,
838
+ 51,
839
+ 29,
840
+ 49
841
+ ],
842
+ "discard_size": 4,
843
+ "stock_remaining": 28,
844
+ "deadwood": [
845
+ 42,
846
+ 40
847
+ ],
848
+ "model_logits": [
849
+ 5.435916423797607,
850
+ -5.450207233428955,
851
+ 3.107023000717163,
852
+ 2.9311065673828125,
853
+ -7.157114505767822,
854
+ -0.09724774211645126,
855
+ 4.413162708282471,
856
+ 1.8900127410888672,
857
+ -9.952603340148926,
858
+ -8.317371368408203,
859
+ 3.3269364833831787,
860
+ 2.0887064933776855,
861
+ 4.783175945281982,
862
+ 2.314573049545288,
863
+ -3.9845399856567383,
864
+ 14.989801406860352
865
+ ],
866
+ "model_value": 0.2417949140071869,
867
+ "model_checkpoint": "r42_1500M.pkl",
868
+ "model_architecture": "simba_aux"
869
+ },
870
+ {
871
+ "turn": 14,
872
+ "player": 1,
873
+ "phase": "discard",
874
+ "action": 12,
875
+ "action_desc": "Model discarded 4\u2660",
876
+ "card_drawn": null,
877
+ "hands": [
878
+ [
879
+ 5,
880
+ 2,
881
+ 6,
882
+ 33,
883
+ 17,
884
+ 7,
885
+ 4,
886
+ 18,
887
+ 25,
888
+ 23
889
+ ],
890
+ [
891
+ 21,
892
+ 19,
893
+ 38,
894
+ 47,
895
+ 45,
896
+ 43,
897
+ 36,
898
+ 37,
899
+ 14,
900
+ 0,
901
+ 3
902
+ ]
903
+ ],
904
+ "hand_sizes": [
905
+ 10,
906
+ 11
907
+ ],
908
+ "discard_pile": [
909
+ 44,
910
+ 51,
911
+ 29,
912
+ 49
913
+ ],
914
+ "discard_size": 4,
915
+ "stock_remaining": 27,
916
+ "deadwood": [
917
+ 42,
918
+ 44
919
+ ],
920
+ "model_logits": [
921
+ 1.3572660684585571,
922
+ -1.3808159828186035,
923
+ 2.8994479179382324,
924
+ 1.6390485763549805,
925
+ -11.19175910949707,
926
+ 2.5846855640411377,
927
+ 2.7333574295043945,
928
+ 1.3355464935302734,
929
+ -15.045228004455566,
930
+ -12.471461296081543,
931
+ 2.690511465072632,
932
+ 2.2058544158935547,
933
+ 16.3038272857666,
934
+ 1.5606690645217896,
935
+ -2.86368989944458,
936
+ 11.665703773498535
937
+ ],
938
+ "model_value": 0.2911173105239868,
939
+ "model_checkpoint": "r42_1500M.pkl",
940
+ "model_architecture": "simba_aux"
941
+ },
942
+ {
943
+ "turn": 15,
944
+ "player": 0,
945
+ "phase": "draw",
946
+ "action": 1,
947
+ "action_desc": "Human drew 4\u2660 from discard",
948
+ "card_drawn": 3,
949
+ "hands": [
950
+ [
951
+ 5,
952
+ 2,
953
+ 6,
954
+ 33,
955
+ 17,
956
+ 7,
957
+ 4,
958
+ 18,
959
+ 25,
960
+ 23
961
+ ],
962
+ [
963
+ 21,
964
+ 19,
965
+ 38,
966
+ 47,
967
+ 45,
968
+ 43,
969
+ 36,
970
+ 37,
971
+ 14,
972
+ 0
973
+ ]
974
+ ],
975
+ "hand_sizes": [
976
+ 10,
977
+ 10
978
+ ],
979
+ "discard_pile": [
980
+ 44,
981
+ 51,
982
+ 29,
983
+ 49,
984
+ 3
985
+ ],
986
+ "discard_size": 5,
987
+ "stock_remaining": 27,
988
+ "deadwood": [
989
+ 42,
990
+ 40
991
+ ],
992
+ "model_logits": null,
993
+ "model_value": null,
994
+ "model_checkpoint": "r42_1500M.pkl",
995
+ "model_architecture": "simba_aux"
996
+ },
997
+ {
998
+ "turn": 16,
999
+ "player": 0,
1000
+ "phase": "discard",
1001
+ "action": 5,
1002
+ "action_desc": "Human discarded 8\u2666",
1003
+ "card_drawn": null,
1004
+ "hands": [
1005
+ [
1006
+ 5,
1007
+ 2,
1008
+ 6,
1009
+ 33,
1010
+ 17,
1011
+ 7,
1012
+ 4,
1013
+ 18,
1014
+ 25,
1015
+ 23,
1016
+ 3
1017
+ ],
1018
+ [
1019
+ 21,
1020
+ 19,
1021
+ 38,
1022
+ 47,
1023
+ 45,
1024
+ 43,
1025
+ 36,
1026
+ 37,
1027
+ 14,
1028
+ 0
1029
+ ]
1030
+ ],
1031
+ "hand_sizes": [
1032
+ 11,
1033
+ 10
1034
+ ],
1035
+ "discard_pile": [
1036
+ 44,
1037
+ 51,
1038
+ 29,
1039
+ 49
1040
+ ],
1041
+ "discard_size": 4,
1042
+ "stock_remaining": 27,
1043
+ "deadwood": [
1044
+ 39,
1045
+ 40
1046
+ ],
1047
+ "model_logits": null,
1048
+ "model_value": null,
1049
+ "model_checkpoint": "r42_1500M.pkl",
1050
+ "model_architecture": "simba_aux"
1051
+ },
1052
+ {
1053
+ "turn": 17,
1054
+ "player": 1,
1055
+ "phase": "draw",
1056
+ "action": 0,
1057
+ "action_desc": "Model drew from stock",
1058
+ "card_drawn": 13,
1059
+ "hands": [
1060
+ [
1061
+ 5,
1062
+ 2,
1063
+ 6,
1064
+ 3,
1065
+ 17,
1066
+ 7,
1067
+ 4,
1068
+ 18,
1069
+ 25,
1070
+ 23
1071
+ ],
1072
+ [
1073
+ 21,
1074
+ 19,
1075
+ 38,
1076
+ 47,
1077
+ 45,
1078
+ 43,
1079
+ 36,
1080
+ 37,
1081
+ 14,
1082
+ 0
1083
+ ]
1084
+ ],
1085
+ "hand_sizes": [
1086
+ 10,
1087
+ 10
1088
+ ],
1089
+ "discard_pile": [
1090
+ 44,
1091
+ 51,
1092
+ 29,
1093
+ 49,
1094
+ 33
1095
+ ],
1096
+ "discard_size": 5,
1097
+ "stock_remaining": 27,
1098
+ "deadwood": [
1099
+ 31,
1100
+ 40
1101
+ ],
1102
+ "model_logits": [
1103
+ 8.066147804260254,
1104
+ -8.0846529006958,
1105
+ 9.601375579833984,
1106
+ 6.709384441375732,
1107
+ -6.27523946762085,
1108
+ 4.711665153503418,
1109
+ 5.822248935699463,
1110
+ 0.8416484594345093,
1111
+ -11.963319778442383,
1112
+ -9.035999298095703,
1113
+ -0.22975727915763855,
1114
+ -0.3294707238674164,
1115
+ 1.2679355144500732,
1116
+ 2.0494441986083984,
1117
+ -3.489938735961914,
1118
+ 12.807523727416992
1119
+ ],
1120
+ "model_value": 0.16435781121253967,
1121
+ "model_checkpoint": "r42_1500M.pkl",
1122
+ "model_architecture": "simba_aux"
1123
+ },
1124
+ {
1125
+ "turn": 18,
1126
+ "player": 1,
1127
+ "phase": "discard",
1128
+ "action": 2,
1129
+ "action_desc": "Model discarded 9\u2665",
1130
+ "card_drawn": null,
1131
+ "hands": [
1132
+ [
1133
+ 5,
1134
+ 2,
1135
+ 6,
1136
+ 3,
1137
+ 17,
1138
+ 7,
1139
+ 4,
1140
+ 18,
1141
+ 25,
1142
+ 23
1143
+ ],
1144
+ [
1145
+ 21,
1146
+ 19,
1147
+ 38,
1148
+ 47,
1149
+ 45,
1150
+ 43,
1151
+ 36,
1152
+ 37,
1153
+ 14,
1154
+ 0,
1155
+ 13
1156
+ ]
1157
+ ],
1158
+ "hand_sizes": [
1159
+ 10,
1160
+ 11
1161
+ ],
1162
+ "discard_pile": [
1163
+ 44,
1164
+ 51,
1165
+ 29,
1166
+ 49,
1167
+ 33
1168
+ ],
1169
+ "discard_size": 5,
1170
+ "stock_remaining": 26,
1171
+ "deadwood": [
1172
+ 31,
1173
+ 41
1174
+ ],
1175
+ "model_logits": [
1176
+ -2.77071213722229,
1177
+ 2.749574661254883,
1178
+ 12.901314735412598,
1179
+ 10.536663055419922,
1180
+ -10.191338539123535,
1181
+ 10.192569732666016,
1182
+ 9.449311256408691,
1183
+ 0.58781898021698,
1184
+ -16.14435577392578,
1185
+ -12.884576797485352,
1186
+ -3.6686058044433594,
1187
+ -2.897615909576416,
1188
+ -3.726410150527954,
1189
+ 1.92379629611969,
1190
+ -2.3426313400268555,
1191
+ 5.508289813995361
1192
+ ],
1193
+ "model_value": 0.1654054969549179,
1194
+ "model_checkpoint": "r42_1500M.pkl",
1195
+ "model_architecture": "simba_aux"
1196
+ },
1197
+ {
1198
+ "turn": 19,
1199
+ "player": 0,
1200
+ "phase": "draw",
1201
+ "action": 0,
1202
+ "action_desc": "Human drew from stock",
1203
+ "card_drawn": 31,
1204
+ "hands": [
1205
+ [
1206
+ 5,
1207
+ 2,
1208
+ 6,
1209
+ 3,
1210
+ 17,
1211
+ 7,
1212
+ 4,
1213
+ 18,
1214
+ 25,
1215
+ 23
1216
+ ],
1217
+ [
1218
+ 13,
1219
+ 19,
1220
+ 38,
1221
+ 47,
1222
+ 45,
1223
+ 43,
1224
+ 36,
1225
+ 37,
1226
+ 14,
1227
+ 0
1228
+ ]
1229
+ ],
1230
+ "hand_sizes": [
1231
+ 10,
1232
+ 10
1233
+ ],
1234
+ "discard_pile": [
1235
+ 44,
1236
+ 51,
1237
+ 29,
1238
+ 49,
1239
+ 33,
1240
+ 21
1241
+ ],
1242
+ "discard_size": 6,
1243
+ "stock_remaining": 26,
1244
+ "deadwood": [
1245
+ 31,
1246
+ 32
1247
+ ],
1248
+ "model_logits": null,
1249
+ "model_value": null,
1250
+ "model_checkpoint": "r42_1500M.pkl",
1251
+ "model_architecture": "simba_aux"
1252
+ },
1253
+ {
1254
+ "turn": 20,
1255
+ "player": 0,
1256
+ "phase": "discard",
1257
+ "action": 10,
1258
+ "action_desc": "Human discarded K\u2665",
1259
+ "card_drawn": null,
1260
+ "hands": [
1261
+ [
1262
+ 5,
1263
+ 2,
1264
+ 6,
1265
+ 3,
1266
+ 17,
1267
+ 7,
1268
+ 4,
1269
+ 18,
1270
+ 25,
1271
+ 23,
1272
+ 31
1273
+ ],
1274
+ [
1275
+ 13,
1276
+ 19,
1277
+ 38,
1278
+ 47,
1279
+ 45,
1280
+ 43,
1281
+ 36,
1282
+ 37,
1283
+ 14,
1284
+ 0
1285
+ ]
1286
+ ],
1287
+ "hand_sizes": [
1288
+ 11,
1289
+ 10
1290
+ ],
1291
+ "discard_pile": [
1292
+ 44,
1293
+ 51,
1294
+ 29,
1295
+ 49,
1296
+ 33,
1297
+ 21
1298
+ ],
1299
+ "discard_size": 6,
1300
+ "stock_remaining": 25,
1301
+ "deadwood": [
1302
+ 37,
1303
+ 32
1304
+ ],
1305
+ "model_logits": null,
1306
+ "model_value": null,
1307
+ "model_checkpoint": "r42_1500M.pkl",
1308
+ "model_architecture": "simba_aux"
1309
+ },
1310
+ {
1311
+ "turn": 21,
1312
+ "player": 1,
1313
+ "phase": "draw",
1314
+ "action": 0,
1315
+ "action_desc": "Model drew from stock",
1316
+ "card_drawn": 1,
1317
+ "hands": [
1318
+ [
1319
+ 5,
1320
+ 2,
1321
+ 6,
1322
+ 3,
1323
+ 17,
1324
+ 7,
1325
+ 4,
1326
+ 18,
1327
+ 31,
1328
+ 23
1329
+ ],
1330
+ [
1331
+ 13,
1332
+ 19,
1333
+ 38,
1334
+ 47,
1335
+ 45,
1336
+ 43,
1337
+ 36,
1338
+ 37,
1339
+ 14,
1340
+ 0
1341
+ ]
1342
+ ],
1343
+ "hand_sizes": [
1344
+ 10,
1345
+ 10
1346
+ ],
1347
+ "discard_pile": [
1348
+ 44,
1349
+ 51,
1350
+ 29,
1351
+ 49,
1352
+ 33,
1353
+ 21,
1354
+ 25
1355
+ ],
1356
+ "discard_size": 7,
1357
+ "stock_remaining": 25,
1358
+ "deadwood": [
1359
+ 27,
1360
+ 32
1361
+ ],
1362
+ "model_logits": [
1363
+ 6.26929235458374,
1364
+ -6.2877702713012695,
1365
+ -0.0012273131869733334,
1366
+ 5.2356743812561035,
1367
+ -5.459696292877197,
1368
+ 11.210075378417969,
1369
+ 6.015535831451416,
1370
+ 2.988128185272217,
1371
+ -12.3079252243042,
1372
+ -7.972535610198975,
1373
+ -0.8080845475196838,
1374
+ -0.3211536407470703,
1375
+ 0.16582894325256348,
1376
+ 0.542965292930603,
1377
+ -1.9980934858322144,
1378
+ 13.446394920349121
1379
+ ],
1380
+ "model_value": 0.1807677000761032,
1381
+ "model_checkpoint": "r42_1500M.pkl",
1382
+ "model_architecture": "simba_aux"
1383
+ },
1384
+ {
1385
+ "turn": 22,
1386
+ "player": 1,
1387
+ "phase": "discard",
1388
+ "action": 5,
1389
+ "action_desc": "Model discarded 9\u2663",
1390
+ "card_drawn": null,
1391
+ "hands": [
1392
+ [
1393
+ 5,
1394
+ 2,
1395
+ 6,
1396
+ 3,
1397
+ 17,
1398
+ 7,
1399
+ 4,
1400
+ 18,
1401
+ 31,
1402
+ 23
1403
+ ],
1404
+ [
1405
+ 13,
1406
+ 19,
1407
+ 38,
1408
+ 47,
1409
+ 45,
1410
+ 43,
1411
+ 36,
1412
+ 37,
1413
+ 14,
1414
+ 0,
1415
+ 1
1416
+ ]
1417
+ ],
1418
+ "hand_sizes": [
1419
+ 10,
1420
+ 11
1421
+ ],
1422
+ "discard_pile": [
1423
+ 44,
1424
+ 51,
1425
+ 29,
1426
+ 49,
1427
+ 33,
1428
+ 21,
1429
+ 25
1430
+ ],
1431
+ "discard_size": 7,
1432
+ "stock_remaining": 24,
1433
+ "deadwood": [
1434
+ 27,
1435
+ 34
1436
+ ],
1437
+ "model_logits": [
1438
+ -4.491512775421143,
1439
+ 4.460123062133789,
1440
+ -2.2431042194366455,
1441
+ 9.859258651733398,
1442
+ -9.673826217651367,
1443
+ 19.081336975097656,
1444
+ 7.023276329040527,
1445
+ 4.194875240325928,
1446
+ -17.16871452331543,
1447
+ -10.632189750671387,
1448
+ -2.711754560470581,
1449
+ -4.07654333114624,
1450
+ -1.7465616464614868,
1451
+ 0.35864073038101196,
1452
+ -0.7489283680915833,
1453
+ 5.912528038024902
1454
+ ],
1455
+ "model_value": 0.25027400255203247,
1456
+ "model_checkpoint": "r42_1500M.pkl",
1457
+ "model_architecture": "simba_aux"
1458
+ },
1459
+ {
1460
+ "turn": 23,
1461
+ "player": 0,
1462
+ "phase": "draw",
1463
+ "action": 0,
1464
+ "action_desc": "Human drew from stock",
1465
+ "card_drawn": 46,
1466
+ "hands": [
1467
+ [
1468
+ 5,
1469
+ 2,
1470
+ 6,
1471
+ 3,
1472
+ 17,
1473
+ 7,
1474
+ 4,
1475
+ 18,
1476
+ 31,
1477
+ 23
1478
+ ],
1479
+ [
1480
+ 13,
1481
+ 19,
1482
+ 38,
1483
+ 1,
1484
+ 45,
1485
+ 43,
1486
+ 36,
1487
+ 37,
1488
+ 14,
1489
+ 0
1490
+ ]
1491
+ ],
1492
+ "hand_sizes": [
1493
+ 10,
1494
+ 10
1495
+ ],
1496
+ "discard_pile": [
1497
+ 44,
1498
+ 51,
1499
+ 29,
1500
+ 49,
1501
+ 33,
1502
+ 21,
1503
+ 25,
1504
+ 47
1505
+ ],
1506
+ "discard_size": 8,
1507
+ "stock_remaining": 24,
1508
+ "deadwood": [
1509
+ 27,
1510
+ 25
1511
+ ],
1512
+ "model_logits": null,
1513
+ "model_value": null,
1514
+ "model_checkpoint": "r42_1500M.pkl",
1515
+ "model_architecture": "simba_aux"
1516
+ },
1517
+ {
1518
+ "turn": 24,
1519
+ "player": 0,
1520
+ "phase": "discard",
1521
+ "action": 11,
1522
+ "action_desc": "Human discarded J\u2665",
1523
+ "card_drawn": null,
1524
+ "hands": [
1525
+ [
1526
+ 5,
1527
+ 2,
1528
+ 6,
1529
+ 3,
1530
+ 17,
1531
+ 7,
1532
+ 4,
1533
+ 18,
1534
+ 31,
1535
+ 23,
1536
+ 46
1537
+ ],
1538
+ [
1539
+ 13,
1540
+ 19,
1541
+ 38,
1542
+ 1,
1543
+ 45,
1544
+ 43,
1545
+ 36,
1546
+ 37,
1547
+ 14,
1548
+ 0
1549
+ ]
1550
+ ],
1551
+ "hand_sizes": [
1552
+ 11,
1553
+ 10
1554
+ ],
1555
+ "discard_pile": [
1556
+ 44,
1557
+ 51,
1558
+ 29,
1559
+ 49,
1560
+ 33,
1561
+ 21,
1562
+ 25,
1563
+ 47
1564
+ ],
1565
+ "discard_size": 8,
1566
+ "stock_remaining": 23,
1567
+ "deadwood": [
1568
+ 35,
1569
+ 25
1570
+ ],
1571
+ "model_logits": null,
1572
+ "model_value": null,
1573
+ "model_checkpoint": "r42_1500M.pkl",
1574
+ "model_architecture": "simba_aux"
1575
+ },
1576
+ {
1577
+ "turn": 25,
1578
+ "player": 1,
1579
+ "phase": "draw",
1580
+ "action": 0,
1581
+ "action_desc": "Model drew from stock",
1582
+ "card_drawn": 11,
1583
+ "hands": [
1584
+ [
1585
+ 5,
1586
+ 2,
1587
+ 6,
1588
+ 3,
1589
+ 17,
1590
+ 7,
1591
+ 4,
1592
+ 18,
1593
+ 31,
1594
+ 46
1595
+ ],
1596
+ [
1597
+ 13,
1598
+ 19,
1599
+ 38,
1600
+ 1,
1601
+ 45,
1602
+ 43,
1603
+ 36,
1604
+ 37,
1605
+ 14,
1606
+ 0
1607
+ ]
1608
+ ],
1609
+ "hand_sizes": [
1610
+ 10,
1611
+ 10
1612
+ ],
1613
+ "discard_pile": [
1614
+ 44,
1615
+ 51,
1616
+ 29,
1617
+ 49,
1618
+ 33,
1619
+ 21,
1620
+ 25,
1621
+ 47,
1622
+ 23
1623
+ ],
1624
+ "discard_size": 9,
1625
+ "stock_remaining": 23,
1626
+ "deadwood": [
1627
+ 25,
1628
+ 25
1629
+ ],
1630
+ "model_logits": [
1631
+ 10.157042503356934,
1632
+ -10.170555114746094,
1633
+ 0.7261695265769958,
1634
+ 7.496785640716553,
1635
+ -7.870490074157715,
1636
+ 0.6724360585212708,
1637
+ 11.823999404907227,
1638
+ 3.638632297515869,
1639
+ -10.330878257751465,
1640
+ -8.757818222045898,
1641
+ -0.029917217791080475,
1642
+ 1.5304876565933228,
1643
+ 2.7648935317993164,
1644
+ 1.7297470569610596,
1645
+ -3.4055967330932617,
1646
+ 13.965205192565918
1647
+ ],
1648
+ "model_value": 0.2112925946712494,
1649
+ "model_checkpoint": "r42_1500M.pkl",
1650
+ "model_architecture": "simba_aux"
1651
+ },
1652
+ {
1653
+ "turn": 26,
1654
+ "player": 1,
1655
+ "phase": "discard",
1656
+ "action": 12,
1657
+ "action_desc": "Model discarded Q\u2660",
1658
+ "card_drawn": null,
1659
+ "hands": [
1660
+ [
1661
+ 5,
1662
+ 2,
1663
+ 6,
1664
+ 3,
1665
+ 17,
1666
+ 7,
1667
+ 4,
1668
+ 18,
1669
+ 31,
1670
+ 46
1671
+ ],
1672
+ [
1673
+ 13,
1674
+ 19,
1675
+ 38,
1676
+ 1,
1677
+ 45,
1678
+ 43,
1679
+ 36,
1680
+ 37,
1681
+ 14,
1682
+ 0,
1683
+ 11
1684
+ ]
1685
+ ],
1686
+ "hand_sizes": [
1687
+ 10,
1688
+ 11
1689
+ ],
1690
+ "discard_pile": [
1691
+ 44,
1692
+ 51,
1693
+ 29,
1694
+ 49,
1695
+ 33,
1696
+ 21,
1697
+ 25,
1698
+ 47,
1699
+ 23
1700
+ ],
1701
+ "discard_size": 9,
1702
+ "stock_remaining": 22,
1703
+ "deadwood": [
1704
+ 25,
1705
+ 35
1706
+ ],
1707
+ "model_logits": [
1708
+ 4.846325874328613,
1709
+ -4.868699550628662,
1710
+ -0.11475139111280441,
1711
+ 8.090211868286133,
1712
+ -11.844910621643066,
1713
+ -1.5905108451843262,
1714
+ 10.211276054382324,
1715
+ 2.7191364765167236,
1716
+ -13.504241943359375,
1717
+ -11.84048080444336,
1718
+ -2.415240526199341,
1719
+ -1.146047592163086,
1720
+ 18.196413040161133,
1721
+ 8.024925231933594,
1722
+ -9.367064476013184,
1723
+ 10.640433311462402
1724
+ ],
1725
+ "model_value": 0.3169029951095581,
1726
+ "model_checkpoint": "r42_1500M.pkl",
1727
+ "model_architecture": "simba_aux"
1728
+ },
1729
+ {
1730
+ "turn": 27,
1731
+ "player": 0,
1732
+ "phase": "draw",
1733
+ "action": 0,
1734
+ "action_desc": "Human drew from stock",
1735
+ "card_drawn": 10,
1736
+ "hands": [
1737
+ [
1738
+ 5,
1739
+ 2,
1740
+ 6,
1741
+ 3,
1742
+ 17,
1743
+ 7,
1744
+ 4,
1745
+ 18,
1746
+ 31,
1747
+ 46
1748
+ ],
1749
+ [
1750
+ 13,
1751
+ 19,
1752
+ 38,
1753
+ 1,
1754
+ 45,
1755
+ 43,
1756
+ 36,
1757
+ 37,
1758
+ 14,
1759
+ 0
1760
+ ]
1761
+ ],
1762
+ "hand_sizes": [
1763
+ 10,
1764
+ 10
1765
+ ],
1766
+ "discard_pile": [
1767
+ 44,
1768
+ 51,
1769
+ 29,
1770
+ 49,
1771
+ 33,
1772
+ 21,
1773
+ 25,
1774
+ 47,
1775
+ 23,
1776
+ 11
1777
+ ],
1778
+ "discard_size": 10,
1779
+ "stock_remaining": 22,
1780
+ "deadwood": [
1781
+ 25,
1782
+ 25
1783
+ ],
1784
+ "model_logits": null,
1785
+ "model_value": null,
1786
+ "model_checkpoint": "r42_1500M.pkl",
1787
+ "model_architecture": "simba_aux"
1788
+ },
1789
+ {
1790
+ "turn": 28,
1791
+ "player": 0,
1792
+ "phase": "discard",
1793
+ "action": 12,
1794
+ "action_desc": "Human discarded J\u2660",
1795
+ "card_drawn": null,
1796
+ "hands": [
1797
+ [
1798
+ 5,
1799
+ 2,
1800
+ 6,
1801
+ 3,
1802
+ 17,
1803
+ 7,
1804
+ 4,
1805
+ 18,
1806
+ 31,
1807
+ 46,
1808
+ 10
1809
+ ],
1810
+ [
1811
+ 13,
1812
+ 19,
1813
+ 38,
1814
+ 1,
1815
+ 45,
1816
+ 43,
1817
+ 36,
1818
+ 37,
1819
+ 14,
1820
+ 0
1821
+ ]
1822
+ ],
1823
+ "hand_sizes": [
1824
+ 11,
1825
+ 10
1826
+ ],
1827
+ "discard_pile": [
1828
+ 44,
1829
+ 51,
1830
+ 29,
1831
+ 49,
1832
+ 33,
1833
+ 21,
1834
+ 25,
1835
+ 47,
1836
+ 23,
1837
+ 11
1838
+ ],
1839
+ "discard_size": 10,
1840
+ "stock_remaining": 21,
1841
+ "deadwood": [
1842
+ 35,
1843
+ 25
1844
+ ],
1845
+ "model_logits": null,
1846
+ "model_value": null,
1847
+ "model_checkpoint": "r42_1500M.pkl",
1848
+ "model_architecture": "simba_aux"
1849
+ },
1850
+ {
1851
+ "turn": 29,
1852
+ "player": 1,
1853
+ "phase": "draw",
1854
+ "action": 0,
1855
+ "action_desc": "Model drew from stock",
1856
+ "card_drawn": 34,
1857
+ "hands": [
1858
+ [
1859
+ 5,
1860
+ 2,
1861
+ 6,
1862
+ 3,
1863
+ 17,
1864
+ 7,
1865
+ 4,
1866
+ 18,
1867
+ 31,
1868
+ 46
1869
+ ],
1870
+ [
1871
+ 13,
1872
+ 19,
1873
+ 38,
1874
+ 1,
1875
+ 45,
1876
+ 43,
1877
+ 36,
1878
+ 37,
1879
+ 14,
1880
+ 0
1881
+ ]
1882
+ ],
1883
+ "hand_sizes": [
1884
+ 10,
1885
+ 10
1886
+ ],
1887
+ "discard_pile": [
1888
+ 44,
1889
+ 51,
1890
+ 29,
1891
+ 49,
1892
+ 33,
1893
+ 21,
1894
+ 25,
1895
+ 47,
1896
+ 23,
1897
+ 11,
1898
+ 10
1899
+ ],
1900
+ "discard_size": 11,
1901
+ "stock_remaining": 21,
1902
+ "deadwood": [
1903
+ 25,
1904
+ 25
1905
+ ],
1906
+ "model_logits": [
1907
+ 8.433442115783691,
1908
+ -8.445843696594238,
1909
+ 0.2269468605518341,
1910
+ 8.16739273071289,
1911
+ -8.541677474975586,
1912
+ 0.7132872939109802,
1913
+ 11.54711627960205,
1914
+ 3.1492228507995605,
1915
+ -9.632174491882324,
1916
+ -8.679957389831543,
1917
+ -0.501717209815979,
1918
+ 1.9759103059768677,
1919
+ 2.1433303356170654,
1920
+ 1.842934489250183,
1921
+ -3.460728645324707,
1922
+ 13.90916919708252
1923
+ ],
1924
+ "model_value": 0.226350337266922,
1925
+ "model_checkpoint": "r42_1500M.pkl",
1926
+ "model_architecture": "simba_aux"
1927
+ },
1928
+ {
1929
+ "turn": 30,
1930
+ "player": 1,
1931
+ "phase": "discard",
1932
+ "action": 12,
1933
+ "action_desc": "Model discarded 9\u2666",
1934
+ "card_drawn": null,
1935
+ "hands": [
1936
+ [
1937
+ 5,
1938
+ 2,
1939
+ 6,
1940
+ 3,
1941
+ 17,
1942
+ 7,
1943
+ 4,
1944
+ 18,
1945
+ 31,
1946
+ 46
1947
+ ],
1948
+ [
1949
+ 13,
1950
+ 19,
1951
+ 38,
1952
+ 1,
1953
+ 45,
1954
+ 43,
1955
+ 36,
1956
+ 37,
1957
+ 14,
1958
+ 0,
1959
+ 34
1960
+ ]
1961
+ ],
1962
+ "hand_sizes": [
1963
+ 10,
1964
+ 11
1965
+ ],
1966
+ "discard_pile": [
1967
+ 44,
1968
+ 51,
1969
+ 29,
1970
+ 49,
1971
+ 33,
1972
+ 21,
1973
+ 25,
1974
+ 47,
1975
+ 23,
1976
+ 11,
1977
+ 10
1978
+ ],
1979
+ "discard_size": 11,
1980
+ "stock_remaining": 20,
1981
+ "deadwood": [
1982
+ 25,
1983
+ 34
1984
+ ],
1985
+ "model_logits": [
1986
+ 5.0039753913879395,
1987
+ -5.023728370666504,
1988
+ -1.5188764333724976,
1989
+ 8.051513671875,
1990
+ -12.160207748413086,
1991
+ -1.6672836542129517,
1992
+ 8.416703224182129,
1993
+ 0.8420174717903137,
1994
+ -11.68520450592041,
1995
+ -11.156301498413086,
1996
+ -1.8939532041549683,
1997
+ -1.459132194519043,
1998
+ 20.891910552978516,
1999
+ 8.046717643737793,
2000
+ -9.465716361999512,
2001
+ 11.172394752502441
2002
+ ],
2003
+ "model_value": 0.3114898204803467,
2004
+ "model_checkpoint": "r42_1500M.pkl",
2005
+ "model_architecture": "simba_aux"
2006
+ },
2007
+ {
2008
+ "turn": 31,
2009
+ "player": 0,
2010
+ "phase": "draw",
2011
+ "action": 0,
2012
+ "action_desc": "Human drew from stock",
2013
+ "card_drawn": 24,
2014
+ "hands": [
2015
+ [
2016
+ 5,
2017
+ 2,
2018
+ 6,
2019
+ 3,
2020
+ 17,
2021
+ 7,
2022
+ 4,
2023
+ 18,
2024
+ 31,
2025
+ 46
2026
+ ],
2027
+ [
2028
+ 13,
2029
+ 19,
2030
+ 38,
2031
+ 1,
2032
+ 45,
2033
+ 43,
2034
+ 36,
2035
+ 37,
2036
+ 14,
2037
+ 0
2038
+ ]
2039
+ ],
2040
+ "hand_sizes": [
2041
+ 10,
2042
+ 10
2043
+ ],
2044
+ "discard_pile": [
2045
+ 44,
2046
+ 51,
2047
+ 29,
2048
+ 49,
2049
+ 33,
2050
+ 21,
2051
+ 25,
2052
+ 47,
2053
+ 23,
2054
+ 11,
2055
+ 10,
2056
+ 34
2057
+ ],
2058
+ "discard_size": 12,
2059
+ "stock_remaining": 20,
2060
+ "deadwood": [
2061
+ 25,
2062
+ 25
2063
+ ],
2064
+ "model_logits": null,
2065
+ "model_value": null,
2066
+ "model_checkpoint": "r42_1500M.pkl",
2067
+ "model_architecture": "simba_aux"
2068
+ },
2069
+ {
2070
+ "turn": 32,
2071
+ "player": 0,
2072
+ "phase": "discard",
2073
+ "action": 12,
2074
+ "action_desc": "Human discarded Q\u2665",
2075
+ "card_drawn": null,
2076
+ "hands": [
2077
+ [
2078
+ 5,
2079
+ 2,
2080
+ 6,
2081
+ 3,
2082
+ 17,
2083
+ 7,
2084
+ 4,
2085
+ 18,
2086
+ 31,
2087
+ 46,
2088
+ 24
2089
+ ],
2090
+ [
2091
+ 13,
2092
+ 19,
2093
+ 38,
2094
+ 1,
2095
+ 45,
2096
+ 43,
2097
+ 36,
2098
+ 37,
2099
+ 14,
2100
+ 0
2101
+ ]
2102
+ ],
2103
+ "hand_sizes": [
2104
+ 11,
2105
+ 10
2106
+ ],
2107
+ "discard_pile": [
2108
+ 44,
2109
+ 51,
2110
+ 29,
2111
+ 49,
2112
+ 33,
2113
+ 21,
2114
+ 25,
2115
+ 47,
2116
+ 23,
2117
+ 11,
2118
+ 10,
2119
+ 34
2120
+ ],
2121
+ "discard_size": 12,
2122
+ "stock_remaining": 19,
2123
+ "deadwood": [
2124
+ 35,
2125
+ 25
2126
+ ],
2127
+ "model_logits": null,
2128
+ "model_value": null,
2129
+ "model_checkpoint": "r42_1500M.pkl",
2130
+ "model_architecture": "simba_aux"
2131
+ },
2132
+ {
2133
+ "turn": 33,
2134
+ "player": 1,
2135
+ "phase": "draw",
2136
+ "action": 0,
2137
+ "action_desc": "Model drew from stock",
2138
+ "card_drawn": 27,
2139
+ "hands": [
2140
+ [
2141
+ 5,
2142
+ 2,
2143
+ 6,
2144
+ 3,
2145
+ 17,
2146
+ 7,
2147
+ 4,
2148
+ 18,
2149
+ 31,
2150
+ 46
2151
+ ],
2152
+ [
2153
+ 13,
2154
+ 19,
2155
+ 38,
2156
+ 1,
2157
+ 45,
2158
+ 43,
2159
+ 36,
2160
+ 37,
2161
+ 14,
2162
+ 0
2163
+ ]
2164
+ ],
2165
+ "hand_sizes": [
2166
+ 10,
2167
+ 10
2168
+ ],
2169
+ "discard_pile": [
2170
+ 44,
2171
+ 51,
2172
+ 29,
2173
+ 49,
2174
+ 33,
2175
+ 21,
2176
+ 25,
2177
+ 47,
2178
+ 23,
2179
+ 11,
2180
+ 10,
2181
+ 34,
2182
+ 24
2183
+ ],
2184
+ "discard_size": 13,
2185
+ "stock_remaining": 19,
2186
+ "deadwood": [
2187
+ 25,
2188
+ 25
2189
+ ],
2190
+ "model_logits": [
2191
+ 9.665902137756348,
2192
+ -9.677483558654785,
2193
+ 0.6676115393638611,
2194
+ 7.765767574310303,
2195
+ -7.992070198059082,
2196
+ 0.6821295022964478,
2197
+ 12.180002212524414,
2198
+ 2.483808755874634,
2199
+ -8.4148530960083,
2200
+ -8.724776268005371,
2201
+ -0.7120273113250732,
2202
+ 1.5324976444244385,
2203
+ 2.654120445251465,
2204
+ 1.5488325357437134,
2205
+ -3.1580612659454346,
2206
+ 13.590743064880371
2207
+ ],
2208
+ "model_value": 0.19069039821624756,
2209
+ "model_checkpoint": "r42_1500M.pkl",
2210
+ "model_architecture": "simba_aux"
2211
+ },
2212
+ {
2213
+ "turn": 34,
2214
+ "player": 1,
2215
+ "phase": "discard",
2216
+ "action": 3,
2217
+ "action_desc": "Model discarded 7\u2665",
2218
+ "card_drawn": null,
2219
+ "hands": [
2220
+ [
2221
+ 5,
2222
+ 2,
2223
+ 6,
2224
+ 3,
2225
+ 17,
2226
+ 7,
2227
+ 4,
2228
+ 18,
2229
+ 31,
2230
+ 46
2231
+ ],
2232
+ [
2233
+ 13,
2234
+ 19,
2235
+ 38,
2236
+ 1,
2237
+ 45,
2238
+ 43,
2239
+ 36,
2240
+ 37,
2241
+ 14,
2242
+ 0,
2243
+ 27
2244
+ ]
2245
+ ],
2246
+ "hand_sizes": [
2247
+ 10,
2248
+ 11
2249
+ ],
2250
+ "discard_pile": [
2251
+ 44,
2252
+ 51,
2253
+ 29,
2254
+ 49,
2255
+ 33,
2256
+ 21,
2257
+ 25,
2258
+ 47,
2259
+ 23,
2260
+ 11,
2261
+ 10,
2262
+ 34,
2263
+ 24
2264
+ ],
2265
+ "discard_size": 13,
2266
+ "stock_remaining": 18,
2267
+ "deadwood": [
2268
+ 25,
2269
+ 21
2270
+ ],
2271
+ "model_logits": [
2272
+ 1.4226794242858887,
2273
+ -1.4492957592010498,
2274
+ 0.5585522055625916,
2275
+ 16.642166137695312,
2276
+ -15.224808692932129,
2277
+ -0.010581095702946186,
2278
+ 15.518430709838867,
2279
+ 7.496588230133057,
2280
+ -15.361716270446777,
2281
+ -15.838196754455566,
2282
+ -0.8002575039863586,
2283
+ 1.2340660095214844,
2284
+ 2.604884147644043,
2285
+ 0.6524441838264465,
2286
+ -1.1755698919296265,
2287
+ 4.939535617828369
2288
+ ],
2289
+ "model_value": 0.3216603994369507,
2290
+ "model_checkpoint": "r42_1500M.pkl",
2291
+ "model_architecture": "simba_aux"
2292
+ },
2293
+ {
2294
+ "turn": 35,
2295
+ "player": 0,
2296
+ "phase": "draw",
2297
+ "action": 1,
2298
+ "action_desc": "Human drew 7\u2665 from discard",
2299
+ "card_drawn": 19,
2300
+ "hands": [
2301
+ [
2302
+ 5,
2303
+ 2,
2304
+ 6,
2305
+ 3,
2306
+ 17,
2307
+ 7,
2308
+ 4,
2309
+ 18,
2310
+ 31,
2311
+ 46
2312
+ ],
2313
+ [
2314
+ 13,
2315
+ 27,
2316
+ 38,
2317
+ 1,
2318
+ 45,
2319
+ 43,
2320
+ 36,
2321
+ 37,
2322
+ 14,
2323
+ 0
2324
+ ]
2325
+ ],
2326
+ "hand_sizes": [
2327
+ 10,
2328
+ 10
2329
+ ],
2330
+ "discard_pile": [
2331
+ 44,
2332
+ 51,
2333
+ 29,
2334
+ 49,
2335
+ 33,
2336
+ 21,
2337
+ 25,
2338
+ 47,
2339
+ 23,
2340
+ 11,
2341
+ 10,
2342
+ 34,
2343
+ 24,
2344
+ 19
2345
+ ],
2346
+ "discard_size": 14,
2347
+ "stock_remaining": 18,
2348
+ "deadwood": [
2349
+ 25,
2350
+ 14
2351
+ ],
2352
+ "model_logits": null,
2353
+ "model_value": null,
2354
+ "model_checkpoint": "r42_1500M.pkl",
2355
+ "model_architecture": "simba_aux"
2356
+ },
2357
+ {
2358
+ "turn": 36,
2359
+ "player": 0,
2360
+ "phase": "discard",
2361
+ "action": 11,
2362
+ "action_desc": "Human discarded 8\u2663",
2363
+ "card_drawn": null,
2364
+ "hands": [
2365
+ [
2366
+ 5,
2367
+ 2,
2368
+ 6,
2369
+ 3,
2370
+ 17,
2371
+ 7,
2372
+ 4,
2373
+ 18,
2374
+ 31,
2375
+ 46,
2376
+ 19
2377
+ ],
2378
+ [
2379
+ 13,
2380
+ 27,
2381
+ 38,
2382
+ 1,
2383
+ 45,
2384
+ 43,
2385
+ 36,
2386
+ 37,
2387
+ 14,
2388
+ 0
2389
+ ]
2390
+ ],
2391
+ "hand_sizes": [
2392
+ 11,
2393
+ 10
2394
+ ],
2395
+ "discard_pile": [
2396
+ 44,
2397
+ 51,
2398
+ 29,
2399
+ 49,
2400
+ 33,
2401
+ 21,
2402
+ 25,
2403
+ 47,
2404
+ 23,
2405
+ 11,
2406
+ 10,
2407
+ 34,
2408
+ 24
2409
+ ],
2410
+ "discard_size": 13,
2411
+ "stock_remaining": 18,
2412
+ "deadwood": [
2413
+ 14,
2414
+ 14
2415
+ ],
2416
+ "model_logits": null,
2417
+ "model_value": null,
2418
+ "model_checkpoint": "r42_1500M.pkl",
2419
+ "model_architecture": "simba_aux"
2420
+ },
2421
+ {
2422
+ "turn": 37,
2423
+ "player": 0,
2424
+ "phase": "knock_decision",
2425
+ "action": 14,
2426
+ "action_desc": "Human knocked",
2427
+ "card_drawn": null,
2428
+ "hands": [
2429
+ [
2430
+ 5,
2431
+ 2,
2432
+ 6,
2433
+ 3,
2434
+ 17,
2435
+ 7,
2436
+ 4,
2437
+ 18,
2438
+ 31,
2439
+ 19
2440
+ ],
2441
+ [
2442
+ 13,
2443
+ 27,
2444
+ 38,
2445
+ 1,
2446
+ 45,
2447
+ 43,
2448
+ 36,
2449
+ 37,
2450
+ 14,
2451
+ 0
2452
+ ]
2453
+ ],
2454
+ "hand_sizes": [
2455
+ 10,
2456
+ 10
2457
+ ],
2458
+ "discard_pile": [
2459
+ 44,
2460
+ 51,
2461
+ 29,
2462
+ 49,
2463
+ 33,
2464
+ 21,
2465
+ 25,
2466
+ 47,
2467
+ 23,
2468
+ 11,
2469
+ 10,
2470
+ 34,
2471
+ 24,
2472
+ 46
2473
+ ],
2474
+ "discard_size": 14,
2475
+ "stock_remaining": 18,
2476
+ "deadwood": [
2477
+ 6,
2478
+ 14
2479
+ ],
2480
+ "model_logits": null,
2481
+ "model_value": null,
2482
+ "model_checkpoint": "r42_1500M.pkl",
2483
+ "model_architecture": "simba_aux"
2484
+ },
2485
+ {
2486
+ "turn": 38,
2487
+ "player": -1,
2488
+ "phase": "game_over",
2489
+ "action": -1,
2490
+ "action_desc": "Game over.",
2491
+ "card_drawn": null,
2492
+ "hands": [
2493
+ [
2494
+ 5,
2495
+ 2,
2496
+ 6,
2497
+ 3,
2498
+ 17,
2499
+ 7,
2500
+ 4,
2501
+ 18,
2502
+ 31,
2503
+ 19
2504
+ ],
2505
+ [
2506
+ 13,
2507
+ 27,
2508
+ 38,
2509
+ 1,
2510
+ 45,
2511
+ 43,
2512
+ 36,
2513
+ 37,
2514
+ 14,
2515
+ 0
2516
+ ]
2517
+ ],
2518
+ "hand_sizes": [
2519
+ 10,
2520
+ 10
2521
+ ],
2522
+ "discard_pile": [
2523
+ 44,
2524
+ 51,
2525
+ 29,
2526
+ 49,
2527
+ 33,
2528
+ 21,
2529
+ 25,
2530
+ 47,
2531
+ 23,
2532
+ 11,
2533
+ 10,
2534
+ 34,
2535
+ 24,
2536
+ 46
2537
+ ],
2538
+ "discard_size": 14,
2539
+ "stock_remaining": 18,
2540
+ "deadwood": [
2541
+ 6,
2542
+ 14
2543
+ ],
2544
+ "model_logits": null,
2545
+ "model_value": null,
2546
+ "model_checkpoint": "r42_1500M.pkl",
2547
+ "model_architecture": "simba_aux"
2548
+ }
2549
+ ]
2550
+ }
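The integer card IDs in `hands`, `discard_pile`, and `card_drawn` can be cross-checked against the `action_desc` strings in the records above. For example, turn 26 reads "Model discarded Q♠" and card 11 then appears on the discard pile, and turn 30 reads "Model discarded 9♦" and card 34 follows. The sketch below assumes that the encoding is `suit * 13 + rank`, with suits ordered spades, hearts, diamonds, clubs and ranks running from ace (0) to king (12). That encoding is inferred from this log, not from any documented schema. The helper name `card_name` and the "10" spelling for tens are hypothetical.

```python
# Assumed encoding (inferred from the log, not documented): id = suit * 13 + rank.
RANKS = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
SUITS = ["\u2660", "\u2665", "\u2666", "\u2663"]  # spades, hearts, diamonds, clubs


def card_name(card_id: int) -> str:
    """Return a label such as 'Q\u2660' for a card ID in the range 0..51."""
    if not 0 <= card_id < 52:
        raise ValueError(f"card id out of range: {card_id}")
    suit, rank = divmod(card_id, 13)
    return RANKS[rank] + SUITS[suit]


if __name__ == "__main__":
    # Player 0's hand as recorded in the final (game_over) entry.
    final_hand = [5, 2, 6, 3, 17, 7, 4, 18, 31, 19]
    print(" ".join(card_name(c) for c in final_hand))
```

Under this assumed encoding, player 0's final hand decodes to a spade run from 3 to 8, a heart run from 5 to 7, and a lone 6♦, which agrees with the recorded deadwood of 6 for that player.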