Alex-GSL committed on
Commit 3465962 · verified · 1 Parent(s): 1773eff

Upload human_games/game_20260325_001249_5.json with huggingface_hub
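
The uploaded file is a per-turn game log with integer card ids and optional model outputs. A minimal sketch for reading it follows. The card encoding (`card_id = suit * 13 + rank`, suits ordered ♠ ♥ ♦ ♣, ace as rank 0) is an assumption inferred from the `action_desc` strings in this log (for example 51 is logged as K♣ and 9 as 10♠); it is not documented in the commit. The helper names `card_name` and `model_turns` are hypothetical.

```python
import json

# Assumed encoding, inferred from this log only: card_id = suit * 13 + rank.
RANKS = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
SUITS = ["\u2660", "\u2665", "\u2666", "\u2663"]  # spades, hearts, diamonds, clubs


def card_name(card_id: int) -> str:
    """Turn an integer card id (0..51) into a label such as 'K\u2663'."""
    if not 0 <= card_id < 52:
        raise ValueError(f"card id out of range: {card_id}")
    suit, rank = divmod(card_id, 13)
    return RANKS[rank] + SUITS[suit]


def model_turns(game: dict) -> list:
    """Collect (turn, phase, action_desc, model_value) for entries that log a model value."""
    return [
        (a["turn"], a["phase"], a["action_desc"], a["model_value"])
        for a in game["actions"]
        if a.get("model_value") is not None
    ]


def load_game(path: str) -> dict:
    """Read one game log from disk."""
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)
```

With a local copy of the file, `model_turns(load_game("human_games/game_20260325_001249_5.json"))` would list only the entries where `model_value` is not null, which in this log are the model's own turns.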

human_games/game_20260325_001249_5.json ADDED
@@ -0,0 +1,2351 @@
1
+ {
2
+ "game_number": 5,
3
+ "timestamp": "2026-03-25T00:12:49.955354+00:00",
4
+ "checkpoint": "checkpoints/r42_800M.pkl",
5
+ "architecture": "simba_aux",
6
+ "model_checkpoint": "r42_800M.pkl",
7
+ "model_architecture": "simba_aux",
8
+ "result": "model_win",
9
+ "final_scores": [
10
+ 0,
11
+ 22
12
+ ],
13
+ "total_turns": 16,
14
+ "actions": [
15
+ {
16
+ "turn": 0,
17
+ "player": -1,
18
+ "phase": "draw",
19
+ "action": -1,
20
+ "action_desc": "Initial deal",
21
+ "card_drawn": null,
22
+ "hands": [
23
+ [
24
+ 47,
25
+ 3,
26
+ 31,
27
+ 51,
28
+ 5,
29
+ 15,
30
+ 9,
31
+ 27,
32
+ 30,
33
+ 13
34
+ ],
35
+ [
36
+ 2,
37
+ 29,
38
+ 8,
39
+ 42,
40
+ 45,
41
+ 7,
42
+ 43,
43
+ 21,
44
+ 36,
45
+ 12
46
+ ]
47
+ ],
48
+ "hand_sizes": [
49
+ 10,
50
+ 10
51
+ ],
52
+ "discard_pile": [
53
+ 23
54
+ ],
55
+ "discard_size": 1,
56
+ "stock_remaining": 31,
57
+ "deadwood": [
58
+ 56,
59
+ 69
60
+ ],
61
+ "model_logits": null,
62
+ "model_value": null,
63
+ "model_checkpoint": "r42_800M.pkl",
64
+ "model_architecture": "simba_aux"
65
+ },
66
+ {
67
+ "turn": 1,
68
+ "player": 0,
69
+ "phase": "draw",
70
+ "action": 0,
71
+ "action_desc": "Human drew from stock",
72
+ "card_drawn": 48,
73
+ "hands": [
74
+ [
75
+ 47,
76
+ 3,
77
+ 31,
78
+ 51,
79
+ 5,
80
+ 15,
81
+ 9,
82
+ 27,
83
+ 30,
84
+ 13
85
+ ],
86
+ [
87
+ 2,
88
+ 29,
89
+ 8,
90
+ 42,
91
+ 45,
92
+ 7,
93
+ 43,
94
+ 21,
95
+ 36,
96
+ 12
97
+ ]
98
+ ],
99
+ "hand_sizes": [
100
+ 10,
101
+ 10
102
+ ],
103
+ "discard_pile": [
104
+ 23
105
+ ],
106
+ "discard_size": 1,
107
+ "stock_remaining": 31,
108
+ "deadwood": [
109
+ 56,
110
+ 69
111
+ ],
112
+ "model_logits": null,
113
+ "model_value": null,
114
+ "model_checkpoint": "r42_800M.pkl",
115
+ "model_architecture": "simba_aux"
116
+ },
117
+ {
118
+ "turn": 2,
119
+ "player": 0,
120
+ "phase": "discard",
121
+ "action": 5,
122
+ "action_desc": "Human discarded K\u2663",
123
+ "card_drawn": null,
124
+ "hands": [
125
+ [
126
+ 47,
127
+ 3,
128
+ 31,
129
+ 51,
130
+ 5,
131
+ 15,
132
+ 9,
133
+ 27,
134
+ 30,
135
+ 13,
136
+ 48
137
+ ],
138
+ [
139
+ 2,
140
+ 29,
141
+ 8,
142
+ 42,
143
+ 45,
144
+ 7,
145
+ 43,
146
+ 21,
147
+ 36,
148
+ 12
149
+ ]
150
+ ],
151
+ "hand_sizes": [
152
+ 11,
153
+ 10
154
+ ],
155
+ "discard_pile": [
156
+ 23
157
+ ],
158
+ "discard_size": 1,
159
+ "stock_remaining": 30,
160
+ "deadwood": [
161
+ 66,
162
+ 69
163
+ ],
164
+ "model_logits": null,
165
+ "model_value": null,
166
+ "model_checkpoint": "r42_800M.pkl",
167
+ "model_architecture": "simba_aux"
168
+ },
169
+ {
170
+ "turn": 3,
171
+ "player": 1,
172
+ "phase": "draw",
173
+ "action": 0,
174
+ "action_desc": "Model drew from stock",
175
+ "card_drawn": 11,
176
+ "hands": [
177
+ [
178
+ 47,
179
+ 3,
180
+ 31,
181
+ 48,
182
+ 5,
183
+ 15,
184
+ 9,
185
+ 27,
186
+ 30,
187
+ 13
188
+ ],
189
+ [
190
+ 2,
191
+ 29,
192
+ 8,
193
+ 42,
194
+ 45,
195
+ 7,
196
+ 43,
197
+ 21,
198
+ 36,
199
+ 12
200
+ ]
201
+ ],
202
+ "hand_sizes": [
203
+ 10,
204
+ 10
205
+ ],
206
+ "discard_pile": [
207
+ 23,
208
+ 51
209
+ ],
210
+ "discard_size": 2,
211
+ "stock_remaining": 30,
212
+ "deadwood": [
213
+ 56,
214
+ 69
215
+ ],
216
+ "model_logits": [
217
+ 0.44646427035331726,
218
+ -0.44516491889953613,
219
+ 2.005288600921631,
220
+ -0.7088584303855896,
221
+ -2.6995675563812256,
222
+ -1.8335438966751099,
223
+ 1.838926076889038,
224
+ -4.08890962600708,
225
+ -0.399880051612854,
226
+ -1.8007315397262573,
227
+ 2.0132405757904053,
228
+ -0.1954866349697113,
229
+ 2.959669351577759,
230
+ -1.5346155166625977,
231
+ -0.060077957808971405,
232
+ 15.1032133102417
233
+ ],
234
+ "model_value": 0.19617769122123718,
235
+ "model_checkpoint": "r42_800M.pkl",
236
+ "model_architecture": "simba_aux"
237
+ },
238
+ {
239
+ "turn": 4,
240
+ "player": 1,
241
+ "phase": "discard",
242
+ "action": 6,
243
+ "action_desc": "Model discarded 7\u2663",
244
+ "card_drawn": null,
245
+ "hands": [
246
+ [
247
+ 47,
248
+ 3,
249
+ 31,
250
+ 48,
251
+ 5,
252
+ 15,
253
+ 9,
254
+ 27,
255
+ 30,
256
+ 13
257
+ ],
258
+ [
259
+ 2,
260
+ 29,
261
+ 8,
262
+ 42,
263
+ 45,
264
+ 7,
265
+ 43,
266
+ 21,
267
+ 36,
268
+ 12,
269
+ 11
270
+ ]
271
+ ],
272
+ "hand_sizes": [
273
+ 10,
274
+ 11
275
+ ],
276
+ "discard_pile": [
277
+ 23,
278
+ 51
279
+ ],
280
+ "discard_size": 2,
281
+ "stock_remaining": 29,
282
+ "deadwood": [
283
+ 56,
284
+ 79
285
+ ],
286
+ "model_logits": [
287
+ 0.22540131211280823,
288
+ -0.22753995656967163,
289
+ 2.407101631164551,
290
+ 0.6800909638404846,
291
+ -2.173002243041992,
292
+ -0.08242562413215637,
293
+ 2.898866891860962,
294
+ -1.2171683311462402,
295
+ 0.9923494458198547,
296
+ -1.960150122642517,
297
+ -0.17256639897823334,
298
+ -3.667513847351074,
299
+ -1.1051957607269287,
300
+ 2.7059309482574463,
301
+ -4.0940327644348145,
302
+ 13.52048397064209
303
+ ],
304
+ "model_value": 0.3150976300239563,
305
+ "model_checkpoint": "r42_800M.pkl",
306
+ "model_architecture": "simba_aux"
307
+ },
308
+ {
309
+ "turn": 5,
310
+ "player": 0,
311
+ "phase": "draw",
312
+ "action": 0,
313
+ "action_desc": "Human drew from stock",
314
+ "card_drawn": 17,
315
+ "hands": [
316
+ [
317
+ 47,
318
+ 3,
319
+ 31,
320
+ 48,
321
+ 5,
322
+ 15,
323
+ 9,
324
+ 27,
325
+ 30,
326
+ 13
327
+ ],
328
+ [
329
+ 2,
330
+ 29,
331
+ 8,
332
+ 42,
333
+ 11,
334
+ 7,
335
+ 43,
336
+ 21,
337
+ 36,
338
+ 12
339
+ ]
340
+ ],
341
+ "hand_sizes": [
342
+ 10,
343
+ 10
344
+ ],
345
+ "discard_pile": [
346
+ 23,
347
+ 51,
348
+ 45
349
+ ],
350
+ "discard_size": 3,
351
+ "stock_remaining": 29,
352
+ "deadwood": [
353
+ 56,
354
+ 72
355
+ ],
356
+ "model_logits": null,
357
+ "model_value": null,
358
+ "model_checkpoint": "r42_800M.pkl",
359
+ "model_architecture": "simba_aux"
360
+ },
361
+ {
362
+ "turn": 6,
363
+ "player": 0,
364
+ "phase": "discard",
365
+ "action": 8,
366
+ "action_desc": "Human discarded 10\u2660",
367
+ "card_drawn": null,
368
+ "hands": [
369
+ [
370
+ 47,
371
+ 3,
372
+ 31,
373
+ 48,
374
+ 5,
375
+ 15,
376
+ 9,
377
+ 27,
378
+ 30,
379
+ 13,
380
+ 17
381
+ ],
382
+ [
383
+ 2,
384
+ 29,
385
+ 8,
386
+ 42,
387
+ 11,
388
+ 7,
389
+ 43,
390
+ 21,
391
+ 36,
392
+ 12
393
+ ]
394
+ ],
395
+ "hand_sizes": [
396
+ 11,
397
+ 10
398
+ ],
399
+ "discard_pile": [
400
+ 23,
401
+ 51,
402
+ 45
403
+ ],
404
+ "discard_size": 3,
405
+ "stock_remaining": 28,
406
+ "deadwood": [
407
+ 61,
408
+ 72
409
+ ],
410
+ "model_logits": null,
411
+ "model_value": null,
412
+ "model_checkpoint": "r42_800M.pkl",
413
+ "model_architecture": "simba_aux"
414
+ },
415
+ {
416
+ "turn": 7,
417
+ "player": 1,
418
+ "phase": "draw",
419
+ "action": 1,
420
+ "action_desc": "Model drew 10\u2660 from discard",
421
+ "card_drawn": 9,
422
+ "hands": [
423
+ [
424
+ 47,
425
+ 3,
426
+ 31,
427
+ 48,
428
+ 5,
429
+ 15,
430
+ 17,
431
+ 27,
432
+ 30,
433
+ 13
434
+ ],
435
+ [
436
+ 2,
437
+ 29,
438
+ 8,
439
+ 42,
440
+ 11,
441
+ 7,
442
+ 43,
443
+ 21,
444
+ 36,
445
+ 12
446
+ ]
447
+ ],
448
+ "hand_sizes": [
449
+ 10,
450
+ 10
451
+ ],
452
+ "discard_pile": [
453
+ 23,
454
+ 51,
455
+ 45,
456
+ 9
457
+ ],
458
+ "discard_size": 4,
459
+ "stock_remaining": 28,
460
+ "deadwood": [
461
+ 51,
462
+ 72
463
+ ],
464
+ "model_logits": [
465
+ -12.26740837097168,
466
+ 12.26699447631836,
467
+ 1.8883110284805298,
468
+ -1.626328706741333,
469
+ -1.6464855670928955,
470
+ -2.814018726348877,
471
+ -1.3795942068099976,
472
+ -2.46687388420105,
473
+ -1.8701295852661133,
474
+ -0.34300312399864197,
475
+ 2.6106715202331543,
476
+ 2.051229476928711,
477
+ -3.2833993434906006,
478
+ -1.2803606986999512,
479
+ 0.7393525838851929,
480
+ 8.850516319274902
481
+ ],
482
+ "model_value": 0.16748379170894623,
483
+ "model_checkpoint": "r42_800M.pkl",
484
+ "model_architecture": "simba_aux"
485
+ },
486
+ {
487
+ "turn": 8,
488
+ "player": 1,
489
+ "phase": "discard",
490
+ "action": 10,
491
+ "action_desc": "Model discarded J\u2666",
492
+ "card_drawn": null,
493
+ "hands": [
494
+ [
495
+ 47,
496
+ 3,
497
+ 31,
498
+ 48,
499
+ 5,
500
+ 15,
501
+ 17,
502
+ 27,
503
+ 30,
504
+ 13
505
+ ],
506
+ [
507
+ 2,
508
+ 29,
509
+ 8,
510
+ 42,
511
+ 11,
512
+ 7,
513
+ 43,
514
+ 21,
515
+ 36,
516
+ 12,
517
+ 9
518
+ ]
519
+ ],
520
+ "hand_sizes": [
521
+ 10,
522
+ 11
523
+ ],
524
+ "discard_pile": [
525
+ 23,
526
+ 51,
527
+ 45
528
+ ],
529
+ "discard_size": 3,
530
+ "stock_remaining": 28,
531
+ "deadwood": [
532
+ 51,
533
+ 55
534
+ ],
535
+ "model_logits": [
536
+ -3.3303606510162354,
537
+ 3.322096824645996,
538
+ 3.56318998336792,
539
+ -0.1697646826505661,
540
+ -7.255303382873535,
541
+ 0.6792615056037903,
542
+ 1.5580886602401733,
543
+ -9.08933162689209,
544
+ 0.20594502985477448,
545
+ -1.4213473796844482,
546
+ 7.503976345062256,
547
+ 2.17415189743042,
548
+ -2.7449615001678467,
549
+ 0.8932405114173889,
550
+ -1.8213993310928345,
551
+ 10.873382568359375
552
+ ],
553
+ "model_value": 0.266734778881073,
554
+ "model_checkpoint": "r42_800M.pkl",
555
+ "model_architecture": "simba_aux"
556
+ },
557
+ {
558
+ "turn": 9,
559
+ "player": 0,
560
+ "phase": "draw",
561
+ "action": 0,
562
+ "action_desc": "Human drew from stock",
563
+ "card_drawn": 20,
564
+ "hands": [
565
+ [
566
+ 47,
567
+ 3,
568
+ 31,
569
+ 48,
570
+ 5,
571
+ 15,
572
+ 17,
573
+ 27,
574
+ 30,
575
+ 13
576
+ ],
577
+ [
578
+ 2,
579
+ 29,
580
+ 8,
581
+ 42,
582
+ 11,
583
+ 7,
584
+ 43,
585
+ 21,
586
+ 9,
587
+ 12
588
+ ]
589
+ ],
590
+ "hand_sizes": [
591
+ 10,
592
+ 10
593
+ ],
594
+ "discard_pile": [
595
+ 23,
596
+ 51,
597
+ 45,
598
+ 36
599
+ ],
600
+ "discard_size": 4,
601
+ "stock_remaining": 28,
602
+ "deadwood": [
603
+ 51,
604
+ 45
605
+ ],
606
+ "model_logits": null,
607
+ "model_value": null,
608
+ "model_checkpoint": "r42_800M.pkl",
609
+ "model_architecture": "simba_aux"
610
+ },
611
+ {
612
+ "turn": 10,
613
+ "player": 0,
614
+ "phase": "discard",
615
+ "action": 12,
616
+ "action_desc": "Human discarded 8\u2665",
617
+ "card_drawn": null,
618
+ "hands": [
619
+ [
620
+ 47,
621
+ 3,
622
+ 31,
623
+ 48,
624
+ 5,
625
+ 15,
626
+ 17,
627
+ 27,
628
+ 30,
629
+ 13,
630
+ 20
631
+ ],
632
+ [
633
+ 2,
634
+ 29,
635
+ 8,
636
+ 42,
637
+ 11,
638
+ 7,
639
+ 43,
640
+ 21,
641
+ 9,
642
+ 12
643
+ ]
644
+ ],
645
+ "hand_sizes": [
646
+ 11,
647
+ 10
648
+ ],
649
+ "discard_pile": [
650
+ 23,
651
+ 51,
652
+ 45,
653
+ 36
654
+ ],
655
+ "discard_size": 4,
656
+ "stock_remaining": 27,
657
+ "deadwood": [
658
+ 59,
659
+ 45
660
+ ],
661
+ "model_logits": null,
662
+ "model_value": null,
663
+ "model_checkpoint": "r42_800M.pkl",
664
+ "model_architecture": "simba_aux"
665
+ },
666
+ {
667
+ "turn": 11,
668
+ "player": 1,
669
+ "phase": "draw",
670
+ "action": 0,
671
+ "action_desc": "Model drew from stock",
672
+ "card_drawn": 10,
673
+ "hands": [
674
+ [
675
+ 47,
676
+ 3,
677
+ 31,
678
+ 48,
679
+ 5,
680
+ 15,
681
+ 17,
682
+ 27,
683
+ 30,
684
+ 13
685
+ ],
686
+ [
687
+ 2,
688
+ 29,
689
+ 8,
690
+ 42,
691
+ 11,
692
+ 7,
693
+ 43,
694
+ 21,
695
+ 9,
696
+ 12
697
+ ]
698
+ ],
699
+ "hand_sizes": [
700
+ 10,
701
+ 10
702
+ ],
703
+ "discard_pile": [
704
+ 23,
705
+ 51,
706
+ 45,
707
+ 36,
708
+ 20
709
+ ],
710
+ "discard_size": 5,
711
+ "stock_remaining": 27,
712
+ "deadwood": [
713
+ 51,
714
+ 45
715
+ ],
716
+ "model_logits": [
717
+ 1.8350306749343872,
718
+ -1.8555736541748047,
719
+ 0.7017938494682312,
720
+ 0.699691653251648,
721
+ -3.9626903533935547,
722
+ -1.1517935991287231,
723
+ 2.5774996280670166,
724
+ -6.230175495147705,
725
+ 1.0678907632827759,
726
+ 3.7306838035583496,
727
+ -5.8910980224609375,
728
+ 6.788896560668945,
729
+ 1.6548861265182495,
730
+ 3.7875125408172607,
731
+ -5.116235733032227,
732
+ 13.08728313446045
733
+ ],
734
+ "model_value": 0.13876299560070038,
735
+ "model_checkpoint": "r42_800M.pkl",
736
+ "model_architecture": "simba_aux"
737
+ },
738
+ {
739
+ "turn": 12,
740
+ "player": 1,
741
+ "phase": "discard",
742
+ "action": 9,
743
+ "action_desc": "Model discarded 9\u2665",
744
+ "card_drawn": null,
745
+ "hands": [
746
+ [
747
+ 47,
748
+ 3,
749
+ 31,
750
+ 48,
751
+ 5,
752
+ 15,
753
+ 17,
754
+ 27,
755
+ 30,
756
+ 13
757
+ ],
758
+ [
759
+ 2,
760
+ 29,
761
+ 8,
762
+ 42,
763
+ 11,
764
+ 7,
765
+ 43,
766
+ 21,
767
+ 9,
768
+ 12,
769
+ 10
770
+ ]
771
+ ],
772
+ "hand_sizes": [
773
+ 10,
774
+ 11
775
+ ],
776
+ "discard_pile": [
777
+ 23,
778
+ 51,
779
+ 45,
780
+ 36,
781
+ 20
782
+ ],
783
+ "discard_size": 5,
784
+ "stock_remaining": 26,
785
+ "deadwood": [
786
+ 51,
787
+ 25
788
+ ],
789
+ "model_logits": [
790
+ -2.2246735095977783,
791
+ 2.20810604095459,
792
+ 4.102224349975586,
793
+ 3.0280911922454834,
794
+ -7.31381368637085,
795
+ 2.561394214630127,
796
+ -3.572284460067749,
797
+ -0.1446799784898758,
798
+ 5.87237548828125,
799
+ 9.524531364440918,
800
+ -11.957315444946289,
801
+ 3.309128522872925,
802
+ -8.70164966583252,
803
+ 3.2824041843414307,
804
+ -3.70316219329834,
805
+ 5.874724388122559
806
+ ],
807
+ "model_value": 0.303559273481369,
808
+ "model_checkpoint": "r42_800M.pkl",
809
+ "model_architecture": "simba_aux"
810
+ },
811
+ {
812
+ "turn": 13,
813
+ "player": 0,
814
+ "phase": "draw",
815
+ "action": 0,
816
+ "action_desc": "Human drew from stock",
817
+ "card_drawn": 6,
818
+ "hands": [
819
+ [
820
+ 47,
821
+ 3,
822
+ 31,
823
+ 48,
824
+ 5,
825
+ 15,
826
+ 17,
827
+ 27,
828
+ 30,
829
+ 13
830
+ ],
831
+ [
832
+ 2,
833
+ 29,
834
+ 8,
835
+ 42,
836
+ 11,
837
+ 7,
838
+ 43,
839
+ 10,
840
+ 9,
841
+ 12
842
+ ]
843
+ ],
844
+ "hand_sizes": [
845
+ 10,
846
+ 10
847
+ ],
848
+ "discard_pile": [
849
+ 23,
850
+ 51,
851
+ 45,
852
+ 36,
853
+ 20,
854
+ 21
855
+ ],
856
+ "discard_size": 6,
857
+ "stock_remaining": 26,
858
+ "deadwood": [
859
+ 51,
860
+ 16
861
+ ],
862
+ "model_logits": null,
863
+ "model_value": null,
864
+ "model_checkpoint": "r42_800M.pkl",
865
+ "model_architecture": "simba_aux"
866
+ },
867
+ {
868
+ "turn": 14,
869
+ "player": 0,
870
+ "phase": "discard",
871
+ "action": 5,
872
+ "action_desc": "Human discarded 10\u2663",
873
+ "card_drawn": null,
874
+ "hands": [
875
+ [
876
+ 47,
877
+ 3,
878
+ 31,
879
+ 48,
880
+ 5,
881
+ 15,
882
+ 17,
883
+ 27,
884
+ 30,
885
+ 13,
886
+ 6
887
+ ],
888
+ [
889
+ 2,
890
+ 29,
891
+ 8,
892
+ 42,
893
+ 11,
894
+ 7,
895
+ 43,
896
+ 10,
897
+ 9,
898
+ 12
899
+ ]
900
+ ],
901
+ "hand_sizes": [
902
+ 11,
903
+ 10
904
+ ],
905
+ "discard_pile": [
906
+ 23,
907
+ 51,
908
+ 45,
909
+ 36,
910
+ 20,
911
+ 21
912
+ ],
913
+ "discard_size": 6,
914
+ "stock_remaining": 25,
915
+ "deadwood": [
916
+ 58,
917
+ 16
918
+ ],
919
+ "model_logits": null,
920
+ "model_value": null,
921
+ "model_checkpoint": "r42_800M.pkl",
922
+ "model_architecture": "simba_aux"
923
+ },
924
+ {
925
+ "turn": 15,
926
+ "player": 1,
927
+ "phase": "draw",
928
+ "action": 0,
929
+ "action_desc": "Model drew from stock",
930
+ "card_drawn": 14,
931
+ "hands": [
932
+ [
933
+ 47,
934
+ 3,
935
+ 31,
936
+ 6,
937
+ 5,
938
+ 15,
939
+ 17,
940
+ 27,
941
+ 30,
942
+ 13
943
+ ],
944
+ [
945
+ 2,
946
+ 29,
947
+ 8,
948
+ 42,
949
+ 11,
950
+ 7,
951
+ 43,
952
+ 10,
953
+ 9,
954
+ 12
955
+ ]
956
+ ],
957
+ "hand_sizes": [
958
+ 10,
959
+ 10
960
+ ],
961
+ "discard_pile": [
962
+ 23,
963
+ 51,
964
+ 45,
965
+ 36,
966
+ 20,
967
+ 21,
968
+ 48
969
+ ],
970
+ "discard_size": 7,
971
+ "stock_remaining": 25,
972
+ "deadwood": [
973
+ 48,
974
+ 16
975
+ ],
976
+ "model_logits": [
977
+ 5.166488170623779,
978
+ -5.19210958480835,
979
+ 4.5680413246154785,
980
+ 3.404137134552002,
981
+ 0.34513992071151733,
982
+ 4.458003997802734,
983
+ -4.01824951171875,
984
+ 0.876509964466095,
985
+ 4.626875877380371,
986
+ -7.867835521697998,
987
+ -6.73263692855835,
988
+ 2.9618117809295654,
989
+ -0.6598952412605286,
990
+ 0.059613440185785294,
991
+ -1.489869236946106,
992
+ 12.89390754699707
993
+ ],
994
+ "model_value": 0.37604159116744995,
995
+ "model_checkpoint": "r42_800M.pkl",
996
+ "model_architecture": "simba_aux"
997
+ },
998
+ {
999
+ "turn": 16,
1000
+ "player": 1,
1001
+ "phase": "discard",
1002
+ "action": 5,
1003
+ "action_desc": "Model discarded 4\u2663",
1004
+ "card_drawn": null,
1005
+ "hands": [
1006
+ [
1007
+ 47,
1008
+ 3,
1009
+ 31,
1010
+ 6,
1011
+ 5,
1012
+ 15,
1013
+ 17,
1014
+ 27,
1015
+ 30,
1016
+ 13
1017
+ ],
1018
+ [
1019
+ 2,
1020
+ 29,
1021
+ 8,
1022
+ 42,
1023
+ 11,
1024
+ 7,
1025
+ 43,
1026
+ 10,
1027
+ 9,
1028
+ 12,
1029
+ 14
1030
+ ]
1031
+ ],
1032
+ "hand_sizes": [
1033
+ 10,
1034
+ 11
1035
+ ],
1036
+ "discard_pile": [
1037
+ 23,
1038
+ 51,
1039
+ 45,
1040
+ 36,
1041
+ 20,
1042
+ 21,
1043
+ 48
1044
+ ],
1045
+ "discard_size": 7,
1046
+ "stock_remaining": 24,
1047
+ "deadwood": [
1048
+ 48,
1049
+ 18
1050
+ ],
1051
+ "model_logits": [
1052
+ 2.1318228244781494,
1053
+ -2.164736032485962,
1054
+ 5.450942516326904,
1055
+ 3.822159767150879,
1056
+ -2.391251802444458,
1057
+ 7.0785088539123535,
1058
+ -5.183215618133545,
1059
+ 2.5668752193450928,
1060
+ 7.5528669357299805,
1061
+ -12.599595069885254,
1062
+ -11.118485450744629,
1063
+ 4.190390586853027,
1064
+ 0.32790911197662354,
1065
+ -2.6046624183654785,
1066
+ 1.5989207029342651,
1067
+ 9.116048812866211
1068
+ ],
1069
+ "model_value": 0.45257148146629333,
1070
+ "model_checkpoint": "r42_800M.pkl",
1071
+ "model_architecture": "simba_aux"
1072
+ },
1073
+ {
1074
+ "turn": 17,
1075
+ "player": 0,
1076
+ "phase": "draw",
1077
+ "action": 0,
1078
+ "action_desc": "Human drew from stock",
1079
+ "card_drawn": 16,
1080
+ "hands": [
1081
+ [
1082
+ 47,
1083
+ 3,
1084
+ 31,
1085
+ 6,
1086
+ 5,
1087
+ 15,
1088
+ 17,
1089
+ 27,
1090
+ 30,
1091
+ 13
1092
+ ],
1093
+ [
1094
+ 2,
1095
+ 29,
1096
+ 8,
1097
+ 14,
1098
+ 11,
1099
+ 7,
1100
+ 43,
1101
+ 10,
1102
+ 9,
1103
+ 12
1104
+ ]
1105
+ ],
1106
+ "hand_sizes": [
1107
+ 10,
1108
+ 10
1109
+ ],
1110
+ "discard_pile": [
1111
+ 23,
1112
+ 51,
1113
+ 45,
1114
+ 36,
1115
+ 20,
1116
+ 21,
1117
+ 48,
1118
+ 42
1119
+ ],
1120
+ "discard_size": 8,
1121
+ "stock_remaining": 24,
1122
+ "deadwood": [
1123
+ 48,
1124
+ 14
1125
+ ],
1126
+ "model_logits": null,
1127
+ "model_value": null,
1128
+ "model_checkpoint": "r42_800M.pkl",
1129
+ "model_architecture": "simba_aux"
1130
+ },
1131
+ {
1132
+ "turn": 18,
1133
+ "player": 0,
1134
+ "phase": "discard",
1135
+ "action": 2,
1136
+ "action_desc": "Human discarded 9\u2663",
1137
+ "card_drawn": null,
1138
+ "hands": [
1139
+ [
1140
+ 47,
1141
+ 3,
1142
+ 31,
1143
+ 6,
1144
+ 5,
1145
+ 15,
1146
+ 17,
1147
+ 27,
1148
+ 30,
1149
+ 13,
1150
+ 16
1151
+ ],
1152
+ [
1153
+ 2,
1154
+ 29,
1155
+ 8,
1156
+ 14,
1157
+ 11,
1158
+ 7,
1159
+ 43,
1160
+ 10,
1161
+ 9,
1162
+ 12
1163
+ ]
1164
+ ],
1165
+ "hand_sizes": [
1166
+ 11,
1167
+ 10
1168
+ ],
1169
+ "discard_pile": [
1170
+ 23,
1171
+ 51,
1172
+ 45,
1173
+ 36,
1174
+ 20,
1175
+ 21,
1176
+ 48,
1177
+ 42
1178
+ ],
1179
+ "discard_size": 8,
1180
+ "stock_remaining": 23,
1181
+ "deadwood": [
1182
+ 40,
1183
+ 14
1184
+ ],
1185
+ "model_logits": null,
1186
+ "model_value": null,
1187
+ "model_checkpoint": "r42_800M.pkl",
1188
+ "model_architecture": "simba_aux"
1189
+ },
1190
+ {
1191
+ "turn": 19,
1192
+ "player": 1,
1193
+ "phase": "draw",
1194
+ "action": 0,
1195
+ "action_desc": "Model drew from stock",
1196
+ "card_drawn": 4,
1197
+ "hands": [
1198
+ [
1199
+ 16,
1200
+ 3,
1201
+ 31,
1202
+ 6,
1203
+ 5,
1204
+ 15,
1205
+ 17,
1206
+ 27,
1207
+ 30,
1208
+ 13
1209
+ ],
1210
+ [
1211
+ 2,
1212
+ 29,
1213
+ 8,
1214
+ 14,
1215
+ 11,
1216
+ 7,
1217
+ 43,
1218
+ 10,
1219
+ 9,
1220
+ 12
1221
+ ]
1222
+ ],
1223
+ "hand_sizes": [
1224
+ 10,
1225
+ 10
1226
+ ],
1227
+ "discard_pile": [
1228
+ 23,
1229
+ 51,
1230
+ 45,
1231
+ 36,
1232
+ 20,
1233
+ 21,
1234
+ 48,
1235
+ 42,
1236
+ 47
1237
+ ],
1238
+ "discard_size": 9,
1239
+ "stock_remaining": 23,
1240
+ "deadwood": [
1241
+ 31,
1242
+ 14
1243
+ ],
1244
+ "model_logits": [
1245
+ 4.420629978179932,
1246
+ -4.447947025299072,
1247
+ 3.50484037399292,
1248
+ 5.919126987457275,
1249
+ -1.6009573936462402,
1250
+ 2.61783504486084,
1251
+ -3.265421152114868,
1252
+ 1.7501583099365234,
1253
+ 6.085115432739258,
1254
+ -8.871217727661133,
1255
+ -7.1262030601501465,
1256
+ 3.755293130874634,
1257
+ -1.0539367198944092,
1258
+ -1.6321419477462769,
1259
+ 0.30441853404045105,
1260
+ 12.216235160827637
1261
+ ],
1262
+ "model_value": 0.4005470275878906,
1263
+ "model_checkpoint": "r42_800M.pkl",
1264
+ "model_architecture": "simba_aux"
1265
+ },
1266
+ {
1267
+ "turn": 20,
1268
+ "player": 1,
1269
+ "phase": "discard",
1270
+ "action": 8,
1271
+ "action_desc": "Model discarded 5\u2663",
1272
+ "card_drawn": null,
1273
+ "hands": [
1274
+ [
1275
+ 16,
1276
+ 3,
1277
+ 31,
1278
+ 6,
1279
+ 5,
1280
+ 15,
1281
+ 17,
1282
+ 27,
1283
+ 30,
1284
+ 13
1285
+ ],
1286
+ [
1287
+ 2,
1288
+ 29,
1289
+ 8,
1290
+ 14,
1291
+ 11,
1292
+ 7,
1293
+ 43,
1294
+ 10,
1295
+ 9,
1296
+ 12,
1297
+ 4
1298
+ ]
1299
+ ],
1300
+ "hand_sizes": [
1301
+ 10,
1302
+ 11
1303
+ ],
1304
+ "discard_pile": [
1305
+ 23,
1306
+ 51,
1307
+ 45,
1308
+ 36,
1309
+ 20,
1310
+ 21,
1311
+ 48,
1312
+ 42,
1313
+ 47
1314
+ ],
1315
+ "discard_size": 9,
1316
+ "stock_remaining": 22,
1317
+ "deadwood": [
1318
+ 31,
1319
+ 19
1320
+ ],
1321
+ "model_logits": [
1322
+ -1.8081129789352417,
1323
+ 1.7736397981643677,
1324
+ 2.2659590244293213,
1325
+ 6.230957508087158,
1326
+ -4.518009662628174,
1327
+ 2.9621570110321045,
1328
+ -4.923310279846191,
1329
+ 2.28125262260437,
1330
+ 6.971704483032227,
1331
+ -13.012149810791016,
1332
+ -9.59646987915039,
1333
+ 4.096042633056641,
1334
+ 4.966683387756348,
1335
+ -1.6995298862457275,
1336
+ 0.7542835474014282,
1337
+ 9.218894004821777
1338
+ ],
1339
+ "model_value": 0.5602080821990967,
1340
+ "model_checkpoint": "r42_800M.pkl",
1341
+ "model_architecture": "simba_aux"
1342
+ },
1343
+ {
1344
+ "turn": 21,
1345
+ "player": 0,
1346
+ "phase": "draw",
1347
+ "action": 0,
1348
+ "action_desc": "Human drew from stock",
1349
+ "card_drawn": 19,
1350
+ "hands": [
1351
+ [
1352
+ 16,
1353
+ 3,
1354
+ 31,
1355
+ 6,
1356
+ 5,
1357
+ 15,
1358
+ 17,
1359
+ 27,
1360
+ 30,
1361
+ 13
1362
+ ],
1363
+ [
1364
+ 2,
1365
+ 29,
1366
+ 8,
1367
+ 14,
1368
+ 11,
1369
+ 7,
1370
+ 4,
1371
+ 10,
1372
+ 9,
1373
+ 12
1374
+ ]
1375
+ ],
1376
+ "hand_sizes": [
1377
+ 10,
1378
+ 10
1379
+ ],
1380
+ "discard_pile": [
1381
+ 23,
1382
+ 51,
1383
+ 45,
1384
+ 36,
1385
+ 20,
1386
+ 21,
1387
+ 48,
1388
+ 42,
1389
+ 47,
1390
+ 43
1391
+ ],
1392
+ "discard_size": 10,
1393
+ "stock_remaining": 22,
1394
+ "deadwood": [
1395
+ 31,
1396
+ 14
1397
+ ],
1398
+ "model_logits": null,
1399
+ "model_value": null,
1400
+ "model_checkpoint": "r42_800M.pkl",
1401
+ "model_architecture": "simba_aux"
1402
+ },
1403
+ {
1404
+ "turn": 22,
1405
+ "player": 0,
1406
+ "phase": "discard",
1407
+ "action": 4,
1408
+ "action_desc": "Human discarded 6\u2666",
1409
+ "card_drawn": null,
1410
+ "hands": [
1411
+ [
1412
+ 16,
1413
+ 3,
1414
+ 31,
1415
+ 6,
1416
+ 5,
1417
+ 15,
1418
+ 17,
1419
+ 27,
1420
+ 30,
1421
+ 13,
1422
+ 19
1423
+ ],
1424
+ [
1425
+ 2,
1426
+ 29,
1427
+ 8,
1428
+ 14,
1429
+ 11,
1430
+ 7,
1431
+ 4,
1432
+ 10,
1433
+ 9,
1434
+ 12
1435
+ ]
1436
+ ],
1437
+ "hand_sizes": [
1438
+ 11,
1439
+ 10
1440
+ ],
1441
+ "discard_pile": [
1442
+ 23,
1443
+ 51,
1444
+ 45,
1445
+ 36,
1446
+ 20,
1447
+ 21,
1448
+ 48,
1449
+ 42,
1450
+ 47,
1451
+ 43
1452
+ ],
1453
+ "discard_size": 10,
1454
+ "stock_remaining": 21,
1455
+ "deadwood": [
1456
+ 38,
1457
+ 14
1458
+ ],
1459
+ "model_logits": null,
1460
+ "model_value": null,
1461
+ "model_checkpoint": "r42_800M.pkl",
1462
+ "model_architecture": "simba_aux"
1463
+ },
1464
+ {
1465
+ "turn": 23,
1466
+ "player": 1,
1467
+ "phase": "draw",
1468
+ "action": 0,
1469
+ "action_desc": "Model drew from stock",
1470
+ "card_drawn": 35,
1471
+ "hands": [
1472
+ [
1473
+ 16,
1474
+ 3,
1475
+ 19,
1476
+ 6,
1477
+ 5,
1478
+ 15,
1479
+ 17,
1480
+ 27,
1481
+ 30,
1482
+ 13
1483
+ ],
1484
+ [
1485
+ 2,
1486
+ 29,
1487
+ 8,
1488
+ 14,
1489
+ 11,
1490
+ 7,
1491
+ 4,
1492
+ 10,
1493
+ 9,
1494
+ 12
1495
+ ]
1496
+ ],
1497
+ "hand_sizes": [
1498
+ 10,
1499
+ 10
1500
+ ],
1501
+ "discard_pile": [
1502
+ 23,
1503
+ 51,
1504
+ 45,
1505
+ 36,
1506
+ 20,
1507
+ 21,
1508
+ 48,
1509
+ 42,
1510
+ 47,
1511
+ 43,
1512
+ 31
1513
+ ],
1514
+ "discard_size": 11,
1515
+ "stock_remaining": 21,
1516
+ "deadwood": [
1517
+ 32,
1518
+ 14
1519
+ ],
1520
+ "model_logits": [
1521
+ 3.981086015701294,
1522
+ -4.012567043304443,
1523
+ 1.4224706888198853,
1524
+ 5.855846881866455,
1525
+ -0.14530238509178162,
1526
+ 3.299467086791992,
1527
+ -4.2381696701049805,
1528
+ 2.2106308937072754,
1529
+ 8.822104454040527,
1530
+ -11.091350555419922,
1531
+ -7.793320178985596,
1532
+ 4.890064716339111,
1533
+ -1.1544140577316284,
1534
+ -1.613085150718689,
1535
+ 0.491565078496933,
1536
+ 10.63914680480957
1537
+ ],
1538
+ "model_value": 0.2749723792076111,
1539
+ "model_checkpoint": "r42_800M.pkl",
1540
+ "model_architecture": "simba_aux"
1541
+ },
1542
+ {
1543
+ "turn": 24,
1544
+ "player": 1,
1545
+ "phase": "discard",
1546
+ "action": 12,
1547
+ "action_desc": "Model discarded 10\u2666",
1548
+ "card_drawn": null,
1549
+ "hands": [
1550
+ [
1551
+ 16,
1552
+ 3,
1553
+ 19,
1554
+ 6,
1555
+ 5,
1556
+ 15,
1557
+ 17,
1558
+ 27,
1559
+ 30,
1560
+ 13
1561
+ ],
1562
+ [
1563
+ 2,
1564
+ 29,
1565
+ 8,
1566
+ 14,
1567
+ 11,
1568
+ 7,
1569
+ 4,
1570
+ 10,
1571
+ 9,
1572
+ 12,
1573
+ 35
1574
+ ]
1575
+ ],
1576
+ "hand_sizes": [
1577
+ 10,
1578
+ 11
1579
+ ],
1580
+ "discard_pile": [
1581
+ 23,
1582
+ 51,
1583
+ 45,
1584
+ 36,
1585
+ 20,
1586
+ 21,
1587
+ 48,
1588
+ 42,
1589
+ 47,
1590
+ 43,
1591
+ 31
1592
+ ],
1593
+ "discard_size": 11,
1594
+ "stock_remaining": 20,
1595
+ "deadwood": [
1596
+ 32,
1597
+ 24
1598
+ ],
1599
+ "model_logits": [
1600
+ 3.9433114528656006,
1601
+ -3.976815700531006,
1602
+ 0.3868432343006134,
1603
+ 3.749077796936035,
1604
+ -1.3430635929107666,
1605
+ 2.311318874359131,
1606
+ -5.866939067840576,
1607
+ 1.3649365901947021,
1608
+ 4.520620822906494,
1609
+ -10.834098815917969,
1610
+ -9.302236557006836,
1611
+ 2.196981191635132,
1612
+ 13.273894309997559,
1613
+ 3.1547605991363525,
1614
+ -4.482095718383789,
1615
+ 10.480803489685059
1616
+ ],
1617
+ "model_value": 0.33236491680145264,
1618
+ "model_checkpoint": "r42_800M.pkl",
1619
+ "model_architecture": "simba_aux"
1620
+ },
1621
+ {
1622
+ "turn": 25,
1623
+ "player": 0,
1624
+ "phase": "draw",
1625
+ "action": 0,
1626
+ "action_desc": "Human drew from stock",
1627
+ "card_drawn": 37,
1628
+ "hands": [
1629
+ [
1630
+ 16,
1631
+ 3,
1632
+ 19,
1633
+ 6,
1634
+ 5,
1635
+ 15,
1636
+ 17,
1637
+ 27,
1638
+ 30,
1639
+ 13
1640
+ ],
1641
+ [
1642
+ 2,
1643
+ 29,
1644
+ 8,
1645
+ 14,
1646
+ 11,
1647
+ 7,
1648
+ 4,
1649
+ 10,
1650
+ 9,
1651
+ 12
1652
+ ]
1653
+ ],
1654
+ "hand_sizes": [
1655
+ 10,
1656
+ 10
1657
+ ],
1658
+ "discard_pile": [
1659
+ 23,
1660
+ 51,
1661
+ 45,
1662
+ 36,
1663
+ 20,
1664
+ 21,
1665
+ 48,
1666
+ 42,
1667
+ 47,
1668
+ 43,
1669
+ 31,
1670
+ 35
1671
+ ],
1672
+ "discard_size": 12,
1673
+ "stock_remaining": 20,
1674
+ "deadwood": [
1675
+ 32,
1676
+ 14
1677
+ ],
1678
+ "model_logits": null,
1679
+ "model_value": null,
1680
+ "model_checkpoint": "r42_800M.pkl",
1681
+ "model_architecture": "simba_aux"
1682
+ },
1683
+ {
1684
+ "turn": 26,
1685
+ "player": 0,
1686
+ "phase": "discard",
1687
+ "action": 12,
1688
+ "action_desc": "Human discarded Q\u2666",
1689
+ "card_drawn": null,
1690
+ "hands": [
1691
+ [
1692
+ 16,
1693
+ 3,
1694
+ 19,
1695
+ 6,
1696
+ 5,
1697
+ 15,
1698
+ 17,
1699
+ 27,
1700
+ 30,
1701
+ 13,
1702
+ 37
1703
+ ],
1704
+ [
1705
+ 2,
1706
+ 29,
1707
+ 8,
1708
+ 14,
1709
+ 11,
1710
+ 7,
1711
+ 4,
1712
+ 10,
1713
+ 9,
1714
+ 12
1715
+ ]
1716
+ ],
1717
+ "hand_sizes": [
1718
+ 11,
1719
+ 10
1720
+ ],
1721
+ "discard_pile": [
1722
+ 23,
1723
+ 51,
1724
+ 45,
1725
+ 36,
1726
+ 20,
1727
+ 21,
1728
+ 48,
1729
+ 42,
1730
+ 47,
1731
+ 43,
1732
+ 31,
1733
+ 35
1734
+ ],
1735
+ "discard_size": 12,
1736
+ "stock_remaining": 19,
1737
+ "deadwood": [
1738
+ 42,
1739
+ 14
1740
+ ],
1741
+ "model_logits": null,
1742
+ "model_value": null,
1743
+ "model_checkpoint": "r42_800M.pkl",
1744
+ "model_architecture": "simba_aux"
1745
+ },
1746
+ {
1747
+ "turn": 27,
1748
+ "player": 1,
1749
+ "phase": "draw",
1750
+ "action": 0,
1751
+ "action_desc": "Model drew from stock",
1752
+ "card_drawn": 44,
1753
+ "hands": [
1754
+ [
1755
+ 16,
1756
+ 3,
1757
+ 19,
1758
+ 6,
1759
+ 5,
1760
+ 15,
1761
+ 17,
1762
+ 27,
1763
+ 30,
1764
+ 13
1765
+ ],
1766
+ [
1767
+ 2,
1768
+ 29,
1769
+ 8,
1770
+ 14,
1771
+ 11,
1772
+ 7,
1773
+ 4,
1774
+ 10,
1775
+ 9,
1776
+ 12
1777
+ ]
1778
+ ],
1779
+ "hand_sizes": [
1780
+ 10,
1781
+ 10
1782
+ ],
1783
+ "discard_pile": [
1784
+ 23,
1785
+ 51,
1786
+ 45,
1787
+ 36,
1788
+ 20,
1789
+ 21,
1790
+ 48,
1791
+ 42,
1792
+ 47,
1793
+ 43,
1794
+ 31,
1795
+ 35,
1796
+ 37
1797
+ ],
1798
+ "discard_size": 13,
1799
+ "stock_remaining": 19,
1800
+ "deadwood": [
1801
+ 32,
1802
+ 14
1803
+ ],
1804
+ "model_logits": [
1805
+ 4.104899883270264,
1806
+ -4.123440742492676,
1807
+ 1.151823878288269,
1808
+ 6.2424845695495605,
1809
+ -0.7818430662155151,
1810
+ 1.3664721250534058,
1811
+ -3.36592698097229,
1812
+ 1.64742112159729,
1813
+ 8.030861854553223,
1814
+ -9.514984130859375,
1815
+ -7.078109264373779,
1816
+ 4.604839324951172,
1817
+ -0.653775155544281,
1818
+ -0.8946107029914856,
1819
+ -0.35482338070869446,
1820
+ 11.643874168395996
1821
+ ],
1822
+ "model_value": 0.33271682262420654,
1823
+ "model_checkpoint": "r42_800M.pkl",
1824
+ "model_architecture": "simba_aux"
1825
+ },
1826
+ {
1827
+ "turn": 28,
1828
+ "player": 1,
1829
+ "phase": "discard",
1830
+ "action": 12,
1831
+ "action_desc": "Model discarded 6\u2663",
1832
+ "card_drawn": null,
1833
+ "hands": [
1834
+ [
1835
+ 16,
1836
+ 3,
1837
+ 19,
1838
+ 6,
1839
+ 5,
1840
+ 15,
1841
+ 17,
1842
+ 27,
1843
+ 30,
1844
+ 13
1845
+ ],
1846
+ [
1847
+ 2,
1848
+ 29,
1849
+ 8,
1850
+ 14,
1851
+ 11,
1852
+ 7,
1853
+ 4,
1854
+ 10,
1855
+ 9,
1856
+ 12,
1857
+ 44
1858
+ ]
1859
+ ],
1860
+ "hand_sizes": [
1861
+ 10,
1862
+ 11
1863
+ ],
1864
+ "discard_pile": [
1865
+ 23,
1866
+ 51,
1867
+ 45,
1868
+ 36,
1869
+ 20,
1870
+ 21,
1871
+ 48,
1872
+ 42,
1873
+ 47,
1874
+ 43,
1875
+ 31,
1876
+ 35,
1877
+ 37
1878
+ ],
1879
+ "discard_size": 13,
1880
+ "stock_remaining": 18,
1881
+ "deadwood": [
1882
+ 32,
1883
+ 20
1884
+ ],
1885
+ "model_logits": [
1886
+ 3.6289353370666504,
1887
+ -3.651562452316284,
1888
+ 0.8955609798431396,
1889
+ 5.640666484832764,
1890
+ -1.6406002044677734,
1891
+ 1.7489086389541626,
1892
+ -4.630945205688477,
1893
+ 1.221543788909912,
1894
+ 5.193866729736328,
1895
+ -12.430208206176758,
1896
+ -8.844193458557129,
1897
+ 3.4088165760040283,
1898
+ 9.342318534851074,
1899
+ 2.644631862640381,
1900
+ -3.967137098312378,
1901
+ 10.869965553283691
1902
+ ],
1903
+ "model_value": 0.4112440049648285,
1904
+ "model_checkpoint": "r42_800M.pkl",
1905
+ "model_architecture": "simba_aux"
1906
+ },
1907
+ {
1908
+ "turn": 29,
1909
+ "player": 0,
1910
+ "phase": "draw",
1911
+ "action": 0,
1912
+ "action_desc": "Human drew from stock",
1913
+ "card_drawn": 50,
1914
+ "hands": [
1915
+ [
1916
+ 16,
1917
+ 3,
1918
+ 19,
1919
+ 6,
1920
+ 5,
1921
+ 15,
1922
+ 17,
1923
+ 27,
1924
+ 30,
1925
+ 13
1926
+ ],
1927
+ [
1928
+ 2,
1929
+ 29,
1930
+ 8,
1931
+ 14,
1932
+ 11,
1933
+ 7,
1934
+ 4,
1935
+ 10,
1936
+ 9,
1937
+ 12
1938
+ ]
1939
+ ],
1940
+ "hand_sizes": [
1941
+ 10,
1942
+ 10
1943
+ ],
1944
+ "discard_pile": [
1945
+ 23,
1946
+ 51,
1947
+ 45,
1948
+ 36,
1949
+ 20,
1950
+ 21,
1951
+ 48,
1952
+ 42,
1953
+ 47,
1954
+ 43,
1955
+ 31,
1956
+ 35,
1957
+ 37,
1958
+ 44
1959
+ ],
1960
+ "discard_size": 14,
1961
+ "stock_remaining": 18,
1962
+ "deadwood": [
1963
+ 32,
1964
+ 14
1965
+ ],
1966
+ "model_logits": null,
1967
+ "model_value": null,
1968
+ "model_checkpoint": "r42_800M.pkl",
1969
+ "model_architecture": "simba_aux"
1970
+ },
1971
+ {
1972
+ "turn": 30,
1973
+ "player": 0,
1974
+ "phase": "discard",
1975
+ "action": 12,
1976
+ "action_desc": "Human discarded Q\u2663",
1977
+ "card_drawn": null,
1978
+ "hands": [
1979
+ [
1980
+ 16,
1981
+ 3,
1982
+ 19,
1983
+ 6,
1984
+ 5,
1985
+ 15,
1986
+ 17,
1987
+ 27,
1988
+ 30,
1989
+ 13,
1990
+ 50
1991
+ ],
1992
+ [
1993
+ 2,
1994
+ 29,
1995
+ 8,
1996
+ 14,
1997
+ 11,
1998
+ 7,
1999
+ 4,
2000
+ 10,
2001
+ 9,
2002
+ 12
2003
+ ]
2004
+ ],
2005
+ "hand_sizes": [
2006
+ 11,
2007
+ 10
2008
+ ],
2009
+ "discard_pile": [
2010
+ 23,
2011
+ 51,
2012
+ 45,
2013
+ 36,
2014
+ 20,
2015
+ 21,
2016
+ 48,
2017
+ 42,
2018
+ 47,
2019
+ 43,
2020
+ 31,
2021
+ 35,
2022
+ 37,
2023
+ 44
2024
+ ],
2025
+ "discard_size": 14,
2026
+ "stock_remaining": 17,
2027
+ "deadwood": [
2028
+ 42,
2029
+ 14
2030
+ ],
2031
+ "model_logits": null,
2032
+ "model_value": null,
2033
+ "model_checkpoint": "r42_800M.pkl",
2034
+ "model_architecture": "simba_aux"
2035
+ },
2036
+ {
2037
+ "turn": 31,
2038
+ "player": 1,
2039
+ "phase": "draw",
2040
+ "action": 0,
2041
+ "action_desc": "Model drew from stock",
2042
+ "card_drawn": 26,
2043
+ "hands": [
2044
+ [
2045
+ 16,
2046
+ 3,
2047
+ 19,
2048
+ 6,
2049
+ 5,
2050
+ 15,
2051
+ 17,
2052
+ 27,
2053
+ 30,
2054
+ 13
2055
+ ],
2056
+ [
2057
+ 2,
2058
+ 29,
2059
+ 8,
2060
+ 14,
2061
+ 11,
2062
+ 7,
2063
+ 4,
2064
+ 10,
2065
+ 9,
2066
+ 12
2067
+ ]
2068
+ ],
2069
+ "hand_sizes": [
2070
+ 10,
2071
+ 10
2072
+ ],
2073
+ "discard_pile": [
2074
+ 23,
2075
+ 51,
2076
+ 45,
2077
+ 36,
2078
+ 20,
2079
+ 21,
2080
+ 48,
2081
+ 42,
2082
+ 47,
2083
+ 43,
2084
+ 31,
2085
+ 35,
2086
+ 37,
2087
+ 44,
2088
+ 50
2089
+ ],
2090
+ "discard_size": 15,
2091
+ "stock_remaining": 17,
2092
+ "deadwood": [
2093
+ 32,
2094
+ 14
2095
+ ],
2096
+ "model_logits": [
2097
+ 4.819906711578369,
2098
+ -4.840890407562256,
2099
+ 1.950372576713562,
2100
+ 4.808573246002197,
2101
+ -1.167507290840149,
2102
+ 3.1777474880218506,
2103
+ -2.1616132259368896,
2104
+ 1.547547459602356,
2105
+ 7.936022758483887,
2106
+ -8.644107818603516,
2107
+ -7.364597797393799,
2108
+ 3.904141664505005,
2109
+ -1.6801589727401733,
2110
+ -1.2845038175582886,
2111
+ 0.061372943222522736,
2112
+ 11.340662956237793
2113
+ ],
2114
+ "model_value": 0.32115638256073,
2115
+ "model_checkpoint": "r42_800M.pkl",
2116
+ "model_architecture": "simba_aux"
2117
+ },
2118
+ {
2119
+ "turn": 32,
2120
+ "player": 1,
2121
+ "phase": "discard",
2122
+ "action": 8,
2123
+ "action_desc": "Model discarded 5\u2660",
2124
+ "card_drawn": null,
2125
+ "hands": [
2126
+ [
2127
+ 16,
2128
+ 3,
2129
+ 19,
2130
+ 6,
2131
+ 5,
2132
+ 15,
2133
+ 17,
2134
+ 27,
2135
+ 30,
2136
+ 13
2137
+ ],
2138
+ [
2139
+ 2,
2140
+ 29,
2141
+ 8,
2142
+ 14,
2143
+ 11,
2144
+ 7,
2145
+ 4,
2146
+ 10,
2147
+ 9,
2148
+ 12,
2149
+ 26
2150
+ ]
2151
+ ],
2152
+ "hand_sizes": [
2153
+ 10,
2154
+ 11
2155
+ ],
2156
+ "discard_pile": [
2157
+ 23,
2158
+ 51,
2159
+ 45,
2160
+ 36,
2161
+ 20,
2162
+ 21,
2163
+ 48,
2164
+ 42,
2165
+ 47,
2166
+ 43,
2167
+ 31,
2168
+ 35,
2169
+ 37,
2170
+ 44,
2171
+ 50
2172
+ ],
2173
+ "discard_size": 15,
2174
+ "stock_remaining": 16,
2175
+ "deadwood": [
2176
+ 32,
2177
+ 15
2178
+ ],
2179
+ "model_logits": [
2180
+ 1.896209716796875,
2181
+ -1.92103910446167,
2182
+ 4.436880111694336,
2183
+ 7.893919944763184,
2184
+ -3.3175177574157715,
2185
+ 5.695739269256592,
2186
+ -3.99066162109375,
2187
+ 2.0932838916778564,
2188
+ 11.61303997039795,
2189
+ -14.227922439575195,
2190
+ -11.156967163085938,
2191
+ 4.792416095733643,
2192
+ -3.593684673309326,
2193
+ -2.819870710372925,
2194
+ 2.2497639656066895,
2195
+ 5.461748123168945
2196
+ ],
2197
+ "model_value": 0.4467967748641968,
2198
+ "model_checkpoint": "r42_800M.pkl",
2199
+ "model_architecture": "simba_aux"
2200
+ },
2201
+ {
2202
+ "turn": 33,
2203
+ "player": 1,
2204
+ "phase": "knock_decision",
2205
+ "action": 14,
2206
+ "action_desc": "Model knocked",
2207
+ "card_drawn": null,
2208
+ "hands": [
2209
+ [
2210
+ 16,
2211
+ 3,
2212
+ 19,
2213
+ 6,
2214
+ 5,
2215
+ 15,
2216
+ 17,
2217
+ 27,
2218
+ 30,
2219
+ 13
2220
+ ],
2221
+ [
2222
+ 2,
2223
+ 29,
2224
+ 8,
2225
+ 14,
2226
+ 11,
2227
+ 7,
2228
+ 26,
2229
+ 10,
2230
+ 9,
2231
+ 12
2232
+ ]
2233
+ ],
2234
+ "hand_sizes": [
2235
+ 10,
2236
+ 10
2237
+ ],
2238
+ "discard_pile": [
2239
+ 23,
2240
+ 51,
2241
+ 45,
2242
+ 36,
2243
+ 20,
2244
+ 21,
2245
+ 48,
2246
+ 42,
2247
+ 47,
2248
+ 43,
2249
+ 31,
2250
+ 35,
2251
+ 37,
2252
+ 44,
2253
+ 50,
2254
+ 4
2255
+ ],
2256
+ "discard_size": 16,
2257
+ "stock_remaining": 16,
2258
+ "deadwood": [
2259
+ 32,
2260
+ 10
2261
+ ],
2262
+ "model_logits": [
2263
+ 3.515892505645752,
2264
+ -3.549199104309082,
2265
+ 4.966958999633789,
2266
+ 6.365931034088135,
2267
+ -2.8421521186828613,
2268
+ 1.4564769268035889,
2269
+ -2.213350772857666,
2270
+ 1.7686691284179688,
2271
+ 4.953024387359619,
2272
+ -8.77511978149414,
2273
+ -8.378056526184082,
2274
+ 5.679566383361816,
2275
+ -0.9104614853858948,
2276
+ -7.4730939865112305,
2277
+ 6.40046501159668,
2278
+ 9.804292678833008
2279
+ ],
2280
+ "model_value": 0.2832145392894745,
2281
+ "model_checkpoint": "r42_800M.pkl",
2282
+ "model_architecture": "simba_aux"
2283
+ },
2284
+ {
2285
+ "turn": 34,
2286
+ "player": -1,
2287
+ "phase": "game_over",
2288
+ "action": -1,
2289
+ "action_desc": "Game over.",
2290
+ "card_drawn": null,
2291
+ "hands": [
2292
+ [
2293
+ 16,
2294
+ 3,
2295
+ 19,
2296
+ 6,
2297
+ 5,
2298
+ 15,
2299
+ 17,
2300
+ 27,
2301
+ 30,
2302
+ 13
2303
+ ],
2304
+ [
2305
+ 2,
2306
+ 29,
2307
+ 8,
2308
+ 14,
2309
+ 11,
2310
+ 7,
2311
+ 26,
2312
+ 10,
2313
+ 9,
2314
+ 12
2315
+ ]
2316
+ ],
2317
+ "hand_sizes": [
2318
+ 10,
2319
+ 10
2320
+ ],
2321
+ "discard_pile": [
2322
+ 23,
2323
+ 51,
2324
+ 45,
2325
+ 36,
2326
+ 20,
2327
+ 21,
2328
+ 48,
2329
+ 42,
2330
+ 47,
2331
+ 43,
2332
+ 31,
2333
+ 35,
2334
+ 37,
2335
+ 44,
2336
+ 50,
2337
+ 4
2338
+ ],
2339
+ "discard_size": 16,
2340
+ "stock_remaining": 16,
2341
+ "deadwood": [
2342
+ 32,
2343
+ 10
2344
+ ],
2345
+ "model_logits": null,
2346
+ "model_value": null,
2347
+ "model_checkpoint": "r42_800M.pkl",
2348
+ "model_architecture": "simba_aux"
2349
+ }
2350
+ ]
2351
+ }