Alex-GSL committed on
Commit 5797076 · verified · 1 Parent(s): f6a2980

Upload human_games/game_20260325_183417_10.json with huggingface_hub

human_games/game_20260325_183417_10.json ADDED
@@ -0,0 +1,1689 @@
1
+ {
2
+ "game_number": 10,
3
+ "timestamp": "2026-03-25T18:34:17.013623+00:00",
4
+ "checkpoint": "checkpoints/r42_1500M.pkl",
5
+ "architecture": "simba_aux",
6
+ "model_checkpoint": "r42_1500M.pkl",
7
+ "model_architecture": "simba_aux",
8
+ "result": "human_win",
9
+ "final_scores": [
10
+ 22,
11
+ 0
12
+ ],
13
+ "total_turns": 12,
14
+ "actions": [
15
+ {
16
+ "turn": 0,
17
+ "player": -1,
18
+ "phase": "draw",
19
+ "action": -1,
20
+ "action_desc": "Initial deal",
21
+ "card_drawn": null,
22
+ "hands": [
23
+ [
24
+ 19,
25
+ 49,
26
+ 13,
27
+ 36,
28
+ 51,
29
+ 31,
30
+ 12,
31
+ 16,
32
+ 29,
33
+ 47
34
+ ],
35
+ [
36
+ 10,
37
+ 2,
38
+ 18,
39
+ 33,
40
+ 32,
41
+ 37,
42
+ 28,
43
+ 23,
44
+ 50,
45
+ 40
46
+ ]
47
+ ],
48
+ "hand_sizes": [
49
+ 10,
50
+ 10
51
+ ],
52
+ "discard_pile": [
53
+ 0
54
+ ],
55
+ "discard_size": 1,
56
+ "stock_remaining": 31,
57
+ "deadwood": [
58
+ 71,
59
+ 69
60
+ ],
61
+ "model_logits": null,
62
+ "model_value": null,
63
+ "model_checkpoint": "r42_1500M.pkl",
64
+ "model_architecture": "simba_aux"
65
+ },
66
+ {
67
+ "turn": 1,
68
+ "player": 1,
69
+ "phase": "draw",
70
+ "action": 1,
71
+ "action_desc": "Model drew A\u2660 from discard",
72
+ "card_drawn": 0,
73
+ "hands": [
74
+ [
75
+ 19,
76
+ 49,
77
+ 13,
78
+ 36,
79
+ 51,
80
+ 31,
81
+ 12,
82
+ 16,
83
+ 29,
84
+ 47
85
+ ],
86
+ [
87
+ 10,
88
+ 2,
89
+ 18,
90
+ 33,
91
+ 32,
92
+ 37,
93
+ 28,
94
+ 23,
95
+ 50,
96
+ 40
97
+ ]
98
+ ],
99
+ "hand_sizes": [
100
+ 10,
101
+ 10
102
+ ],
103
+ "discard_pile": [
104
+ 0
105
+ ],
106
+ "discard_size": 1,
107
+ "stock_remaining": 31,
108
+ "deadwood": [
109
+ 71,
110
+ 69
111
+ ],
112
+ "model_logits": [
113
+ -4.204713344573975,
114
+ 4.202707767486572,
115
+ -2.310465097427368,
116
+ 4.854950904846191,
117
+ 3.756103754043579,
118
+ -5.19459867477417,
119
+ -1.9763543605804443,
120
+ -5.793529987335205,
121
+ 0.5632207989692688,
122
+ -5.028426170349121,
123
+ -1.4685839414596558,
124
+ 4.584095478057861,
125
+ 0.47564131021499634,
126
+ -0.36451399326324463,
127
+ -1.1208947896957397,
128
+ 14.265426635742188
129
+ ],
130
+ "model_value": 0.27704665064811707,
131
+ "model_checkpoint": "r42_1500M.pkl",
132
+ "model_architecture": "simba_aux"
133
+ },
134
+ {
135
+ "turn": 2,
136
+ "player": 1,
137
+ "phase": "discard",
138
+ "action": 4,
139
+ "action_desc": "Model discarded 6\u2665",
140
+ "card_drawn": null,
141
+ "hands": [
142
+ [
143
+ 19,
144
+ 49,
145
+ 13,
146
+ 36,
147
+ 51,
148
+ 31,
149
+ 12,
150
+ 16,
151
+ 29,
152
+ 47
153
+ ],
154
+ [
155
+ 10,
156
+ 2,
157
+ 18,
158
+ 33,
159
+ 32,
160
+ 37,
161
+ 28,
162
+ 23,
163
+ 50,
164
+ 40,
165
+ 0
166
+ ]
167
+ ],
168
+ "hand_sizes": [
169
+ 10,
170
+ 11
171
+ ],
172
+ "discard_pile": [],
173
+ "discard_size": 0,
174
+ "stock_remaining": 31,
175
+ "deadwood": [
176
+ 71,
177
+ 70
178
+ ],
179
+ "model_logits": [
180
+ -2.729828357696533,
181
+ 2.7276227474212646,
182
+ -8.015610694885254,
183
+ 4.178175449371338,
184
+ 5.576138496398926,
185
+ -1.7288856506347656,
186
+ -2.9260053634643555,
187
+ -3.4503698348999023,
188
+ 1.3668266534805298,
189
+ -6.552676200866699,
190
+ -3.451054811477661,
191
+ 1.4264678955078125,
192
+ 4.736198902130127,
193
+ 2.440730094909668,
194
+ -3.8085052967071533,
195
+ 12.333610534667969
196
+ ],
197
+ "model_value": 0.3482803702354431,
198
+ "model_checkpoint": "r42_1500M.pkl",
199
+ "model_architecture": "simba_aux"
200
+ },
201
+ {
202
+ "turn": 3,
203
+ "player": 0,
204
+ "phase": "draw",
205
+ "action": 0,
206
+ "action_desc": "Human drew from stock",
207
+ "card_drawn": 22,
208
+ "hands": [
209
+ [
210
+ 19,
211
+ 49,
212
+ 13,
213
+ 36,
214
+ 51,
215
+ 31,
216
+ 12,
217
+ 16,
218
+ 29,
219
+ 47
220
+ ],
221
+ [
222
+ 10,
223
+ 2,
224
+ 0,
225
+ 33,
226
+ 32,
227
+ 37,
228
+ 28,
229
+ 23,
230
+ 50,
231
+ 40
232
+ ]
233
+ ],
234
+ "hand_sizes": [
235
+ 10,
236
+ 10
237
+ ],
238
+ "discard_pile": [
239
+ 18
240
+ ],
241
+ "discard_size": 1,
242
+ "stock_remaining": 31,
243
+ "deadwood": [
244
+ 71,
245
+ 64
246
+ ],
247
+ "model_logits": null,
248
+ "model_value": null,
249
+ "model_checkpoint": "r42_1500M.pkl",
250
+ "model_architecture": "simba_aux"
251
+ },
252
+ {
253
+ "turn": 4,
254
+ "player": 0,
255
+ "phase": "discard",
256
+ "action": 12,
257
+ "action_desc": "Human discarded 10\u2665",
258
+ "card_drawn": null,
259
+ "hands": [
260
+ [
261
+ 19,
262
+ 49,
263
+ 13,
264
+ 36,
265
+ 51,
266
+ 31,
267
+ 12,
268
+ 16,
269
+ 29,
270
+ 47,
271
+ 22
272
+ ],
273
+ [
274
+ 10,
275
+ 2,
276
+ 0,
277
+ 33,
278
+ 32,
279
+ 37,
280
+ 28,
281
+ 23,
282
+ 50,
283
+ 40
284
+ ]
285
+ ],
286
+ "hand_sizes": [
287
+ 11,
288
+ 10
289
+ ],
290
+ "discard_pile": [
291
+ 18
292
+ ],
293
+ "discard_size": 1,
294
+ "stock_remaining": 30,
295
+ "deadwood": [
296
+ 81,
297
+ 64
298
+ ],
299
+ "model_logits": null,
300
+ "model_value": null,
301
+ "model_checkpoint": "r42_1500M.pkl",
302
+ "model_architecture": "simba_aux"
303
+ },
304
+ {
305
+ "turn": 5,
306
+ "player": 1,
307
+ "phase": "draw",
308
+ "action": 0,
309
+ "action_desc": "Model drew from stock",
310
+ "card_drawn": 25,
311
+ "hands": [
312
+ [
313
+ 19,
314
+ 49,
315
+ 13,
316
+ 36,
317
+ 51,
318
+ 31,
319
+ 12,
320
+ 16,
321
+ 29,
322
+ 47
323
+ ],
324
+ [
325
+ 10,
326
+ 2,
327
+ 0,
328
+ 33,
329
+ 32,
330
+ 37,
331
+ 28,
332
+ 23,
333
+ 50,
334
+ 40
335
+ ]
336
+ ],
337
+ "hand_sizes": [
338
+ 10,
339
+ 10
340
+ ],
341
+ "discard_pile": [
342
+ 18,
343
+ 22
344
+ ],
345
+ "discard_size": 2,
346
+ "stock_remaining": 30,
347
+ "deadwood": [
348
+ 71,
349
+ 64
350
+ ],
351
+ "model_logits": [
352
+ 1.28843355178833,
353
+ -1.283196210861206,
354
+ -3.4003169536590576,
355
+ 4.307423114776611,
356
+ 3.8656222820281982,
357
+ -6.291551113128662,
358
+ -0.8891344666481018,
359
+ -5.022740840911865,
360
+ 1.2295198440551758,
361
+ -0.903852641582489,
362
+ -2.397982358932495,
363
+ 1.7229371070861816,
364
+ 2.7869319915771484,
365
+ 4.126954078674316,
366
+ -5.77133846282959,
367
+ 14.5359525680542
368
+ ],
369
+ "model_value": 0.30744805932044983,
370
+ "model_checkpoint": "r42_1500M.pkl",
371
+ "model_architecture": "simba_aux"
372
+ },
373
+ {
374
+ "turn": 6,
375
+ "player": 1,
376
+ "phase": "discard",
377
+ "action": 4,
378
+ "action_desc": "Model discarded A\u2660",
379
+ "card_drawn": null,
380
+ "hands": [
381
+ [
382
+ 19,
383
+ 49,
384
+ 13,
385
+ 36,
386
+ 51,
387
+ 31,
388
+ 12,
389
+ 16,
390
+ 29,
391
+ 47
392
+ ],
393
+ [
394
+ 10,
395
+ 2,
396
+ 0,
397
+ 33,
398
+ 32,
399
+ 37,
400
+ 28,
401
+ 23,
402
+ 50,
403
+ 40,
404
+ 25
405
+ ]
406
+ ],
407
+ "hand_sizes": [
408
+ 10,
409
+ 11
410
+ ],
411
+ "discard_pile": [
412
+ 18,
413
+ 22
414
+ ],
415
+ "discard_size": 2,
416
+ "stock_remaining": 29,
417
+ "deadwood": [
418
+ 71,
419
+ 74
420
+ ],
421
+ "model_logits": [
422
+ -1.6849180459976196,
423
+ 1.6906808614730835,
424
+ -9.307866096496582,
425
+ 4.059427261352539,
426
+ 6.459877967834473,
427
+ -0.8457171320915222,
428
+ 1.8903998136520386,
429
+ -6.02644681930542,
430
+ 2.6427831649780273,
431
+ -3.1981561183929443,
432
+ -8.14437484741211,
433
+ 0.9968225359916687,
434
+ 3.0961782932281494,
435
+ 5.619723796844482,
436
+ -6.8304219245910645,
437
+ 10.066866874694824
438
+ ],
439
+ "model_value": 0.4029783606529236,
440
+ "model_checkpoint": "r42_1500M.pkl",
441
+ "model_architecture": "simba_aux"
442
+ },
443
+ {
444
+ "turn": 7,
445
+ "player": 0,
446
+ "phase": "draw",
447
+ "action": 1,
448
+ "action_desc": "Human drew A\u2660 from discard",
449
+ "card_drawn": 0,
450
+ "hands": [
451
+ [
452
+ 19,
453
+ 49,
454
+ 13,
455
+ 36,
456
+ 51,
457
+ 31,
458
+ 12,
459
+ 16,
460
+ 29,
461
+ 47
462
+ ],
463
+ [
464
+ 10,
465
+ 2,
466
+ 25,
467
+ 33,
468
+ 32,
469
+ 37,
470
+ 28,
471
+ 23,
472
+ 50,
473
+ 40
474
+ ]
475
+ ],
476
+ "hand_sizes": [
477
+ 10,
478
+ 10
479
+ ],
480
+ "discard_pile": [
481
+ 18,
482
+ 22,
483
+ 0
484
+ ],
485
+ "discard_size": 3,
486
+ "stock_remaining": 29,
487
+ "deadwood": [
488
+ 71,
489
+ 73
490
+ ],
491
+ "model_logits": null,
492
+ "model_value": null,
493
+ "model_checkpoint": "r42_1500M.pkl",
494
+ "model_architecture": "simba_aux"
495
+ },
496
+ {
497
+ "turn": 8,
498
+ "player": 0,
499
+ "phase": "discard",
500
+ "action": 2,
501
+ "action_desc": "Human discarded 7\u2665",
502
+ "card_drawn": null,
503
+ "hands": [
504
+ [
505
+ 19,
506
+ 49,
507
+ 13,
508
+ 36,
509
+ 51,
510
+ 31,
511
+ 12,
512
+ 16,
513
+ 29,
514
+ 47,
515
+ 0
516
+ ],
517
+ [
518
+ 10,
519
+ 2,
520
+ 25,
521
+ 33,
522
+ 32,
523
+ 37,
524
+ 28,
525
+ 23,
526
+ 50,
527
+ 40
528
+ ]
529
+ ],
530
+ "hand_sizes": [
531
+ 11,
532
+ 10
533
+ ],
534
+ "discard_pile": [
535
+ 18,
536
+ 22
537
+ ],
538
+ "discard_size": 2,
539
+ "stock_remaining": 29,
540
+ "deadwood": [
541
+ 72,
542
+ 73
543
+ ],
544
+ "model_logits": null,
545
+ "model_value": null,
546
+ "model_checkpoint": "r42_1500M.pkl",
547
+ "model_architecture": "simba_aux"
548
+ },
549
+ {
550
+ "turn": 9,
551
+ "player": 1,
552
+ "phase": "draw",
553
+ "action": 0,
554
+ "action_desc": "Model drew from stock",
555
+ "card_drawn": 17,
556
+ "hands": [
557
+ [
558
+ 0,
559
+ 49,
560
+ 13,
561
+ 36,
562
+ 51,
563
+ 31,
564
+ 12,
565
+ 16,
566
+ 29,
567
+ 47
568
+ ],
569
+ [
570
+ 10,
571
+ 2,
572
+ 25,
573
+ 33,
574
+ 32,
575
+ 37,
576
+ 28,
577
+ 23,
578
+ 50,
579
+ 40
580
+ ]
581
+ ],
582
+ "hand_sizes": [
583
+ 10,
584
+ 10
585
+ ],
586
+ "discard_pile": [
587
+ 18,
588
+ 22,
589
+ 19
590
+ ],
591
+ "discard_size": 3,
592
+ "stock_remaining": 29,
593
+ "deadwood": [
594
+ 65,
595
+ 73
596
+ ],
597
+ "model_logits": [
598
+ 6.784142017364502,
599
+ -6.78000545501709,
600
+ -0.07487382739782333,
601
+ -0.23603412508964539,
602
+ 4.683076858520508,
603
+ -4.5469231605529785,
604
+ 1.4678372144699097,
605
+ -0.5378774404525757,
606
+ -0.09563322365283966,
607
+ 1.0073424577713013,
608
+ 2.6383464336395264,
609
+ 0.1579200178384781,
610
+ -3.7847695350646973,
611
+ 3.7296104431152344,
612
+ -5.146703720092773,
613
+ 11.977681159973145
614
+ ],
615
+ "model_value": 0.06379446387290955,
616
+ "model_checkpoint": "r42_1500M.pkl",
617
+ "model_architecture": "simba_aux"
618
+ },
619
+ {
620
+ "turn": 10,
621
+ "player": 1,
622
+ "phase": "discard",
623
+ "action": 4,
624
+ "action_desc": "Model discarded K\u2665",
625
+ "card_drawn": null,
626
+ "hands": [
627
+ [
628
+ 0,
629
+ 49,
630
+ 13,
631
+ 36,
632
+ 51,
633
+ 31,
634
+ 12,
635
+ 16,
636
+ 29,
637
+ 47
638
+ ],
639
+ [
640
+ 10,
641
+ 2,
642
+ 25,
643
+ 33,
644
+ 32,
645
+ 37,
646
+ 28,
647
+ 23,
648
+ 50,
649
+ 40,
650
+ 17
651
+ ]
652
+ ],
653
+ "hand_sizes": [
654
+ 10,
655
+ 11
656
+ ],
657
+ "discard_pile": [
658
+ 18,
659
+ 22,
660
+ 19
661
+ ],
662
+ "discard_size": 3,
663
+ "stock_remaining": 28,
664
+ "deadwood": [
665
+ 65,
666
+ 78
667
+ ],
668
+ "model_logits": [
669
+ 1.684059500694275,
670
+ -1.6716012954711914,
671
+ -3.549666166305542,
672
+ -0.013510639779269695,
673
+ 5.818551063537598,
674
+ -5.553445339202881,
675
+ 0.9326134920120239,
676
+ 0.1367407888174057,
677
+ 0.12360849976539612,
678
+ 2.2881991863250732,
679
+ -0.7820842266082764,
680
+ -2.437870502471924,
681
+ -0.8534119725227356,
682
+ 6.776289939880371,
683
+ -7.551581859588623,
684
+ 6.002501964569092
685
+ ],
686
+ "model_value": 0.025557851418852806,
687
+ "model_checkpoint": "r42_1500M.pkl",
688
+ "model_architecture": "simba_aux"
689
+ },
690
+ {
691
+ "turn": 11,
692
+ "player": 0,
693
+ "phase": "draw",
694
+ "action": 1,
695
+ "action_desc": "Human drew K\u2665 from discard",
696
+ "card_drawn": 25,
697
+ "hands": [
698
+ [
699
+ 0,
700
+ 49,
701
+ 13,
702
+ 36,
703
+ 51,
704
+ 31,
705
+ 12,
706
+ 16,
707
+ 29,
708
+ 47
709
+ ],
710
+ [
711
+ 10,
712
+ 2,
713
+ 17,
714
+ 33,
715
+ 32,
716
+ 37,
717
+ 28,
718
+ 23,
719
+ 50,
720
+ 40
721
+ ]
722
+ ],
723
+ "hand_sizes": [
724
+ 10,
725
+ 10
726
+ ],
727
+ "discard_pile": [
728
+ 18,
729
+ 22,
730
+ 19,
731
+ 25
732
+ ],
733
+ "discard_size": 4,
734
+ "stock_remaining": 28,
735
+ "deadwood": [
736
+ 65,
737
+ 68
738
+ ],
739
+ "model_logits": null,
740
+ "model_value": null,
741
+ "model_checkpoint": "r42_1500M.pkl",
742
+ "model_architecture": "simba_aux"
743
+ },
744
+ {
745
+ "turn": 12,
746
+ "player": 0,
747
+ "phase": "discard",
748
+ "action": 7,
749
+ "action_desc": "Human discarded 6\u2666",
750
+ "card_drawn": null,
751
+ "hands": [
752
+ [
753
+ 0,
754
+ 49,
755
+ 13,
756
+ 36,
757
+ 51,
758
+ 31,
759
+ 12,
760
+ 16,
761
+ 29,
762
+ 47,
763
+ 25
764
+ ],
765
+ [
766
+ 10,
767
+ 2,
768
+ 17,
769
+ 33,
770
+ 32,
771
+ 37,
772
+ 28,
773
+ 23,
774
+ 50,
775
+ 40
776
+ ]
777
+ ],
778
+ "hand_sizes": [
779
+ 11,
780
+ 10
781
+ ],
782
+ "discard_pile": [
783
+ 18,
784
+ 22,
785
+ 19
786
+ ],
787
+ "discard_size": 3,
788
+ "stock_remaining": 28,
789
+ "deadwood": [
790
+ 45,
791
+ 68
792
+ ],
793
+ "model_logits": null,
794
+ "model_value": null,
795
+ "model_checkpoint": "r42_1500M.pkl",
796
+ "model_architecture": "simba_aux"
797
+ },
798
+ {
799
+ "turn": 13,
800
+ "player": 1,
801
+ "phase": "draw",
802
+ "action": 1,
803
+ "action_desc": "Model drew 6\u2666 from discard",
804
+ "card_drawn": 31,
805
+ "hands": [
806
+ [
807
+ 0,
808
+ 49,
809
+ 13,
810
+ 36,
811
+ 51,
812
+ 25,
813
+ 12,
814
+ 16,
815
+ 29,
816
+ 47
817
+ ],
818
+ [
819
+ 10,
820
+ 2,
821
+ 17,
822
+ 33,
823
+ 32,
824
+ 37,
825
+ 28,
826
+ 23,
827
+ 50,
828
+ 40
829
+ ]
830
+ ],
831
+ "hand_sizes": [
832
+ 10,
833
+ 10
834
+ ],
835
+ "discard_pile": [
836
+ 18,
837
+ 22,
838
+ 19,
839
+ 31
840
+ ],
841
+ "discard_size": 4,
842
+ "stock_remaining": 28,
843
+ "deadwood": [
844
+ 39,
845
+ 68
846
+ ],
847
+ "model_logits": [
848
+ -12.766131401062012,
849
+ 12.773184776306152,
850
+ 5.610723972320557,
851
+ -3.1971096992492676,
852
+ 0.7436082363128662,
853
+ -5.609472274780273,
854
+ 3.3056578636169434,
855
+ -2.3578593730926514,
856
+ 1.7023518085479736,
857
+ 3.813063144683838,
858
+ 1.2607258558273315,
859
+ -2.139775514602661,
860
+ -9.637784957885742,
861
+ 2.2619893550872803,
862
+ -2.404135227203369,
863
+ 4.29603385925293
864
+ ],
865
+ "model_value": -0.045100729912519455,
866
+ "model_checkpoint": "r42_1500M.pkl",
867
+ "model_architecture": "simba_aux"
868
+ },
869
+ {
870
+ "turn": 14,
871
+ "player": 1,
872
+ "phase": "discard",
873
+ "action": 10,
874
+ "action_desc": "Model discarded Q\u2663",
875
+ "card_drawn": null,
876
+ "hands": [
877
+ [
878
+ 0,
879
+ 49,
880
+ 13,
881
+ 36,
882
+ 51,
883
+ 25,
884
+ 12,
885
+ 16,
886
+ 29,
887
+ 47
888
+ ],
889
+ [
890
+ 10,
891
+ 2,
892
+ 17,
893
+ 33,
894
+ 32,
895
+ 37,
896
+ 28,
897
+ 23,
898
+ 50,
899
+ 40,
900
+ 31
901
+ ]
902
+ ],
903
+ "hand_sizes": [
904
+ 10,
905
+ 11
906
+ ],
907
+ "discard_pile": [
908
+ 18,
909
+ 22,
910
+ 19
911
+ ],
912
+ "discard_size": 3,
913
+ "stock_remaining": 28,
914
+ "deadwood": [
915
+ 39,
916
+ 53
917
+ ],
918
+ "model_logits": [
919
+ -8.021721839904785,
920
+ 8.047432899475098,
921
+ 7.162282466888428,
922
+ 4.094487190246582,
923
+ 7.1345295906066895,
924
+ -16.153514862060547,
925
+ -7.9078874588012695,
926
+ 1.2269738912582397,
927
+ 1.6170539855957031,
928
+ 7.448216438293457,
929
+ 8.225522994995117,
930
+ -3.8998653888702393,
931
+ -16.716604232788086,
932
+ 5.179447174072266,
933
+ -5.225028991699219,
934
+ 2.2410192489624023
935
+ ],
936
+ "model_value": 0.15882274508476257,
937
+ "model_checkpoint": "r42_1500M.pkl",
938
+ "model_architecture": "simba_aux"
939
+ },
940
+ {
941
+ "turn": 15,
942
+ "player": 0,
943
+ "phase": "draw",
944
+ "action": 0,
945
+ "action_desc": "Human drew from stock",
946
+ "card_drawn": 21,
947
+ "hands": [
948
+ [
949
+ 0,
950
+ 49,
951
+ 13,
952
+ 36,
953
+ 51,
954
+ 25,
955
+ 12,
956
+ 16,
957
+ 29,
958
+ 47
959
+ ],
960
+ [
961
+ 10,
962
+ 2,
963
+ 17,
964
+ 33,
965
+ 32,
966
+ 37,
967
+ 28,
968
+ 23,
969
+ 31,
970
+ 40
971
+ ]
972
+ ],
973
+ "hand_sizes": [
974
+ 10,
975
+ 10
976
+ ],
977
+ "discard_pile": [
978
+ 18,
979
+ 22,
980
+ 19,
981
+ 50
982
+ ],
983
+ "discard_size": 4,
984
+ "stock_remaining": 28,
985
+ "deadwood": [
986
+ 39,
987
+ 43
988
+ ],
989
+ "model_logits": null,
990
+ "model_value": null,
991
+ "model_checkpoint": "r42_1500M.pkl",
992
+ "model_architecture": "simba_aux"
993
+ },
994
+ {
995
+ "turn": 16,
996
+ "player": 0,
997
+ "phase": "discard",
998
+ "action": 12,
999
+ "action_desc": "Human discarded 9\u2665",
1000
+ "card_drawn": null,
1001
+ "hands": [
1002
+ [
1003
+ 0,
1004
+ 49,
1005
+ 13,
1006
+ 36,
1007
+ 51,
1008
+ 25,
1009
+ 12,
1010
+ 16,
1011
+ 29,
1012
+ 47,
1013
+ 21
1014
+ ],
1015
+ [
1016
+ 10,
1017
+ 2,
1018
+ 17,
1019
+ 33,
1020
+ 32,
1021
+ 37,
1022
+ 28,
1023
+ 23,
1024
+ 31,
1025
+ 40
1026
+ ]
1027
+ ],
1028
+ "hand_sizes": [
1029
+ 11,
1030
+ 10
1031
+ ],
1032
+ "discard_pile": [
1033
+ 18,
1034
+ 22,
1035
+ 19,
1036
+ 50
1037
+ ],
1038
+ "discard_size": 4,
1039
+ "stock_remaining": 27,
1040
+ "deadwood": [
1041
+ 48,
1042
+ 43
1043
+ ],
1044
+ "model_logits": null,
1045
+ "model_value": null,
1046
+ "model_checkpoint": "r42_1500M.pkl",
1047
+ "model_architecture": "simba_aux"
1048
+ },
1049
+ {
1050
+ "turn": 17,
1051
+ "player": 1,
1052
+ "phase": "draw",
1053
+ "action": 0,
1054
+ "action_desc": "Model drew from stock",
1055
+ "card_drawn": 45,
1056
+ "hands": [
1057
+ [
1058
+ 0,
1059
+ 49,
1060
+ 13,
1061
+ 36,
1062
+ 51,
1063
+ 25,
1064
+ 12,
1065
+ 16,
1066
+ 29,
1067
+ 47
1068
+ ],
1069
+ [
1070
+ 10,
1071
+ 2,
1072
+ 17,
1073
+ 33,
1074
+ 32,
1075
+ 37,
1076
+ 28,
1077
+ 23,
1078
+ 31,
1079
+ 40
1080
+ ]
1081
+ ],
1082
+ "hand_sizes": [
1083
+ 10,
1084
+ 10
1085
+ ],
1086
+ "discard_pile": [
1087
+ 18,
1088
+ 22,
1089
+ 19,
1090
+ 50,
1091
+ 21
1092
+ ],
1093
+ "discard_size": 5,
1094
+ "stock_remaining": 27,
1095
+ "deadwood": [
1096
+ 39,
1097
+ 43
1098
+ ],
1099
+ "model_logits": [
1100
+ 4.548898220062256,
1101
+ -4.5547943115234375,
1102
+ 13.209750175476074,
1103
+ -1.820097804069519,
1104
+ 1.79795503616333,
1105
+ -10.446738243103027,
1106
+ -7.593908309936523,
1107
+ 9.362470626831055,
1108
+ -0.7354304790496826,
1109
+ 7.653305530548096,
1110
+ -8.878214836120605,
1111
+ -0.12164069712162018,
1112
+ 0.32691746950149536,
1113
+ 4.507696628570557,
1114
+ -5.78254508972168,
1115
+ 10.70018482208252
1116
+ ],
1117
+ "model_value": 0.06861646473407745,
1118
+ "model_checkpoint": "r42_1500M.pkl",
1119
+ "model_architecture": "simba_aux"
1120
+ },
1121
+ {
1122
+ "turn": 18,
1123
+ "player": 1,
1124
+ "phase": "discard",
1125
+ "action": 7,
1126
+ "action_desc": "Model discarded Q\u2666",
1127
+ "card_drawn": null,
1128
+ "hands": [
1129
+ [
1130
+ 0,
1131
+ 49,
1132
+ 13,
1133
+ 36,
1134
+ 51,
1135
+ 25,
1136
+ 12,
1137
+ 16,
1138
+ 29,
1139
+ 47
1140
+ ],
1141
+ [
1142
+ 10,
1143
+ 2,
1144
+ 17,
1145
+ 33,
1146
+ 32,
1147
+ 37,
1148
+ 28,
1149
+ 23,
1150
+ 31,
1151
+ 40,
1152
+ 45
1153
+ ]
1154
+ ],
1155
+ "hand_sizes": [
1156
+ 10,
1157
+ 11
1158
+ ],
1159
+ "discard_pile": [
1160
+ 18,
1161
+ 22,
1162
+ 19,
1163
+ 50,
1164
+ 21
1165
+ ],
1166
+ "discard_size": 5,
1167
+ "stock_remaining": 26,
1168
+ "deadwood": [
1169
+ 39,
1170
+ 50
1171
+ ],
1172
+ "model_logits": [
1173
+ 0.7739650011062622,
1174
+ -0.7837098836898804,
1175
+ 8.015311241149902,
1176
+ -3.652681827545166,
1177
+ 3.8251984119415283,
1178
+ -15.586004257202148,
1179
+ -7.727936744689941,
1180
+ 13.57060432434082,
1181
+ -1.0775803327560425,
1182
+ 7.013462543487549,
1183
+ -12.81948184967041,
1184
+ -2.7394797801971436,
1185
+ 9.556540489196777,
1186
+ 7.017982006072998,
1187
+ -8.095766067504883,
1188
+ 8.072487831115723
1189
+ ],
1190
+ "model_value": 0.08303806185722351,
1191
+ "model_checkpoint": "r42_1500M.pkl",
1192
+ "model_architecture": "simba_aux"
1193
+ },
1194
+ {
1195
+ "turn": 19,
1196
+ "player": 0,
1197
+ "phase": "draw",
1198
+ "action": 0,
1199
+ "action_desc": "Human drew from stock",
1200
+ "card_drawn": 38,
1201
+ "hands": [
1202
+ [
1203
+ 0,
1204
+ 49,
1205
+ 13,
1206
+ 36,
1207
+ 51,
1208
+ 25,
1209
+ 12,
1210
+ 16,
1211
+ 29,
1212
+ 47
1213
+ ],
1214
+ [
1215
+ 10,
1216
+ 2,
1217
+ 17,
1218
+ 33,
1219
+ 32,
1220
+ 45,
1221
+ 28,
1222
+ 23,
1223
+ 31,
1224
+ 40
1225
+ ]
1226
+ ],
1227
+ "hand_sizes": [
1228
+ 10,
1229
+ 10
1230
+ ],
1231
+ "discard_pile": [
1232
+ 18,
1233
+ 22,
1234
+ 19,
1235
+ 50,
1236
+ 21,
1237
+ 37
1238
+ ],
1239
+ "discard_size": 6,
1240
+ "stock_remaining": 26,
1241
+ "deadwood": [
1242
+ 39,
1243
+ 40
1244
+ ],
1245
+ "model_logits": null,
1246
+ "model_value": null,
1247
+ "model_checkpoint": "r42_1500M.pkl",
1248
+ "model_architecture": "simba_aux"
1249
+ },
1250
+ {
1251
+ "turn": 20,
1252
+ "player": 0,
1253
+ "phase": "discard",
1254
+ "action": 11,
1255
+ "action_desc": "Human discarded 9\u2663",
1256
+ "card_drawn": null,
1257
+ "hands": [
1258
+ [
1259
+ 0,
1260
+ 49,
1261
+ 13,
1262
+ 36,
1263
+ 51,
1264
+ 25,
1265
+ 12,
1266
+ 16,
1267
+ 29,
1268
+ 47,
1269
+ 38
1270
+ ],
1271
+ [
1272
+ 10,
1273
+ 2,
1274
+ 17,
1275
+ 33,
1276
+ 32,
1277
+ 45,
1278
+ 28,
1279
+ 23,
1280
+ 31,
1281
+ 40
1282
+ ]
1283
+ ],
1284
+ "hand_sizes": [
1285
+ 11,
1286
+ 10
1287
+ ],
1288
+ "discard_pile": [
1289
+ 18,
1290
+ 22,
1291
+ 19,
1292
+ 50,
1293
+ 21,
1294
+ 37
1295
+ ],
1296
+ "discard_size": 6,
1297
+ "stock_remaining": 25,
1298
+ "deadwood": [
1299
+ 39,
1300
+ 40
1301
+ ],
1302
+ "model_logits": null,
1303
+ "model_value": null,
1304
+ "model_checkpoint": "r42_1500M.pkl",
1305
+ "model_architecture": "simba_aux"
1306
+ },
1307
+ {
1308
+ "turn": 21,
1309
+ "player": 1,
1310
+ "phase": "draw",
1311
+ "action": 0,
1312
+ "action_desc": "Model drew from stock",
1313
+ "card_drawn": 20,
1314
+ "hands": [
1315
+ [
1316
+ 0,
1317
+ 49,
1318
+ 13,
1319
+ 36,
1320
+ 51,
1321
+ 25,
1322
+ 12,
1323
+ 16,
1324
+ 29,
1325
+ 38
1326
+ ],
1327
+ [
1328
+ 10,
1329
+ 2,
1330
+ 17,
1331
+ 33,
1332
+ 32,
1333
+ 45,
1334
+ 28,
1335
+ 23,
1336
+ 31,
1337
+ 40
1338
+ ]
1339
+ ],
1340
+ "hand_sizes": [
1341
+ 10,
1342
+ 10
1343
+ ],
1344
+ "discard_pile": [
1345
+ 18,
1346
+ 22,
1347
+ 19,
1348
+ 50,
1349
+ 21,
1350
+ 37,
1351
+ 47
1352
+ ],
1353
+ "discard_size": 7,
1354
+ "stock_remaining": 25,
1355
+ "deadwood": [
1356
+ 30,
1357
+ 40
1358
+ ],
1359
+ "model_logits": [
1360
+ 4.57336950302124,
1361
+ -4.58177375793457,
1362
+ 12.968157768249512,
1363
+ -2.3562889099121094,
1364
+ 1.7222951650619507,
1365
+ -9.085198402404785,
1366
+ -7.915225028991699,
1367
+ 5.865841388702393,
1368
+ 0.047242775559425354,
1369
+ 10.256677627563477,
1370
+ -8.264598846435547,
1371
+ 0.014974662102758884,
1372
+ -1.0141911506652832,
1373
+ 4.182888031005859,
1374
+ -5.529803276062012,
1375
+ 11.647488594055176
1376
+ ],
1377
+ "model_value": 0.08017067611217499,
1378
+ "model_checkpoint": "r42_1500M.pkl",
1379
+ "model_architecture": "simba_aux"
1380
+ },
1381
+ {
1382
+ "turn": 22,
1383
+ "player": 1,
1384
+ "phase": "discard",
1385
+ "action": 2,
1386
+ "action_desc": "Model discarded J\u2660",
1387
+ "card_drawn": null,
1388
+ "hands": [
1389
+ [
1390
+ 0,
1391
+ 49,
1392
+ 13,
1393
+ 36,
1394
+ 51,
1395
+ 25,
1396
+ 12,
1397
+ 16,
1398
+ 29,
1399
+ 38
1400
+ ],
1401
+ [
1402
+ 10,
1403
+ 2,
1404
+ 17,
1405
+ 33,
1406
+ 32,
1407
+ 45,
1408
+ 28,
1409
+ 23,
1410
+ 31,
1411
+ 40,
1412
+ 20
1413
+ ]
1414
+ ],
1415
+ "hand_sizes": [
1416
+ 10,
1417
+ 11
1418
+ ],
1419
+ "discard_pile": [
1420
+ 18,
1421
+ 22,
1422
+ 19,
1423
+ 50,
1424
+ 21,
1425
+ 37,
1426
+ 47
1427
+ ],
1428
+ "discard_size": 7,
1429
+ "stock_remaining": 24,
1430
+ "deadwood": [
1431
+ 30,
1432
+ 48
1433
+ ],
1434
+ "model_logits": [
1435
+ -1.6451659202575684,
1436
+ 1.6378941535949707,
1437
+ 9.565203666687012,
1438
+ -3.0731124877929688,
1439
+ 2.7273499965667725,
1440
+ -15.453739166259766,
1441
+ -7.346385955810547,
1442
+ 8.290855407714844,
1443
+ 0.5172525644302368,
1444
+ 10.439199447631836,
1445
+ -11.189264297485352,
1446
+ -2.1577279567718506,
1447
+ 5.048219680786133,
1448
+ 7.282042980194092,
1449
+ -8.237589836120605,
1450
+ 8.142048835754395
1451
+ ],
1452
+ "model_value": 0.08465616405010223,
1453
+ "model_checkpoint": "r42_1500M.pkl",
1454
+ "model_architecture": "simba_aux"
1455
+ },
1456
+ {
1457
+ "turn": 23,
1458
+ "player": 0,
1459
+ "phase": "draw",
1460
+ "action": 1,
1461
+ "action_desc": "Human drew J\u2660 from discard",
1462
+ "card_drawn": 10,
1463
+ "hands": [
1464
+ [
1465
+ 0,
1466
+ 49,
1467
+ 13,
1468
+ 36,
1469
+ 51,
1470
+ 25,
1471
+ 12,
1472
+ 16,
1473
+ 29,
1474
+ 38
1475
+ ],
1476
+ [
1477
+ 20,
1478
+ 2,
1479
+ 17,
1480
+ 33,
1481
+ 32,
1482
+ 45,
1483
+ 28,
1484
+ 23,
1485
+ 31,
1486
+ 40
1487
+ ]
1488
+ ],
1489
+ "hand_sizes": [
1490
+ 10,
1491
+ 10
1492
+ ],
1493
+ "discard_pile": [
1494
+ 18,
1495
+ 22,
1496
+ 19,
1497
+ 50,
1498
+ 21,
1499
+ 37,
1500
+ 47,
1501
+ 10
1502
+ ],
1503
+ "discard_size": 8,
1504
+ "stock_remaining": 24,
1505
+ "deadwood": [
1506
+ 30,
1507
+ 38
1508
+ ],
1509
+ "model_logits": null,
1510
+ "model_value": null,
1511
+ "model_checkpoint": "r42_1500M.pkl",
1512
+ "model_architecture": "simba_aux"
1513
+ },
1514
+ {
1515
+ "turn": 24,
1516
+ "player": 0,
1517
+ "phase": "discard",
1518
+ "action": 9,
1519
+ "action_desc": "Human discarded 4\u2665",
1520
+ "card_drawn": null,
1521
+ "hands": [
1522
+ [
1523
+ 0,
1524
+ 49,
1525
+ 13,
1526
+ 36,
1527
+ 51,
1528
+ 25,
1529
+ 12,
1530
+ 16,
1531
+ 29,
1532
+ 38,
1533
+ 10
1534
+ ],
1535
+ [
1536
+ 20,
1537
+ 2,
1538
+ 17,
1539
+ 33,
1540
+ 32,
1541
+ 45,
1542
+ 28,
1543
+ 23,
1544
+ 31,
1545
+ 40
1546
+ ]
1547
+ ],
1548
+ "hand_sizes": [
1549
+ 11,
1550
+ 10
1551
+ ],
1552
+ "discard_pile": [
1553
+ 18,
1554
+ 22,
1555
+ 19,
1556
+ 50,
1557
+ 21,
1558
+ 37,
1559
+ 47
1560
+ ],
1561
+ "discard_size": 7,
1562
+ "stock_remaining": 24,
1563
+ "deadwood": [
1564
+ 10,
1565
+ 38
1566
+ ],
1567
+ "model_logits": null,
1568
+ "model_value": null,
1569
+ "model_checkpoint": "r42_1500M.pkl",
1570
+ "model_architecture": "simba_aux"
1571
+ },
1572
+ {
1573
+ "turn": 25,
1574
+ "player": 0,
1575
+ "phase": "knock_decision",
1576
+ "action": 14,
1577
+ "action_desc": "Human knocked",
1578
+ "card_drawn": null,
1579
+ "hands": [
1580
+ [
1581
+ 0,
1582
+ 49,
1583
+ 13,
1584
+ 36,
1585
+ 51,
1586
+ 25,
1587
+ 12,
1588
+ 10,
1589
+ 29,
1590
+ 38
1591
+ ],
1592
+ [
1593
+ 20,
1594
+ 2,
1595
+ 17,
1596
+ 33,
1597
+ 32,
1598
+ 45,
1599
+ 28,
1600
+ 23,
1601
+ 31,
1602
+ 40
1603
+ ]
1604
+ ],
1605
+ "hand_sizes": [
1606
+ 10,
1607
+ 10
1608
+ ],
1609
+ "discard_pile": [
1610
+ 18,
1611
+ 22,
1612
+ 19,
1613
+ 50,
1614
+ 21,
1615
+ 37,
1616
+ 47,
1617
+ 16
1618
+ ],
1619
+ "discard_size": 8,
1620
+ "stock_remaining": 24,
1621
+ "deadwood": [
1622
+ 6,
1623
+ 38
1624
+ ],
1625
+ "model_logits": null,
1626
+ "model_value": null,
1627
+ "model_checkpoint": "r42_1500M.pkl",
1628
+ "model_architecture": "simba_aux"
1629
+ },
1630
+ {
1631
+ "turn": 26,
1632
+ "player": -1,
1633
+ "phase": "game_over",
1634
+ "action": -1,
1635
+ "action_desc": "Game over.",
1636
+ "card_drawn": null,
1637
+ "hands": [
1638
+ [
1639
+ 0,
1640
+ 49,
1641
+ 13,
1642
+ 36,
1643
+ 51,
1644
+ 25,
1645
+ 12,
1646
+ 10,
1647
+ 29,
1648
+ 38
1649
+ ],
1650
+ [
1651
+ 20,
1652
+ 2,
1653
+ 17,
1654
+ 33,
1655
+ 32,
1656
+ 45,
1657
+ 28,
1658
+ 23,
1659
+ 31,
1660
+ 40
1661
+ ]
1662
+ ],
1663
+ "hand_sizes": [
1664
+ 10,
1665
+ 10
1666
+ ],
1667
+ "discard_pile": [
1668
+ 18,
1669
+ 22,
1670
+ 19,
1671
+ 50,
1672
+ 21,
1673
+ 37,
1674
+ 47,
1675
+ 16
1676
+ ],
1677
+ "discard_size": 8,
1678
+ "stock_remaining": 24,
1679
+ "deadwood": [
1680
+ 6,
1681
+ 38
1682
+ ],
1683
+ "model_logits": null,
1684
+ "model_value": null,
1685
+ "model_checkpoint": "r42_1500M.pkl",
1686
+ "model_architecture": "simba_aux"
1687
+ }
1688
+ ]
1689
+ }
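
The integer card ids and `action` values in this log are not documented in the file itself. The sketch below shows one reading that agrees with the `action_desc` strings in this game (for example `0` is A♠, `18` is 6♥, `50` is Q♣, and a discard `action` of `4` removes the card at hand index 2). The helper names `decode_card` and `discarded_card`, and the offset constant `DISCARD_OFFSET`, are hypothetical and inferred from this one file only; they are not taken from the project's code.

```python
# Assumed encoding, inferred from the action_desc strings in this log:
# card_id = suit * 13 + rank, suits ordered spades, hearts, diamonds, clubs.
RANKS = ["A", "2", "3", "4", "5", "6", "7", "8", "9", "10", "J", "Q", "K"]
SUITS = ["\u2660", "\u2665", "\u2666", "\u2663"]

# Assumption: actions 0 and 1 are the two draw choices, and discard actions
# start at 2, so (action - 2) is an index into the acting player's hand.
DISCARD_OFFSET = 2


def decode_card(card_id):
    """Return a label such as 'A♠' for a card id in the range 0..51."""
    if not 0 <= card_id < 52:
        raise ValueError(f"card id out of range: {card_id}")
    suit, rank = divmod(card_id, 13)
    return RANKS[rank] + SUITS[suit]


def discarded_card(action, hand):
    """Return the card id removed by a discard-phase action.

    `hand` is the acting player's 11-card hand as stored in the same record.
    """
    index = action - DISCARD_OFFSET
    if not 0 <= index < len(hand):
        raise ValueError(f"action {action} does not index into the hand")
    return hand[index]


if __name__ == "__main__":
    # Turn 2 of this game: the model's hand and its discard action.
    hand = [10, 2, 18, 33, 32, 37, 28, 23, 50, 40, 0]
    card = discarded_card(4, hand)
    print(card, decode_card(card))
```

Each record appears to store the hands as they were before the described action took effect, which is why the discarded card is still present in the 11-card hand of the same record.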