Arijit-07 committed on
Commit d765530 · verified · 1 Parent(s): 4954ce0

Upload training_log.json with huggingface_hub

Files changed (1)
  1. training_log.json +962 -0
training_log.json ADDED
@@ -0,0 +1,962 @@
+ [
+ {
+ "episode": 1,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -0.010825451463460922,
+ "elapsed_min": 0.7
+ },
+ {
+ "episode": 2,
+ "task_id": "easy",
+ "score": 0.44999999999999996,
+ "rolling_avg": 0.3,
+ "loss": -0.3882697783410549,
+ "elapsed_min": 1.2
+ },
+ {
+ "episode": 3,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.25,
+ "loss": -0.006317885592579842,
+ "elapsed_min": 1.7
+ },
+ {
+ "episode": 4,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.22499999999999998,
+ "loss": -0.002757230463127295,
+ "elapsed_min": 2.2
+ },
+ {
+ "episode": 5,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.20999999999999996,
+ "loss": -0.0009385837086786827,
+ "elapsed_min": 2.7
+ },
+ {
+ "episode": 6,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.19999999999999998,
+ "loss": -0.000268050159017245,
+ "elapsed_min": 3.1
+ },
+ {
+ "episode": 7,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.19285714285714284,
+ "loss": -0.00014986475192320844,
+ "elapsed_min": 3.6
+ },
+ {
+ "episode": 8,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.1875,
+ "loss": -0.00012323262247567376,
+ "elapsed_min": 4.1
+ },
+ {
+ "episode": 9,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.18333333333333332,
+ "loss": -0.0001001283914471666,
+ "elapsed_min": 4.5
+ },
+ {
+ "episode": 10,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.18,
+ "loss": -7.673109939787537e-05,
+ "elapsed_min": 5.0
+ },
+ {
+ "episode": 11,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.18,
+ "loss": -7.291024667210877e-05,
+ "elapsed_min": 5.5
+ },
+ {
+ "episode": 12,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -6.4735664636828e-05,
+ "elapsed_min": 5.9
+ },
+ {
+ "episode": 13,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -5.6709604299006365e-05,
+ "elapsed_min": 6.4
+ },
+ {
+ "episode": 14,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -6.014305108692497e-05,
+ "elapsed_min": 6.9
+ },
+ {
+ "episode": 15,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.97970807676514e-05,
+ "elapsed_min": 7.3
+ },
+ {
+ "episode": 16,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.758102780518433e-05,
+ "elapsed_min": 7.8
+ },
+ {
+ "episode": 17,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -5.123086157254875e-05,
+ "elapsed_min": 8.3
+ },
+ {
+ "episode": 18,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.520122214065244e-05,
+ "elapsed_min": 8.7
+ },
+ {
+ "episode": 19,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.29553814077129e-05,
+ "elapsed_min": 9.2
+ },
+ {
+ "episode": 20,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.222163018615296e-05,
+ "elapsed_min": 9.7
+ },
+ {
+ "episode": 21,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.065355945688983e-05,
+ "elapsed_min": 10.1
+ },
+ {
+ "episode": 22,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.031465505249798e-05,
+ "elapsed_min": 10.6
+ },
+ {
+ "episode": 23,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.8977515941951424e-05,
+ "elapsed_min": 11.1
+ },
+ {
+ "episode": 24,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.953995474148542e-05,
+ "elapsed_min": 11.5
+ },
+ {
+ "episode": 25,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.222510809389254e-05,
+ "elapsed_min": 12.0
+ },
+ {
+ "episode": 26,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.5085269094755254e-05,
+ "elapsed_min": 12.5
+ },
+ {
+ "episode": 27,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -4.1230644759101175e-05,
+ "elapsed_min": 13.0
+ },
+ {
+ "episode": 28,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.651188065608343e-05,
+ "elapsed_min": 13.4
+ },
+ {
+ "episode": 29,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.5379496694076806e-05,
+ "elapsed_min": 13.9
+ },
+ {
+ "episode": 30,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.4791060897987336e-05,
+ "elapsed_min": 14.4
+ },
+ {
+ "episode": 31,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.258228146781524e-05,
+ "elapsed_min": 14.8
+ },
+ {
+ "episode": 32,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.906668037719404e-05,
+ "elapsed_min": 15.3
+ },
+ {
+ "episode": 33,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.1870833481661975e-05,
+ "elapsed_min": 15.8
+ },
+ {
+ "episode": 34,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.381890564924106e-05,
+ "elapsed_min": 16.2
+ },
+ {
+ "episode": 35,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.911871317541227e-05,
+ "elapsed_min": 16.7
+ },
+ {
+ "episode": 36,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.1144530415379755e-05,
+ "elapsed_min": 17.2
+ },
+ {
+ "episode": 37,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.791576212582489e-05,
+ "elapsed_min": 17.6
+ },
+ {
+ "episode": 38,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.972935276882102e-05,
+ "elapsed_min": 18.1
+ },
+ {
+ "episode": 39,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.282070732287442e-05,
+ "elapsed_min": 18.6
+ },
+ {
+ "episode": 40,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.720065190767249e-05,
+ "elapsed_min": 19.0
+ },
+ {
+ "episode": 41,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.780400341687103e-05,
+ "elapsed_min": 19.5
+ },
+ {
+ "episode": 42,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.088007360929623e-05,
+ "elapsed_min": 20.0
+ },
+ {
+ "episode": 43,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.066030330955982e-05,
+ "elapsed_min": 20.4
+ },
+ {
+ "episode": 44,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.078695347843071e-05,
+ "elapsed_min": 20.9
+ },
+ {
+ "episode": 45,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.0798129349326096e-05,
+ "elapsed_min": 21.4
+ },
+ {
+ "episode": 46,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.2694064429961145e-05,
+ "elapsed_min": 21.8
+ },
+ {
+ "episode": 47,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.8038631222055606e-05,
+ "elapsed_min": 22.3
+ },
+ {
+ "episode": 48,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.114829208546629e-05,
+ "elapsed_min": 22.8
+ },
+ {
+ "episode": 49,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.783006832236424e-05,
+ "elapsed_min": 23.2
+ },
+ {
+ "episode": 50,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.803863849801322e-05,
+ "elapsed_min": 23.7
+ },
+ {
+ "episode": 51,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.744272301749637e-05,
+ "elapsed_min": 24.2
+ },
+ {
+ "episode": 52,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.096200816798955e-05,
+ "elapsed_min": 24.7
+ },
+ {
+ "episode": 53,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.201609069947153e-05,
+ "elapsed_min": 25.1
+ },
+ {
+ "episode": 54,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.262702375650406e-05,
+ "elapsed_min": 25.6
+ },
+ {
+ "episode": 55,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.2898945695099734e-05,
+ "elapsed_min": 26.1
+ },
+ {
+ "episode": 56,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.9476086385548115e-05,
+ "elapsed_min": 26.5
+ },
+ {
+ "episode": 57,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.774813861430933e-05,
+ "elapsed_min": 27.0
+ },
+ {
+ "episode": 58,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.258597765428325e-05,
+ "elapsed_min": 27.5
+ },
+ {
+ "episode": 59,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.207565168850124e-05,
+ "elapsed_min": 27.9
+ },
+ {
+ "episode": 60,
+ "task_id": "easy",
+ "score": 0.15,
+ "rolling_avg": 0.15,
+ "loss": -3.259719718092432e-05,
+ "elapsed_min": 28.4
+ },
+ {
+ "episode": 61,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.103306365432218e-05,
+ "elapsed_min": 29.1
+ },
+ {
+ "episode": 62,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00010130244481842965,
+ "elapsed_min": 29.8
+ },
+ {
+ "episode": 63,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.10000000000000002,
+ "loss": -0.0002113376249326393,
+ "elapsed_min": 30.5
+ },
+ {
+ "episode": 64,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.943062130128965e-05,
+ "elapsed_min": 31.2
+ },
+ {
+ "episode": 65,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00020467072317842394,
+ "elapsed_min": 31.9
+ },
+ {
+ "episode": 66,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.10000000000000002,
+ "loss": -0.00022407736105378717,
+ "elapsed_min": 32.6
+ },
+ {
+ "episode": 67,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.000216203392483294,
+ "elapsed_min": 33.3
+ },
+ {
+ "episode": 68,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.213412704411894e-05,
+ "elapsed_min": 34.0
+ },
+ {
+ "episode": 69,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00020417060295585543,
+ "elapsed_min": 34.7
+ },
+ {
+ "episode": 70,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.2222672062926e-05,
+ "elapsed_min": 35.4
+ },
+ {
+ "episode": 71,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.799903637031093e-05,
+ "elapsed_min": 36.1
+ },
+ {
+ "episode": 72,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.412641182076186e-05,
+ "elapsed_min": 36.8
+ },
+ {
+ "episode": 73,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.218390187015757e-05,
+ "elapsed_min": 37.5
+ },
+ {
+ "episode": 74,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.848460856825113e-05,
+ "elapsed_min": 38.2
+ },
+ {
+ "episode": 75,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.821655112318695e-05,
+ "elapsed_min": 38.9
+ },
+ {
+ "episode": 76,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.733268234413117e-05,
+ "elapsed_min": 39.6
+ },
+ {
+ "episode": 77,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.0002242178888991475,
+ "elapsed_min": 40.3
+ },
+ {
+ "episode": 78,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.307556319981813e-05,
+ "elapsed_min": 41.0
+ },
+ {
+ "episode": 79,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00022402922331821173,
+ "elapsed_min": 41.7
+ },
+ {
+ "episode": 80,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.0002193757245549932,
+ "elapsed_min": 42.4
+ },
+ {
+ "episode": 81,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.736053470987827e-05,
+ "elapsed_min": 43.1
+ },
+ {
+ "episode": 82,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.2687776486855e-05,
+ "elapsed_min": 43.8
+ },
+ {
+ "episode": 83,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.923032484948635e-05,
+ "elapsed_min": 44.5
+ },
+ {
+ "episode": 84,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.229838178725913e-05,
+ "elapsed_min": 45.1
+ },
+ {
+ "episode": 85,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00020920761744491756,
+ "elapsed_min": 45.8
+ },
+ {
+ "episode": 86,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.279428923036903e-05,
+ "elapsed_min": 46.5
+ },
+ {
+ "episode": 87,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.046419290825725e-05,
+ "elapsed_min": 47.2
+ },
+ {
+ "episode": 88,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.74025679170154e-05,
+ "elapsed_min": 47.9
+ },
+ {
+ "episode": 89,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00023454223992303014,
+ "elapsed_min": 48.6
+ },
+ {
+ "episode": 90,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.0002482115232851356,
+ "elapsed_min": 49.3
+ },
+ {
+ "episode": 91,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.371624037157744e-05,
+ "elapsed_min": 50.0
+ },
+ {
+ "episode": 92,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.326796523761004e-05,
+ "elapsed_min": 50.7
+ },
+ {
+ "episode": 93,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.937737584346905e-05,
+ "elapsed_min": 51.4
+ },
+ {
+ "episode": 94,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.497594146523625e-05,
+ "elapsed_min": 52.1
+ },
+ {
+ "episode": 95,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.383925498696044e-05,
+ "elapsed_min": 52.8
+ },
+ {
+ "episode": 96,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00020839256467297673,
+ "elapsed_min": 53.5
+ },
+ {
+ "episode": 97,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.501779277343303e-05,
+ "elapsed_min": 54.2
+ },
+ {
+ "episode": 98,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.319297532783821e-05,
+ "elapsed_min": 54.9
+ },
+ {
+ "episode": 99,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.0002205862256232649,
+ "elapsed_min": 55.6
+ },
+ {
+ "episode": 100,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.067376959137619e-05,
+ "elapsed_min": 56.3
+ },
+ {
+ "episode": 101,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00023439245705958456,
+ "elapsed_min": 57.0
+ },
+ {
+ "episode": 102,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.057316492544487e-05,
+ "elapsed_min": 57.7
+ },
+ {
+ "episode": 103,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.0002118111588060856,
+ "elapsed_min": 58.4
+ },
+ {
+ "episode": 104,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.361907723359764e-05,
+ "elapsed_min": 59.1
+ },
+ {
+ "episode": 105,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.771108696237206e-05,
+ "elapsed_min": 59.8
+ },
+ {
+ "episode": 106,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.0002442508703097701,
+ "elapsed_min": 60.5
+ },
+ {
+ "episode": 107,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.948814113158733e-05,
+ "elapsed_min": 61.2
+ },
+ {
+ "episode": 108,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00022064431686885655,
+ "elapsed_min": 61.9
+ },
+ {
+ "episode": 109,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.638978761155158e-05,
+ "elapsed_min": 62.6
+ },
+ {
+ "episode": 110,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.0002063308347715065,
+ "elapsed_min": 63.3
+ },
+ {
+ "episode": 111,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.071197953540832e-05,
+ "elapsed_min": 63.9
+ },
+ {
+ "episode": 112,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.405435412190855e-05,
+ "elapsed_min": 64.6
+ },
+ {
+ "episode": 113,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.206426329910755e-05,
+ "elapsed_min": 65.3
+ },
+ {
+ "episode": 114,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.239844075636938e-05,
+ "elapsed_min": 66.0
+ },
+ {
+ "episode": 115,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.934973448049277e-05,
+ "elapsed_min": 66.7
+ },
+ {
+ "episode": 116,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -9.305024286732078e-05,
+ "elapsed_min": 67.4
+ },
+ {
+ "episode": 117,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -8.554262603865936e-05,
+ "elapsed_min": 68.1
+ },
+ {
+ "episode": 118,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00021213541913311929,
+ "elapsed_min": 68.8
+ },
+ {
+ "episode": 119,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00019552590674720705,
+ "elapsed_min": 69.5
+ },
+ {
+ "episode": 120,
+ "task_id": "medium",
+ "score": 0.1,
+ "rolling_avg": 0.1,
+ "loss": -0.00023136528034228832,
+ "elapsed_min": 70.2
+ }
+ ]
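
The log does not say how `rolling_avg` is computed. The logged values are consistent with a mean over the last 10 scores that restarts when `task_id` changes: episode 11 still reports 0.18, episode 12 drops to 0.15, and episode 61 starts again at 0.1. The Python sketch below recomputes the column under that assumption and reports the largest deviation from the logged values. The window size, the per-task restart, and the helper names `recompute_rolling_avg` and `max_abs_diff` are assumptions inferred from the data and are not part of the commit.

```python
import json
from collections import defaultdict, deque
from pathlib import Path

WINDOW = 10  # assumed window size, inferred from the logged values


def recompute_rolling_avg(records, window=WINDOW):
    """Rolling mean of `score`, kept separately for each `task_id` (assumption)."""
    recent = defaultdict(lambda: deque(maxlen=window))
    averages = []
    for rec in records:
        scores = recent[rec["task_id"]]
        scores.append(rec["score"])  # deque drops the oldest score past `window`
        averages.append(sum(scores) / len(scores))
    return averages


def max_abs_diff(records, window=WINDOW):
    """Largest absolute gap between logged and recomputed `rolling_avg`."""
    recomputed = recompute_rolling_avg(records, window)
    return max(abs(rec["rolling_avg"] - avg) for rec, avg in zip(records, recomputed))


if __name__ == "__main__":
    log_path = Path("training_log.json")  # file name taken from the commit
    if log_path.exists():
        records = json.loads(log_path.read_text())
        print(f"{len(records)} episodes, max deviation: {max_abs_diff(records):.2e}")
    else:
        print(f"{log_path} not found; download it from the repository first")
```

A deviation near floating-point precision would support the window-of-10 reading; a larger gap would mean the training script uses a different rule.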