File size: 1,776 Bytes
e65379c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
{
  "family": "router",
  "split": "test",
  "overall": {
    "rows": 3000,
    "accuracy": 1.0,
    "macro_f1": 1.0,
    "weighted_f1": 1.0,
    "micro_f1": 1.0,
    "labels": [
      "low",
      "medium",
      "high"
    ],
    "confusion_matrix": [
      [
        1000,
        0,
        0
      ],
      [
        0,
        1000,
        0
      ],
      [
        0,
        0,
        1000
      ]
    ],
    "calibration_error": 0.0,
    "classification_report": {
      "confidence_bin::high": {
        "precision": 1.0,
        "recall": 1.0,
        "f1-score": 1.0,
        "support": 1000.0
      },
      "confidence_bin::low": {
        "precision": 1.0,
        "recall": 1.0,
        "f1-score": 1.0,
        "support": 1000.0
      },
      "confidence_bin::medium": {
        "precision": 1.0,
        "recall": 1.0,
        "f1-score": 1.0,
        "support": 1000.0
      },
      "accuracy": 1.0,
      "macro avg": {
        "precision": 1.0,
        "recall": 1.0,
        "f1-score": 1.0,
        "support": 3000.0
      },
      "weighted avg": {
        "precision": 1.0,
        "recall": 1.0,
        "f1-score": 1.0,
        "support": 3000.0
      }
    },
    "ordinal_mae": 0.0,
    "off_by_two_rate": 0.0
  },
  "per_task": {
    "confidence_bin": {
      "rows": 3000,
      "accuracy": 1.0,
      "macro_f1": 1.0,
      "weighted_f1": 1.0,
      "micro_f1": 1.0,
      "labels": [
        "high",
        "low",
        "medium"
      ],
      "confusion_matrix": [
        [
          1000,
          0,
          0
        ],
        [
          0,
          1000,
          0
        ],
        [
          0,
          0,
          1000
        ]
      ],
      "wrong_task_predictions": 0,
      "wrong_task_rate": 0.0
    }
  }
}