maximuspowers commited on
Commit
5fe2481
·
verified ·
1 Parent(s): f079423

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +19 -19
README.md CHANGED
@@ -51,11 +51,11 @@ The model predicts which of the following 14 patterns the subject model was trai
51
 
52
  ## Test Set Performance
53
 
54
- - **F1 Macro**: 0.1378
55
- - **F1 Micro**: 0.1508
56
- - **Hamming Accuracy**: 0.5746
57
- - **Exact Match Accuracy**: 0.0000
58
- - **BCE Loss**: 0.6393
59
 
60
  ### Per-Pattern Accuracy (Test Set)
61
 
@@ -63,20 +63,20 @@ When a model was trained on a pattern, what % of the time does the classifier de
63
 
64
  | Pattern | Recall (Detection Rate) |
65
  |---------|-------------------------|
66
- | palindrome | 57.1% |
67
- | sorted_ascending | 77.9% |
68
- | sorted_descending | 50.7% |
69
- | alternating | 71.0% |
70
- | contains_abc | 74.7% |
71
- | starts_with | 54.9% |
72
- | ends_with | 63.2% |
73
- | no_repeats | 44.4% |
74
- | has_majority | 22.6% |
75
- | increasing_pairs | 47.0% |
76
- | decreasing_pairs | 29.7% |
77
- | vowel_consonant | 11.8% |
78
- | first_last_match | 40.0% |
79
- | mountain_pattern | 30.8% |
80
 
81
  ## Usage
82
 
 
51
 
52
  ## Test Set Performance
53
 
54
+ - **F1 Macro**: 0.1601
55
+ - **F1 Micro**: 0.1771
56
+ - **Hamming Accuracy**: 0.8305
57
+ - **Exact Match Accuracy**: 0.0843
58
+ - **BCE Loss**: 0.5038
59
 
60
  ### Per-Pattern Accuracy (Test Set)
61
 
 
63
 
64
  | Pattern | Recall (Detection Rate) |
65
  |---------|-------------------------|
66
+ | palindrome | 25.4% |
67
+ | sorted_ascending | 24.4% |
68
+ | sorted_descending | 28.4% |
69
+ | alternating | 36.2% |
70
+ | contains_abc | 26.6% |
71
+ | starts_with | 13.7% |
72
+ | ends_with | 36.8% |
73
+ | no_repeats | 22.2% |
74
+ | has_majority | 6.5% |
75
+ | increasing_pairs | 22.7% |
76
+ | decreasing_pairs | 39.1% |
77
+ | vowel_consonant | 0.0% |
78
+ | first_last_match | 14.0% |
79
+ | mountain_pattern | 26.2% |
80
 
81
  ## Usage
82