Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -51,11 +51,11 @@ The model predicts which of the following 14 patterns the subject model was trai
|
|
| 51 |
|
| 52 |
## Test Set Performance
|
| 53 |
|
| 54 |
-
- **F1 Macro**: 0.
|
| 55 |
-
- **F1 Micro**: 0.
|
| 56 |
-
- **Hamming Accuracy**: 0.
|
| 57 |
-
- **Exact Match Accuracy**: 0.
|
| 58 |
-
- **BCE Loss**: 0.
|
| 59 |
|
| 60 |
### Per-Pattern Accuracy (Test Set)
|
| 61 |
|
|
@@ -63,20 +63,20 @@ When a model was trained on a pattern, what % of the time does the classifier de
|
|
| 63 |
|
| 64 |
| Pattern | Recall (Detection Rate) |
|
| 65 |
|---------|-------------------------|
|
| 66 |
-
| palindrome |
|
| 67 |
-
| sorted_ascending |
|
| 68 |
-
| sorted_descending |
|
| 69 |
-
| alternating |
|
| 70 |
-
| contains_abc |
|
| 71 |
-
| starts_with |
|
| 72 |
-
| ends_with |
|
| 73 |
-
| no_repeats |
|
| 74 |
-
| has_majority |
|
| 75 |
-
| increasing_pairs |
|
| 76 |
-
| decreasing_pairs |
|
| 77 |
-
| vowel_consonant |
|
| 78 |
-
| first_last_match |
|
| 79 |
-
| mountain_pattern |
|
| 80 |
|
| 81 |
## Usage
|
| 82 |
|
|
|
|
| 51 |
|
| 52 |
## Test Set Performance
|
| 53 |
|
| 54 |
+
- **F1 Macro**: 0.1601
|
| 55 |
+
- **F1 Micro**: 0.1771
|
| 56 |
+
- **Hamming Accuracy**: 0.8305
|
| 57 |
+
- **Exact Match Accuracy**: 0.0843
|
| 58 |
+
- **BCE Loss**: 0.5038
|
| 59 |
|
| 60 |
### Per-Pattern Accuracy (Test Set)
|
| 61 |
|
|
|
|
| 63 |
|
| 64 |
| Pattern | Recall (Detection Rate) |
|
| 65 |
|---------|-------------------------|
|
| 66 |
+
| palindrome | 25.4% |
|
| 67 |
+
| sorted_ascending | 24.4% |
|
| 68 |
+
| sorted_descending | 28.4% |
|
| 69 |
+
| alternating | 36.2% |
|
| 70 |
+
| contains_abc | 26.6% |
|
| 71 |
+
| starts_with | 13.7% |
|
| 72 |
+
| ends_with | 36.8% |
|
| 73 |
+
| no_repeats | 22.2% |
|
| 74 |
+
| has_majority | 6.5% |
|
| 75 |
+
| increasing_pairs | 22.7% |
|
| 76 |
+
| decreasing_pairs | 39.1% |
|
| 77 |
+
| vowel_consonant | 0.0% |
|
| 78 |
+
| first_last_match | 14.0% |
|
| 79 |
+
| mountain_pattern | 26.2% |
|
| 80 |
|
| 81 |
## Usage
|
| 82 |
|