Mean performance of AudioProtoPNet models with one, five, ten, and twenty prototypes per class for the validation dataset POW and the seven test datasets, averaged over five different random seeds. The 'Score' column is the average of the respective metric across the seven test datasets (the validation set POW is excluded). Best values for each metric are **bolded**. While the models with five, ten, and twenty prototypes per class performed similarly, the model with only one prototype per class showed slightly lower performance.

| Model                | Metric  | POW      | PER      | NES      | UHH      | HSN      | NBP      | SSW      | SNE      | Score    |
|----------------------|---------|----------|----------|----------|----------|----------|----------|----------|----------|----------|
| AudioProtoPNet-1     | cmAP    | 0.49     | **0.30** | 0.36     | 0.28     | 0.50     | 0.66     | 0.40     | 0.32     | 0.40     |
|                      | AUROC   | 0.88     | 0.79     | 0.92     | 0.85     | 0.91     | 0.92     | 0.96     | 0.84     | 0.88     |
|                      | T1-Acc  | **0.87** | 0.59     | 0.49     | 0.42     | 0.64     | 0.71     | 0.64     | 0.70     | 0.60     |
| AudioProtoPNet-5     | cmAP    | **0.50** | **0.30** | **0.38** | **0.31** | **0.54** | **0.68** | 0.42     | 0.33     | **0.42** |
|                      | AUROC   | 0.88     | 0.79     | 0.93     | **0.87** | **0.92** | **0.93** | **0.97** | **0.88** | **0.90** |
|                      | T1-Acc  | 0.84     | 0.59     | **0.52** | **0.49** | **0.65** | 0.71     | 0.66     | 0.74     | **0.62** |
| AudioProtoPNet-10    | cmAP    | **0.50** | **0.30** | **0.38** | 0.30     | **0.54** | **0.68** | 0.42     | **0.34** | **0.42** |
|                      | AUROC   | 0.88     | **0.80** | **0.94** | 0.86     | **0.92** | **0.93** | **0.97** | 0.86     | **0.90** |
|                      | T1-Acc  | 0.85     | 0.59     | **0.52** | 0.47     | 0.64     | **0.72** | 0.67     | 0.74     | **0.62** |
| AudioProtoPNet-20    | cmAP    | **0.50** | **0.30** | **0.38** | **0.31** | **0.54** | **0.68** | **0.43** | 0.33     | **0.42** |
|                      | AUROC   | **0.89** | **0.80** | **0.94** | 0.86     | **0.92** | **0.93** | **0.97** | 0.87     | **0.90** |
|                      | T1-Acc  | **0.87** | **0.60** | **0.52** | 0.42     | **0.65** | **0.72** | **0.68** | **0.75** | **0.62** |
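As a quick sanity check on how the 'Score' column is computed, averaging the seven test-set values of a row (POW, the validation set, is excluded) recovers the reported score. Plain Python, using the AudioProtoPNet-20 T1-Acc row from the table above:

```python
# T1-Acc of AudioProtoPNet-20 on the seven test datasets (POW excluded).
t1_acc = {"PER": 0.60, "NES": 0.52, "UHH": 0.42, "HSN": 0.65,
          "NBP": 0.72, "SSW": 0.68, "SNE": 0.75}

# 'Score' = unweighted mean of the metric over the test datasets.
score = sum(t1_acc.values()) / len(t1_acc)
print(round(score, 2))  # → 0.62, matching the table
```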

**Table 2: Comparative Performance of AudioProtoPNet, ConvNeXt, and Perch**

Mean performance of AudioProtoPNet-5, ConvNeXt, and Perch for the validation dataset POW and the seven test datasets, averaged over five different random seeds. The 'Score' column is the average of the respective metric across the seven test datasets. Best values for each metric are **bolded**. AudioProtoPNet-5 notably outperformed both Perch and ConvNeXt in terms of cmAP, AUROC, and top-1 accuracy scores.

| Model | Metric | POW  | PER  | NES      | UHH      | HSN  | NBP  | SSW  | SNE  | Score |
|-------|--------|------|------|----------|----------|------|------|------|------|-------|
| …     | …      | …    | …    | …        | …        | …    | …    | …    | …    | …     |
|       | AUROC  | 0.84 | 0.70 | 0.90     | 0.76     | 0.86 | 0.91 | 0.91 | 0.83 | 0.84  |
|       | T1-Acc | 0.85 | 0.48 | **0.66** | **0.57** | 0.58 | 0.69 | 0.62 | 0.69 | 0.61  |

## Example

This model can be easily loaded and used for inference with the `transformers` library.
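A minimal sketch of what such inference could look like. The checkpoint id `your-org/audioprotopnet-5`, the 32 kHz input sample rate, the use of `AutoModelForAudioClassification`, and the need for `trust_remote_code=True` are all placeholder assumptions, not confirmed by this README — substitute the values given on this model card:

```python
import numpy as np

# Placeholders for illustration; take the real values from the model card.
MODEL_ID = "your-org/audioprotopnet-5"  # hypothetical checkpoint id
SAMPLE_RATE = 32_000                    # assumed input sample rate

def classify_clip(waveform: np.ndarray, top_k: int = 5):
    """Return the top-k (label, probability) pairs for one mono audio clip."""
    # Imported lazily so the sketch can be read without these packages installed.
    import torch
    from transformers import AutoFeatureExtractor, AutoModelForAudioClassification

    extractor = AutoFeatureExtractor.from_pretrained(MODEL_ID, trust_remote_code=True)
    model = AutoModelForAudioClassification.from_pretrained(MODEL_ID, trust_remote_code=True)
    model.eval()

    inputs = extractor(waveform, sampling_rate=SAMPLE_RATE, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits.squeeze(0)

    # Bird-sound tagging is multi-label, so score classes with independent
    # sigmoids rather than a softmax over classes.
    probs = torch.sigmoid(logits)
    top = torch.topk(probs, k=top_k)
    return [(model.config.id2label[i.item()], p.item())
            for p, i in zip(top.values, top.indices)]

# Example call (downloads the checkpoint, so it needs network access):
# clip = np.random.randn(5 * SAMPLE_RATE).astype(np.float32)  # stand-in 5 s clip
# print(classify_clip(clip))
```

If the repository ships custom model code, `trust_remote_code=True` is required when loading; otherwise it can be dropped.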