Update README.md
README.md
@@ -45,33 +45,35 @@ for i, sentence in enumerate(sentences, 1):
```

## Benchmark Results

| Dataset | Model | Accuracy | F1-Score | Precision | Recall | ROC-AUC |
|---------|-------|----------|----------|-----------|--------|---------|
| FIQA | ModernFinBERT | **0.80** | **0.61** | **0.64** | **0.88** | **0.96** |
| FIQA | distilroberta_financial | *0.54* | *0.47* | 0.61 | *0.71* | 0.71 |
| FIQA | finbert | 0.48 | 0.43 | 0.59 | 0.66 | 0.76 |
| FIQA | finbert-tone | 0.36 | 0.36 | *0.62* | 0.58 | 0.77 |
| FIQA | roberta_sentiment | 0.36 | 0.35 | 0.60 | 0.58 | *0.89* |
| Twitter | ModernFinBERT | 0.71 | *0.70* | *0.68* | **0.81** | **0.94** |
| Twitter | distilroberta_financial | *0.75* | **0.71** | *0.68* | *0.75* | *0.87* |
| Twitter | finbert-tone | 0.75 | 0.66 | **0.68** | 0.64 | 0.83 |
| Twitter | finbert | 0.73 | 0.67 | 0.65 | 0.70 | 0.86 |
| Twitter | roberta_sentiment | 0.70 | 0.61 | 0.63 | 0.60 | 0.82 |
| JeanBaptiste | ModernFinBERT | 0.74 | 0.58 | 0.71 | 0.56 | 0.84 |
| JeanBaptiste | distilroberta_financial | **0.88** | **0.79** | **0.92** | **0.74** | 0.86 |
| JeanBaptiste | finbert | *0.77* | *0.68* | 0.70 | *0.67* | **0.88** |
| JeanBaptiste | finbert-tone | 0.74 | 0.60 | 0.72 | 0.56 | *0.86* |
| JeanBaptiste | roberta_sentiment | 0.70 | 0.55 | *0.79* | 0.51 | 0.83 |

## Model Averages Across All Datasets

| Model | Accuracy | F1-Score | Precision | Recall | ROC-AUC |
|-------|----------|----------|-----------|--------|---------|
| **ModernFinBERT** | **0.75** | *0.63* | 0.68 | **0.75** | **0.91** |
| distilroberta_financial | *0.73* | **0.66** | **0.73** | *0.73* | 0.82 |
| finbert | 0.66 | 0.59 | 0.65 | 0.68 | *0.84* |
| finbert-tone | 0.62 | 0.54 | *0.68* | 0.59 | 0.82 |
| roberta_sentiment | 0.59 | 0.50 | 0.67 | 0.56 | *0.84* |

### Legend:

**Bold** = Best result per metric per dataset

*Italic* = Second best result per metric per dataset