Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,55 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
tags:
|
| 3 |
+
- text-classification
|
| 4 |
+
- bert
|
| 5 |
+
- financial-sentiment
|
| 6 |
+
- natural-language-processing
|
| 7 |
+
datasets:
|
| 8 |
+
- FinancialSentimentAnalysis_NewsHeadlines
|
| 9 |
+
license: apache-2.0
|
| 10 |
+
model-index:
|
| 11 |
+
- name: FinancialSentimentClassifier_NewsArticles
|
| 12 |
+
results:
|
| 13 |
+
- task:
|
| 14 |
+
name: Text Classification
|
| 15 |
+
type: text-classification
|
| 16 |
+
metrics:
|
| 17 |
+
- type: accuracy
|
| 18 |
+
value: 0.915
|
| 19 |
+
name: Accuracy
|
| 20 |
+
- type: f1
|
| 21 |
+
value: 0.908
|
| 22 |
+
name: F1 Score
|
| 23 |
+
---
|
| 24 |
+
|
| 25 |
+
# FinancialSentimentClassifier_NewsArticles
|
| 26 |
+
|
| 27 |
+
## 📰 Overview
|
| 28 |
+
|
| 29 |
+
This model is a fine-tuned **BERT (Bidirectional Encoder Representations from Transformers)** model specialized for classifying the sentiment of financial news headlines into three distinct categories: **POSITIVE**, **NEGATIVE**, and **NEUTRAL**. It is built to support automated monitoring of market-moving news and quantifying the immediate impact of reports on specific entities.
|
| 30 |
+
|
| 31 |
+
## 🧠 Model Architecture
|
| 32 |
+
|
| 33 |
+
The architecture is based on the robust `bert-base-uncased` checkpoint, fine-tuned for sequence classification.
|
| 34 |
+
|
| 35 |
+
* **Base Model:** `bert-base-uncased`
|
| 36 |
+
* **Classification Head:** A standard linear layer is added on top of the pooled output of BERT (the `[CLS]` token representation).
|
| 37 |
+
* **Input Data:** Financial news headlines (English text).
|
| 38 |
+
* **Output:** Logits corresponding to the three classes: Negative (0), Neutral (1), Positive (2).
|
| 39 |
+
* **Training Dataset:** A proprietary dataset, **FinancialSentimentAnalysis_NewsHeadlines**, featuring headlines, entity mentions, and manually verified sentiment labels.
|
| 40 |
+
|
| 41 |
+
## 🎯 Intended Use
|
| 42 |
+
|
| 43 |
+
This model is primarily intended for research and industrial applications requiring high-precision financial text analysis:
|
| 44 |
+
|
| 45 |
+
1. **Automated Trading Signal Generation:** Classifying news sentiment to inform algorithmic trading strategies.
|
| 46 |
+
2. **Entity-Level Sentiment Tracking:** Monitoring sentiment specifically linked to companies or commodities mentioned in the headline (`Entity_Mentioned`).
|
| 47 |
+
3. **Risk Management:** Identifying sudden shifts in negative sentiment towards a portfolio.
|
| 48 |
+
4. **Academic Research:** Studying the correlation between news sentiment and market volatility.
|
| 49 |
+
|
| 50 |
+
## ⚠️ Limitations
|
| 51 |
+
|
| 52 |
+
1. **Scope:** The model is highly specialized for *financial* news language. Performance may degrade when applied to general domain text (e.g., social media or political articles).
|
| 53 |
+
2. **Subtlety:** It may struggle with complex, nuanced, or ironic financial reporting where the sentiment is not explicitly stated but implied through market context (e.g., "analysts remain cautious").
|
| 54 |
+
3. **Language:** Trained exclusively on English headlines.
|
| 55 |
+
4. **Recency:** Its vocabulary and context were established up to its training date. New economic terminology or highly unique events may pose a challenge.
|