---
language:
- en
license: mit
library_name: transformers
tags:
- finance
- sentiment
- finbert
- multi-task-learning
datasets:
- financial_phrasebank
- zeroshot/twitter-financial-news-sentiment
- TheFinAI/fiqa-sentiment-classification
metrics:
- accuracy
- f1
base_model: ProsusAI/finbert
widget:
- text: "The company reported a record 20% increase in revenue this quarter."
  example_title: "Strong Earnings"
- text: "Analysts are worried about the looming debt crisis."
  example_title: "Negative Outlook"
---

# Financial-Sentiment-LLM (Multi-Task FinBERT)

A production-ready financial sentiment classifier fine-tuned on **FinBERT**. This model uses a **Multi-Task Architecture** (classification + regression) to achieve state-of-the-art performance across diverse financial text sources, including professional news, social media, and forum discussions.

- **🚀 Live Demo:** [Hugging Face Space](https://huggingface.co/spaces/pmatorras/financial-sentiment-demo)
- **💻 Source Code:** [GitHub Repository](https://github.com/pmatorras/financial-sentiment-llm)
- **👨‍💻 Author:** [Pablo Matorras-Cuevas](https://pablo.matorras.com/)

## Model Performance

This Multi-Task model achieves **85.4% overall accuracy**, significantly outperforming standard baselines, particularly on noisy social media data.

| Metric / Dataset | **FinBERT (Multi-Task)** | FinBERT (LoRA) |
| :--- | :--- | :--- |
| **Overall Accuracy** | **85.4%** | 83.2% |
| **Macro F1-Score** | **0.83** | 0.80 |
| **Financial PhraseBank** (News) | 95.9% | **97.1%** |
| **Twitter Financial News** | **83.3%** | 80.5% |
| **FiQA** (Forums) | **81.5%** | 72.6% |

> **Note:** For edge deployment or low-memory environments, check out the [LoRA version](https://huggingface.co/pmatorras/financial-sentiment-analysis-lora), which reduces storage by 99% (5 MB vs. 420 MB).

|
## Architecture

Unlike standard sentiment classifiers, this model shares a `bert-base` backbone between two task-specific heads:

1. **Classification Head:** Predicts `Negative`/`Neutral`/`Positive` (optimized for News & Twitter).
2. **Regression Head:** Predicts a continuous sentiment score (optimized for FiQA forum discussions).

This approach yielded a **+6.1% accuracy boost** on Twitter data compared to single-task training, suggesting that learning continuous sentiment intensity helps the model handle noisy social text.
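
As a rough sketch, the two heads over a shared pooled encoder output might look like the following (hidden size, layer names, and the omitted FinBERT backbone are illustrative assumptions, not the released weights):

```python
import torch
import torch.nn as nn

class MultiTaskHeads(nn.Module):
    """Illustrative pair of task-specific heads over a shared encoder.

    The shared FinBERT backbone that produces `pooled` is omitted;
    shapes and names are assumptions for illustration only.
    """
    def __init__(self, hidden_size: int = 768, num_labels: int = 3):
        super().__init__()
        self.classifier = nn.Linear(hidden_size, num_labels)  # Negative/Neutral/Positive logits
        self.regressor = nn.Linear(hidden_size, 1)            # continuous sentiment score

    def forward(self, pooled: torch.Tensor):
        logits = self.classifier(pooled)            # shape: (batch, num_labels)
        score = self.regressor(pooled).squeeze(-1)  # shape: (batch,)
        return logits, score
```

Both heads see the same representation, so gradients from the regression task also shape the shared backbone — the mechanism behind the cross-task gains described above.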
|
## Usage

You can use this model directly with the Hugging Face `pipeline` or `AutoModel` classes:

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification
import torch

# Load the model and tokenizer
model_name = "pmatorras/financial-sentiment-multi-task"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name)

# Inference
text = "The stock market rally is driven by strong tech earnings."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

probabilities = torch.nn.functional.softmax(outputs.logits, dim=-1)
print(probabilities)
```
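
To turn the probabilities into a label, map the argmax index through the model's label ordering. The snippet below uses a hypothetical probability tensor and assumes the `Negative`/`Neutral`/`Positive` ordering listed in the Architecture section — always confirm the actual mapping via `model.config.id2label`:

```python
import torch

# Hypothetical probabilities standing in for the softmax output above
probabilities = torch.tensor([[0.08, 0.12, 0.80]])

# Assumed ordering; verify against model.config.id2label before relying on it
labels = ["negative", "neutral", "positive"]
predicted = labels[probabilities.argmax(dim=-1).item()]
print(predicted)  # -> positive
```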

## Training Details

- **Base Model:** ProsusAI/finbert
- **Optimizer:** AdamW
- **Loss Function:** Weighted sum of Cross-Entropy (classification) and MSE (regression)
- **Compute:** Trained on an NVIDIA RTX 4050
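
The combined objective can be sketched as follows. The weighting coefficient `alpha` and the dummy batch are assumptions for illustration; the actual weight used in training is not stated here:

```python
import torch
import torch.nn.functional as F

alpha = 0.5  # assumed task weighting, not the published training value

# Dummy batch: classification logits/labels and regression predictions/targets
cls_logits = torch.randn(4, 3)
cls_labels = torch.tensor([0, 1, 2, 1])       # Negative/Neutral/Positive ids
reg_preds = torch.randn(4)
reg_targets = torch.empty(4).uniform_(-1, 1)  # continuous sentiment scores

# Weighted sum of Cross-Entropy (classification) and MSE (regression)
loss = (alpha * F.cross_entropy(cls_logits, cls_labels)
        + (1 - alpha) * F.mse_loss(reg_preds, reg_targets))
print(loss.item())
```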