---
language: en
license: apache-2.0
tags:
- finance
- sentiment-analysis
- text-classification
- llama
- qlora
pipeline_tag: text-classification
library_name: transformers
---

# Llama Sentiment Classifier (QLoRA Fine-Tuned)

This repository provides a **LLaMA-based financial sentiment classifier** fine-tuned using **QLoRA** for **3-class sentiment classification** on finance-domain text. The model is designed for downstream applications including **sentiment-driven alpha signal generation** and market-neutral trading strategies.

---

## Model Summary

- **Backbone**: Meta-LLaMA3-8B
- **Task**: Financial sentiment classification (3 classes)
- **Fine-tuning method**: **QLoRA** (4-bit quantization + LoRA adapters)
- **Training framework**: **SWIFT** (Scalable lightWeight Infrastructure for Fine-Tuning)
- **Architecture**: LlamaForSequenceClassification

---

## Labels

This model performs **single-label classification** into 3 discrete classes.

In the paper, the unified sentiment label space is:

- **-1** = Negative
- **0** = Neutral
- **1** = Positive

> Note: your Hugging Face config may show `LABEL_0`, `LABEL_1`, `LABEL_2`.
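If the hosted config reports generic `LABEL_*` names, a small translation table recovers the unified space. The correspondence below (`LABEL_0` → -1, `LABEL_1` → 0, `LABEL_2` → 1) is an assumption; confirm it against the `id2label` field in the repository's `config.json` before relying on it.

```python
# Assumed correspondence between generic config labels and the unified
# sentiment space; verify against id2label in config.json.
ID2SENTIMENT = {"LABEL_0": -1, "LABEL_1": 0, "LABEL_2": 1}

def to_unified_label(hf_label: str) -> int:
    """Translate a pipeline label string into the paper's {-1, 0, 1} space."""
    return ID2SENTIMENT[hf_label]
```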

---

## Datasets

### Part 1 — Fine-tuning datasets (4 sources), available at `Romeo777777/FinLlaMa_Training_Dataset`


Training uses 4 finance-domain sentiment datasets:

1. **Financial PhraseBank v1.0**
   - 4,840 manually annotated sentences
   - 3-class sentiment (positive/neutral/negative)

2. **NASDAQ News Sentiment**
   - synthetic sentiment dataset generated using GPT-4o
   - labeled into positive/neutral/negative

3. **Twitter Financial News Sentiment**
   - 11,932 finance-related tweets
   - original label encoding: 0 bearish, 1 bullish, 2 neutral

4. **FIQA2018 (FiQA)**
   - financial opinion mining dataset
   - sentiment score in [-1, 1] (continuous), then discretized

---

### Part 2 — Real-world news evaluation dataset

For downstream evaluation and trading strategy experiments, the paper uses a news + price dataset covering:

- **502 S&P 500 companies**
- **~77,000 news headlines**
- time range: **2024-01-01 to 2025-05-30**

---

## Preprocessing

### Label normalization (unified mapping)

All datasets are standardized into the unified label space {-1, 0, 1}.

Mapping rules:

- **Financial PhraseBank**:
  `"negative" → -1`, `"neutral" → 0`, `"positive" → 1`

- **FIQA2018** (score in [-1, 1]):
  `< -0.2 → -1`, `> 0.2 → 1`, otherwise `0`

- **Twitter Financial Sentiment**:
  `0 → -1`, `2 → 0`, `1 → 1`

- **NASDAQ News** (score in [0, 5]):
  `<= 1 → -1`, `>= 4 → 1`, otherwise `0`
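The mapping rules above can be sketched in plain Python, one normalizer per source dataset (function names are illustrative, not part of any released code):

```python
def normalize_phrasebank(label: str) -> int:
    """Financial PhraseBank: string labels to {-1, 0, 1}."""
    return {"negative": -1, "neutral": 0, "positive": 1}[label]

def normalize_fiqa(score: float) -> int:
    """FiQA: continuous score in [-1, 1], discretized with a +/-0.2 dead zone."""
    if score < -0.2:
        return -1
    if score > 0.2:
        return 1
    return 0

def normalize_twitter(label: int) -> int:
    """Twitter: original encoding 0=bearish, 1=bullish, 2=neutral."""
    return {0: -1, 1: 1, 2: 0}[label]

def normalize_nasdaq(score: float) -> int:
    """NASDAQ News: score in [0, 5], thresholded at 1 and 4."""
    if score <= 1:
        return -1
    if score >= 4:
        return 1
    return 0
```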

### Tokenization

Uses the official **LLaMA 3 tokenizer**.

---

## Training Method

### QLoRA fine-tuning

The model is fine-tuned with **QLoRA**, which keeps the backbone weights in 4-bit quantized form and trains LoRA adapters in higher precision.

The paper's QLoRA setup includes NF4 quantization + double quantization for memory efficiency.

### SWIFT training framework

Fine-tuning is orchestrated using **SWIFT**, a modular training framework that simplifies adapter integration and efficient training for quantized models.
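For readers reproducing the setup outside SWIFT, the NF4 + double-quantization configuration described above maps roughly onto the Hugging Face `BitsAndBytesConfig` API as follows. This is a sketch under stated assumptions (compute dtype is not given in the card), not the paper's actual SWIFT invocation:

```python
import torch
from transformers import BitsAndBytesConfig

# QLoRA-style quantization: 4-bit NF4 backbone weights with double
# quantization, as described in the training method above.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # keep backbone weights in 4-bit
    bnb_4bit_quant_type="nf4",              # NF4 quantization
    bnb_4bit_use_double_quant=True,         # double quantization for extra savings
    bnb_4bit_compute_dtype=torch.bfloat16,  # assumed higher-precision compute dtype
)
```

The config would then be passed as `quantization_config=bnb_config` when loading the backbone with `LlamaForSequenceClassification.from_pretrained`.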

---

## Hyperparameters (paper)

- LoRA rank **r = 16**
- LoRA alpha **= 32**
- LoRA dropout **= 0.1**
- learning rate **1e-4**, optimizer **AdamW**
- batch size **8**, epochs **1**
- gradient accumulation **4**
- gradient checkpointing enabled
- cosine learning rate schedule
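The hyperparameters above can be expressed with the `peft` and `transformers` APIs roughly as follows. This is a hedged sketch: the paper trains via SWIFT, and unlisted settings (target modules, output directory) are assumptions:

```python
from peft import LoraConfig
from transformers import TrainingArguments

# LoRA adapter settings matching the list above.
lora_config = LoraConfig(
    r=16,                 # LoRA rank
    lora_alpha=32,
    lora_dropout=0.1,
    task_type="SEQ_CLS",  # sequence classification head
)

# Optimization settings matching the list above.
training_args = TrainingArguments(
    output_dir="out",     # placeholder path
    learning_rate=1e-4,
    optim="adamw_torch",
    per_device_train_batch_size=8,
    num_train_epochs=1,
    gradient_accumulation_steps=4,
    gradient_checkpointing=True,
    lr_scheduler_type="cosine",
)
```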

---

## Evaluation Results (paper)

On a test set of **9,064** finance news samples, the fine-tuned model reports:

- **Accuracy**: **92.18%**
- **Micro-F1**: **0.9218**
- **Macro-F1**: **0.5787**

Class-wise results are strong for the **Positive** and **Neutral** classes but weaker for **Negative**, reflecting class imbalance in the data.
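The gap between micro-F1 and macro-F1 is exactly what class imbalance produces: for single-label classification micro-F1 equals accuracy, while macro-F1 averages per-class F1 without weighting, so a poorly predicted minority class drags it down. A small self-contained illustration on toy data (not the paper's test set):

```python
def f1_per_class(y_true, y_pred, label):
    """F1 score for one class, computed from raw counts."""
    tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
    fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
    fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def macro_f1(y_true, y_pred, labels=(-1, 0, 1)):
    """Unweighted mean of per-class F1: sensitive to minority classes."""
    return sum(f1_per_class(y_true, y_pred, l) for l in labels) / len(labels)

def micro_f1(y_true, y_pred):
    """For single-label classification, micro-F1 reduces to accuracy."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Toy imbalanced sample: one Negative example, and it is misclassified.
y_true = [1, 1, 1, 1, 1, 1, 0, 0, 0, -1]
y_pred = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0]
print(micro_f1(y_true, y_pred))            # 0.9
print(round(macro_f1(y_true, y_pred), 2))  # 0.62
```

Nine of ten predictions are correct, so micro-F1 is high, but the Negative class scores an F1 of 0, pulling macro-F1 far below it, mirroring the 0.9218 vs 0.5787 split reported above.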

---

## Usage

```python
from transformers import pipeline

clf = pipeline(
    "text-classification",
    model="Romeo777777/Llama_Sentiment_Classifier",
    tokenizer="Romeo777777/Llama_Sentiment_Classifier",
    top_k=None,  # return scores for all 3 classes (replaces deprecated return_all_scores=True)
)

print(clf("Bitcoin is pumping hard today!"))
```