Update README.md

34190e6 verified about 1 year ago

3.82 kB

	---
	license: mit
	language:
	- en
	base_model:
	- FacebookAI/roberta-base
	tags:
	- psychology
	- social-science
	---

	# AttributioNet: A Fine-Tuned RoBERTa Model for Attribution Classification

	## Overview
	This repository contains a fine-tuned [RoBERTa](https://huggingface.co/roberta-base) model designed for multi-label classification of attributions for self and others. The model predicts four attribution categories (in this order):

	- Self-Dispositional
	- Self-Situational
	- Other-Dispositional
	- Other-Situational

	The training process and evaluation results, including calibration, loss curves, and ROC curves, are documented below.

	## Model Details
	- Base Model: roberta-base
	- Fine-Tuning Approach: Multi-label classification
	- Number of Labels: 4
	- Loss Function: Binary Cross-Entropy with Logits (BCEWithLogitsLoss)
	- Optimizer: AdamW
	- Batch Size: 16
	- Learning Rate: 2e-5
	- Epochs: 3

	## Dataset
	The dataset consists of ~217,000 sentences labeled with attributions. Labels are provided as binary indicators for each category. The data was split into:
	- Training Set: 60%
	- Validation Set: 20%
	- Test Set: 20%

	## Training & Evaluation
	### Performance Metrics
	- Overall ROC AUC Score: 0.9439
	- Overall PR AUC Score: 0.8479
	- Per-Class Performance:
	- Self-Dispositional: ROC AUC: 0.9643, PR AUC: 0.8062
	- Self-Situational: ROC AUC: 0.9534, PR AUC: 0.8871
	- Other-Dispositional: ROC AUC: 0.9421, PR AUC: 0.8771
	- Other-Situational: ROC AUC: 0.9159, PR AUC: 0.8211

	### Evaluation Metrics
	- Classification report (saved as `classification_report.csv`)
	- Calibration curve (`calibration_curve.png`)
	- ROC curves (`per_class_roc_curves.png`)
	- Training and validation loss (`learning_curve.png`)
	- Training loss progression (`training_loss_plot.png`)

	## Usage
	### Installation
	Ensure you have `transformers`, `datasets`, and `torch` installed:
	```bash
	pip install transformers datasets torch
	```

	### Loading the Model
	You can load the model and tokenizer using the `transformers` library:
	```python
	from transformers import RobertaTokenizer, RobertaForSequenceClassification

	tokenizer = RobertaTokenizer.from_pretrained("ryanboyd/AttributioNet")
	model = RobertaForSequenceClassification.from_pretrained("ryanboyd/AttributioNet")
	```

	Alternatively, you can use a custom-made Python package, `blamegame`, to easily download/deploy this model to individual texts or batch process CSV files.
	For more information, see: [https://pypi.org/project/blamegame/](https://pypi.org/project/blamegame/)

	### Inference
	```python
	def predict(text):
	inputs = tokenizer(text, return_tensors="pt", padding=True, truncation=True, max_length=128)
	outputs = model(**inputs)
	probs = torch.sigmoid(outputs.logits).detach().numpy()
	predictions = (probs > 0.5).astype(int)
	return predictions

	sample_text = "The situation was beyond my control."
	predictions = predict(sample_text)
	print(predictions) # Binary labels for each class
	```

	## Fine-Tuning Details
	The fine-tuning process was carried out using the Hugging Face `Trainer` API with custom modifications:
	- Custom loss function for multi-label classification
	- Per-class F1-score computation for evaluation
	- Model checkpointing based on best macro F1-score

	Training was performed using the following command:
	```python
	trainer.train()
	```

	## Results Visualization
	### Calibration Curve
	![Calibration Curve](calibration_curve.png)

	### Training vs Validation Loss
	![Learning Curve](learning_curve.png)

	### Per-Class ROC Curves
	![ROC Curves](per_class_roc_curves.png)

	### Training Loss Over Time
	![Training Loss](training_loss_plot.png)


	## Citation
	If you use this model, please cite this work appropriately. An official citation will be coming soon.