---
language: en
license: apache-2.0
library_name: transformers
tags:
- text-classification
- personal-narrative
- political-discourse
- computational-social-science
- websci25
datasets:
- custom-reddit-dataset
base_model: falkne/storytelling-LM-europarl-mixed-en
---

# Personal Narrative Classifier (WebSci'25)

This is the official repository for the text classification model presented in the paper: **"Personal Narratives Empower Politically Disinclined Individuals to Engage in Political Discussions"**, which received a Best Paper Honorable Mention at the 17th ACM Web Science Conference (WebSci'25).

The model is a fine-tuned BERT-based classifier (`falkne/storytelling-LM-europarl-mixed-en`) designed to identify personal narratives in online comments.

## Model Description

This model classifies a given text as either a "Personal Narrative" or "Not a Personal Narrative". It was developed to support a large-scale computational analysis of how personal stories affect engagement in online political discussions on Reddit.

- **Label 0**: Not a Personal Narrative
- **Label 1**: Personal Narrative

## Intended Uses & Limitations

### Intended Use

This model is intended for researchers in computational social science, political science, communication, and HCI to study online discourse. It can be used to:
- Quantify the use of personal narratives in various online communities.
- Analyze the reception and impact of story-based arguments.
- Replicate and extend the findings of the original paper.

### Limitations

As noted in the paper, this model has several limitations:
- The training and evaluation data comes from political subreddits on Reddit from 2020-2021. Its performance may vary on other platforms or time periods.
- The definition of "political activity" was based on subreddit engagement, which may not capture all forms of political interest.
- The model does not analyze the content or veracity of the narratives. Personal narratives can also be used to spread misinformation, which is an avenue for future research.

## How to Use

You can use this model with the `transformers` library pipeline for easy inference.

```python
from transformers import pipeline

repo_id = "tejasvichebrolu/personal-narrative-classifier"
classifier = pipeline("text-classification", model=repo_id)

# Example texts
narrative_text = "I’m in Alabama and oh my god it was so humid yesterday. I was so unproductive from how bad it was."
non_narrative_text = "The most straightforward solution is to encourage others to engage with politics online."

# Get predictions
results = classifier([narrative_text, non_narrative_text])
for text, result in zip([narrative_text, non_narrative_text], results):
    print(f"Text: {text}")
    # The pipeline may return LABEL_0/LABEL_1 or the names from the config
    print(f"  -> Prediction: {result['label']}, Score: {result['score']:.4f}\n")
```

## Training and Evaluation

The model was fine-tuned on a dataset of 2,000 manually labeled Reddit comments. It achieved a macro average F1-score of **0.82** in 5-fold cross-validation. For more details on the training procedure and performance, please refer to the paper.

## Citation

If you use this model or its findings in your research, please cite our paper:

```bibtex
@inproceedings{chebrolu2025narratives,
  title={{Personal Narratives Empower Politically Disinclined Individuals to Engage in Political Discussions}},
  author={{Chebrolu, Tejasvi and Kumaraguru, Ponnurangam and Rajadesingan, Ashwin}},
  booktitle={{Proceedings of the 17th ACM Web Science Conference 2025 (Websci '25)}},
  year={{2025}},
  organization={{ACM}},
  doi={10.1145/3717867.3717899}
}
```