File size: 3,679 Bytes
852873f | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 | ---
language: en
license: apache-2.0
library_name: transformers
tags:
- text-classification
- personal-narrative
- political-discourse
- computational-social-science
- websci25
datasets:
- custom-reddit-dataset
base_model: falkne/storytelling-LM-europarl-mixed-en
---
# Personal Narrative Classifier (WebSci'25)
This is the official repository for the text classification model presented in the paper: **"Personal Narratives Empower Politically Disinclined Individuals to Engage in Political Discussions"**, which received a Best Paper Honorable Mention at the 17th ACM Web Science Conference (WebSci'25).
The model is a fine-tuned BERT-based classifier (`falkne/storytelling-LM-europarl-mixed-en`) designed to identify personal narratives in online comments.
## Model Description
This model classifies a given text as either a "Personal Narrative" or "Not a Personal Narrative". It was developed to support a large-scale computational analysis of how personal stories affect engagement in online political discussions on Reddit.
- **Label 0**: Not a Personal Narrative
- **Label 1**: Personal Narrative
## Intended Uses & Limitations
### Intended Use
This model is intended for researchers in computational social science, political science, communication, and HCI to study online discourse. It can be used to:
- Quantify the use of personal narratives in various online communities.
- Analyze the reception and impact of story-based arguments.
- Replicate and extend the findings of the original paper.
### Limitations
As noted in the paper, this model has several limitations:
- The training and evaluation data comes from political subreddits on Reddit from 2020-2021. Its performance may vary on other platforms or time periods.
- The definition of "political activity" was based on subreddit engagement, which may not capture all forms of political interest.
- The model does not analyze the content or veracity of the narratives. Personal narratives can also be used to spread misinformation, which is an avenue for future research.
## How to Use
You can use this model with the `transformers` library pipeline for easy inference.
```python
from transformers import pipeline
repo_id = "tejasvichebrolu/personal-narrative-classifier"
classifier = pipeline("text-classification", model=repo_id)
# Example texts
narrative_text = "I’m in Alabama and oh my god it was so humid yesterday. I was so unproductive from how bad it was."
non_narrative_text = "The most straightforward solution is to encourage others to engage with politics online."
# Get predictions
results = classifier([narrative_text, non_narrative_text])
for text, result in zip([narrative_text, non_narrative_text], results):
print(f"Text: {text}")
# The pipeline may return LABEL_0/LABEL_1 or the names from the config
print(f" -> Prediction: {result['label']}, Score: {result['score']:.4f}\n")
```
## Training and Evaluation
The model was fine-tuned on a dataset of 2,000 manually labeled Reddit comments. It achieved a macro average F1-score of **0.82** in 5-fold cross-validation. For more details on the training procedure and performance, please refer to the paper.
## Citation
If you use this model or its findings in your research, please cite our paper:
```bibtex
@inproceedings{chebrolu2025narratives,
title={{Personal Narratives Empower Politically Disinclined Individuals to Engage in Political Discussions}},
author={{Chebrolu, Tejasvi and Kumaraguru, Ponnurangam and Rajadesingan, Ashwin}},
booktitle={{Proceedings of the 17th ACM Web Science Conference 2025 (Websci '25)}},
year={{2025}},
organization={{ACM}},
doi={10.1145/3717867.3717899}
}
``` |