tejasvichebrolu's picture
Model upload from WebSci'25 paper
852873f verified
---
language: en
license: apache-2.0
library_name: transformers
tags:
- text-classification
- personal-narrative
- political-discourse
- computational-social-science
- websci25
datasets:
- custom-reddit-dataset
base_model: falkne/storytelling-LM-europarl-mixed-en
---
# Personal Narrative Classifier (WebSci'25)
This is the official repository for the text classification model presented in the paper: **"Personal Narratives Empower Politically Disinclined Individuals to Engage in Political Discussions"**, which received a Best Paper Honorable Mention at the 17th ACM Web Science Conference (WebSci'25).
The model is a fine-tuned BERT-based classifier (`falkne/storytelling-LM-europarl-mixed-en`) designed to identify personal narratives in online comments.
## Model Description
This model classifies a given text as either a "Personal Narrative" or "Not a Personal Narrative". It was developed to support a large-scale computational analysis of how personal stories affect engagement in online political discussions on Reddit.
- **Label 0**: Not a Personal Narrative
- **Label 1**: Personal Narrative
## Intended Uses & Limitations
### Intended Use
This model is intended for researchers in computational social science, political science, communication, and HCI to study online discourse. It can be used to:
- Quantify the use of personal narratives in various online communities.
- Analyze the reception and impact of story-based arguments.
- Replicate and extend the findings of the original paper.
### Limitations
As noted in the paper, this model has several limitations:
- The training and evaluation data comes from political subreddits on Reddit from 2020-2021. Its performance may vary on other platforms or time periods.
- The definition of "political activity" was based on subreddit engagement, which may not capture all forms of political interest.
- The model does not analyze the content or veracity of the narratives. Personal narratives can also be used to spread misinformation, which is an avenue for future research.
## How to Use
You can use this model with the `transformers` library pipeline for easy inference.
```python
from transformers import pipeline
repo_id = "tejasvichebrolu/personal-narrative-classifier"
classifier = pipeline("text-classification", model=repo_id)
# Example texts
narrative_text = "I’m in Alabama and oh my god it was so humid yesterday. I was so unproductive from how bad it was."
non_narrative_text = "The most straightforward solution is to encourage others to engage with politics online."
# Get predictions
results = classifier([narrative_text, non_narrative_text])
for text, result in zip([narrative_text, non_narrative_text], results):
print(f"Text: {text}")
# The pipeline may return LABEL_0/LABEL_1 or the names from the config
print(f" -> Prediction: {result['label']}, Score: {result['score']:.4f}\n")
```
## Training and Evaluation
The model was fine-tuned on a dataset of 2,000 manually labeled Reddit comments. It achieved a macro average F1-score of **0.82** in 5-fold cross-validation. For more details on the training procedure and performance, please refer to the paper.
## Citation
If you use this model or its findings in your research, please cite our paper:
```bibtex
@inproceedings{chebrolu2025narratives,
title={{Personal Narratives Empower Politically Disinclined Individuals to Engage in Political Discussions}},
author={{Chebrolu, Tejasvi and Kumaraguru, Ponnurangam and Rajadesingan, Ashwin}},
booktitle={{Proceedings of the 17th ACM Web Science Conference 2025 (Websci '25)}},
year={{2025}},
organization={{ACM}},
doi={10.1145/3717867.3717899}
}
```