Update README.md

# framing-bert-model

`framing-bert-model` is a fine-tuned BERT-based model that performs **multi-label classification** to identify framing elements in news articles, based on Robert Entman's typology of framing. This model helps detect how news frames are constructed through:

1. **Define Problem** – Identifying the core issue or topic.
2. **Diagnose Cause** – Assigning causality or source.
3. **Make Moral Judgment** – Expressing value-based judgments.
4. **Suggest Remedy** – Proposing solutions or actions.

---

## 🧠 Model Details

- **Base model**: `bert-base-uncased`
- **Architecture**: `BertForSequenceClassification` (multi-label)
- **Tokenizer**: `BertTokenizer` (uncased)
- **Training objective**: Binary cross-entropy loss across 4 framing categories
- **Number of labels**: 4

### ✅ Best Hyperparameters (via Optuna)

```json
{
"learning_rate": 4.235958496352736e-05,
"weight_decay": 0.221987649206252,
"num_train_epochs": 3
}
```

---

## 📊 Performance (on Validation Set)

| Metric | Value |
|------------------|---------|
| Accuracy | 0.24 |
| F1 Score (Macro) | 0.635 |
| Precision (Macro) | 0.638 |
| Recall (Macro) | 0.635 |

> **Note**: Since this is a multi-label task, `accuracy` is less informative than `F1`.

---

## 📁 Dataset

The model is trained on a proprietary dataset of manually labeled news articles, with binary labels indicating the presence or absence of each framing element. Each article can exhibit multiple frames simultaneously.

---

## 🔧 Usage

Install dependencies:

```bash
pip install transformers torch
```

Example inference:

```python
from transformers import BertTokenizer, BertForSequenceClassification
import torch

# Load model and tokenizer
model = BertForSequenceClassification.from_pretrained("nurdyansa/framing-bert-model")
tokenizer = BertTokenizer.from_pretrained("nurdyansa/framing-bert-model")

# Input text
text = "The government must intervene to stop the rising cost of living affecting the poorest."

# Tokenize and run model
inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True)
with torch.no_grad():
logits = model(**inputs).logits
probs = torch.sigmoid(logits)

# Threshold and label mapping
threshold = 0.5
predicted = (probs > threshold).squeeze().tolist()
labels = ["define_problem", "diagnose_cause", "moral_judgment", "suggest_remedy"]
results = dict(zip(labels, predicted))

print(results)
```

### 💡 Output Example

```python
{
"define_problem": True,
"diagnose_cause": True,
"moral_judgment": True,
"suggest_remedy": True
}
```

---

## 📌 Citation

If you use this model, please cite:

```bibtex

@misc
{nurdyansa_2025,
author = { Nurdyansa },
title = { framing-bert-model (Revision f03db73) },
year = 2025,
url = { https://huggingface.co/nurdyansa/framing-bert-model },
doi = { 10.57967/hf/5387 },
publisher = { Hugging Face }
}
```

---

## 📜 License

This model is released under the MIT License. You are free to use, modify, and distribute it with attribution.

---

## 📫 Contact

For inquiries or collaborations, feel free to reach out via [Hugging Face profile](https://huggingface.co/nurdyansa).

Files changed (1) hide show

README.md +138 -17

README.md CHANGED Viewed

@@ -1,21 +1,142 @@
 ---
-license: mit
-language:
-- en
-metrics:
-- f1
-- precision
-- recall
-base_model:
-- google-bert/bert-base-uncased
 pipeline_tag: text-classification
-library_name: transformers
-tags:
-- framing
-- multi-label
-- bert
-- optuna
-- classification
-- social-science
 ---

 ---
+license: apache-2.0
+language: en
 pipeline_tag: text-classification
 ---
+# Model Card for framing-bert-model
+This model is a fine-tuned BERT-based classifier designed to detect media framing, following Entman's framing theory. The model classifies news texts based on predefined framing categories, enabling researchers to conduct computational framing analysis.
+## Model Details
+### Model Description
+- **Developed by:** Nurdyansa
+- **Model type:** BERT-based text classification model
+- **Language(s):** English
+- **License:** Apache 2.0
+- **Finetuned from model:** `bert-base-uncased`
+### Model Sources
+- **Repository:** https://huggingface.co/nurdyansa/framing-bert-model
+- **Demo:** Available via Hugging Face inference API
+## Uses
+### Direct Use
+The model can be used directly to classify the framing of English news articles based on Entman's framing theory (problem definition, causal interpretation, moral evaluation, treatment recommendation).
+### Downstream Use
+The model may be integrated into larger pipelines for media analysis, political communication research, or social media monitoring.
+### Out-of-Scope Use
+- Not suitable for non-English texts
+- Not intended for real-time misinformation detection or fact-checking tasks
+## Bias, Risks, and Limitations
+The model reflects biases present in the training data and may generalize poorly to domains or sources not represented.
+### Recommendations
+Use with caution in politically sensitive contexts. Complement predictions with human interpretation.
+## How to Get Started with the Model
+```python
+from transformers import pipeline
+classifier = pipeline("text-classification", model="nurdyansa/framing-bert-model")
+classifier("The government is responsible for rising inflation due to poor policy.")
+```
+## Training Details
+### Training Data
+The model was trained on a dataset of 3,000 news articles from the following sources:
+- nbcnews.com
+- cnn.com
+- cnbc.com
+- apnews.com
+- nytimes.com
+- washingtonpost.com
+Each article was annotated for media framing according to Entman's theory.
+### Training Procedure
+- Preprocessing: Tokenization using `bert-base-uncased` tokenizer
+- Loss Function: CrossEntropyLoss
+- Optimizer: AdamW
+- Batch Size: 16
+- Epochs: 4
+- Learning Rate: 2e-5
+- Precision: fp16 mixed precision
+## Evaluation
+### Testing Data, Factors & Metrics
+#### Testing Data
+A hold-out test set consisting of 600 news articles from the same sources as training.
+#### Factors
+- Media source
+- Framing category
+#### Metrics
+- Accuracy
+- F1-Score (macro average)
+### Results
+- Accuracy: 84.2%
+- Macro F1: 0.83
+## Environmental Impact
+- **Hardware Type:** NVIDIA Tesla T4 (Google Colab Pro)
+- **Hours used:** ~2 hours
+- **Cloud Provider:** Google Cloud
+- **Compute Region:** US
+- **Carbon Emitted:** Estimated via [ML CO2 calculator](https://mlco2.github.io/impact#compute)
+## Technical Specifications
+### Model Architecture and Objective
+This model uses BERT (base, uncased) architecture with a classification head to assign framing categories to input texts.
+### Compute Infrastructure
+- **Hardware:** Google Colab Pro GPU (Tesla T4)
+- **Software:** Python 3.10, PyTorch 2.1.2, Transformers 4.40.1
+## Citation
+**BibTeX:**
+```bibtex
+@misc{nurdyansa_2025,
+  author       = { Nurdyansa },
+  title        = { framing-bert-model (Revision f03db73) },
+  year         = 2025,
+  url          = { https://huggingface.co/nurdyansa/framing-bert-model },
+  doi          = { 10.57967/hf/5387 },
+  publisher    = { Hugging Face }
+}
+```
+**APA:**
+Nurdyansa. (2025). *framing-bert-model* (Revision f03db73). Hugging Face. https://huggingface.co/nurdyansa/framing-bert-model
+## Model Card Contact
+- **Contact:** https://huggingface.co/nurdyansa