YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

PassiveAggressiveClassifier Fake News Detector

Model Description

This model uses a PassiveAggressiveClassifier from scikit-learn to classify news articles as "Fake" or "Real". The input data consists of news articles from two datasets (True.csv and Fake.csv). Text data is preprocessed (lowercased, punctuation and numbers removed, extra spaces cleaned) and vectorized using TF-IDF. The model is trained on 80% of the data and tested on the remaining 20%.

Intended Uses & Limitations

Intended use: Detect fake news in English news articles.
Limitations: May not generalize to other languages or domains. The model is trained on a specific dataset and may not perform well on out-of-distribution samples.

Training Data

True.csv: Real news articles.
Fake.csv: Fake news articles.
Both datasets contain columns: title, text, subject, date.

Preprocessing

Lowercasing text
Removing punctuation and numbers
Removing extra spaces
TF-IDF vectorization with English stopwords, max_df=0.7

Metrics

Accuracy: 99.41% on the test set.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support