Instructions to use Phase-Technologies/netuark-classifier-ensemble with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Scikit-learn
How to use Phase-Technologies/netuark-classifier-ensemble with Scikit-learn:
from huggingface_hub import hf_hub_download import joblib model = joblib.load( hf_hub_download("Phase-Technologies/netuark-classifier-ensemble", "sklearn_model.joblib") ) # only load pickle files from sources you trust # read more about it here https://skops.readthedocs.io/en/stable/persistence.html - Notebooks
- Google Colab
- Kaggle
Upload README.md with huggingface_hub
Browse files
README.md
ADDED
|
@@ -0,0 +1,20 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
|
| 2 |
+
# NetuArk Posts Classifier (Ensemble Architecture)
|
| 3 |
+
|
| 4 |
+
This model is a novel ensemble classifier designed to categorize technology-related social media posts into their respective news sources.
|
| 5 |
+
|
| 6 |
+
## Model Details
|
| 7 |
+
- **Architecture:** Voting Classifier (Multinomial Naive Bayes + Logistic Regression)
|
| 8 |
+
- **Vectorization:** TF-IDF (N-grams 1-3)
|
| 9 |
+
- **Accuracy:** 99.81% on the NetuArk-6000 dataset.
|
| 10 |
+
- **Classes:** HackerNews, TechCrunch, TheVerge, FT, GuardianTech, Slashdot, ArsTechnica.
|
| 11 |
+
|
| 12 |
+
## Training Data
|
| 13 |
+
Trained on the [Xerv-AI/netuark-posts-6000](https://huggingface.co/datasets/Xerv-AI/netuark-posts-6000) dataset.
|
| 14 |
+
|
| 15 |
+
## Usage
|
| 16 |
+
```python
|
| 17 |
+
import joblib
|
| 18 |
+
model = joblib.load('netuark_ensemble_classifier.joblib')
|
| 19 |
+
prediction = model.predict(["New AI breakthrough on HackerNews"])
|
| 20 |
+
```
|