SayedShaun
/

distilbert-link-type-classifier-int8

Model card Files Files and versions

SayedShaun commited on about 1 month ago

Commit

2f7f8ea

·

verified ·

1 Parent(s): 45a137b

Create README.md

Files changed (1) hide show

README.md +43 -0

README.md ADDED Viewed

	@@ -0,0 +1,43 @@

+---
+language: en
+license: apache-2.0
+tags:
+- url-classification
+- text-classification
+- distilbert
+- web-mining
+- nlp
+- seo
+- crawler
+datasets:
+- ruggsea/infini-news-corpus
+metrics:
+- accuracy
+pipeline_tag: text-classification
+---
+# 🌐 URL Content vs Section Classifier (DistilBERT)
+This model classifies a **web URL** into one of two structural categories:
+- **content** → A specific article, blog post, or news story page
+- **section** → A category page, listing page, or homepage/navigation page
+It is designed for **web crawling, content extraction, and large-scale URL filtering**.
+---
+# 🚀 Model Overview
+This model is a fine-tuned version of:
+👉 **:contentReference[oaicite:0]{index=0}**
+It learns patterns in URL structure rather than natural language sentences.
+---
+## 🧠 Problem Type
+### Input
+A single URL string: