---
license: mit
datasets:
- MisileLab/youtube-bot-comments
language:
- ko
pipeline_tag: text-classification
---
# noMoreSpamYT - YouTube Bot Comment Detector

This model detects bot comments on YouTube videos using a fine-tuned KcELECTRA model with custom classification layers.

## Model Description

noMoreSpamYT is a specialized model for identifying bot-generated comments on YouTube. It leverages the KcELECTRA base model with a custom architecture optimized for handling the class imbalance inherent in bot detection tasks.
### Model Architecture

- **Base Model**: [beomi/KcELECTRA-base](https://huggingface.co/beomi/KcELECTRA-base) - A Korean-focused ELECTRA model
- **Modifications**:
  - Frozen initial transformer layers to prevent overfitting
  - Custom classification layers with dropout for regularization
  - Combined CLS token and mean pooling for improved feature representation
  - Focal Loss to handle class imbalance (see the sketch after this list)
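Training code is not included in this card, so the following is only a minimal sketch of the kind of binary focal loss the list above refers to; the `alpha` and `gamma` values are illustrative assumptions, not the settings used to train this model.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FocalLoss(nn.Module):
    """Binary focal loss: down-weights well-classified examples so the
    rare bot class contributes relatively more to the loss.
    alpha and gamma are illustrative defaults, not this model's values."""

    def __init__(self, alpha: float = 0.25, gamma: float = 2.0):
        super().__init__()
        self.alpha = alpha
        self.gamma = gamma

    def forward(self, logits: torch.Tensor, targets: torch.Tensor) -> torch.Tensor:
        # Per-example BCE, then modulated by (1 - p_t)^gamma
        bce = F.binary_cross_entropy_with_logits(logits, targets, reduction="none")
        p_t = torch.exp(-bce)  # probability assigned to the true class
        alpha_t = self.alpha * targets + (1 - self.alpha) * (1 - targets)
        return (alpha_t * (1 - p_t) ** self.gamma * bce).mean()
```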
### Key Features

- Effective on Korean YouTube comments
- Robust against class imbalance (few bot comments vs. many human comments)
- Optimized for both precision and recall in bot detection
## Intended Uses

This model is designed for:
- Content moderation on YouTube videos
- Automated filtering of bot comments
- Research on bot behavior in social media
## Training Data

The model was trained on the [MisileLab/youtube-bot-comments](https://huggingface.co/datasets/MisileLab/youtube-bot-comments) dataset, which contains:
- YouTube comments collected from popular Korean videos
- Manual annotations for bot vs. human comments
- A 70/20/10 train/test/validation split
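For orientation, the dataset can be pulled with the `datasets` library. The split layout is inferred from the 70/20/10 description above and may not match the repository's actual configuration, so inspect the returned object before relying on it:

```python
from datasets import load_dataset

# Split and column names are assumptions inferred from the description
# above; print the DatasetDict to see what the repo actually provides.
ds = load_dataset("MisileLab/youtube-bot-comments")
print(ds)
```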
## Performance

The model achieves:
- High precision in bot detection to minimize false positives
- Good recall to catch the majority of bot comments
- Balanced performance across different comment lengths and styles
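No numeric scores are reported here. To measure precision and recall yourself, a sketch like the following works, reusing the `classify_comment` helper defined in the Usage section below; the `test` split name and the `author`, `content`, and `label` columns are hypothetical placeholders, so check the dataset card for the real schema:

```python
from datasets import load_dataset
from sklearn.metrics import precision_score, recall_score

# "test", "author", "content", and "label" are assumed names;
# substitute the dataset's actual split and column names.
test = load_dataset("MisileLab/youtube-bot-comments", split="test")
preds = [int(classify_comment(r["author"], r["content"])["is_bot"]) for r in test]
labels = list(test["label"])
print("precision:", precision_score(labels, preds))
print("recall:", recall_score(labels, preds))
```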
## Usage

```python
from transformers import AutoTokenizer, ElectraModel
import torch
import torch.nn as nn

# Load the tokenizer
tokenizer = AutoTokenizer.from_pretrained("beomi/KcELECTRA-base")

# Define the model architecture (same as in training)
class SpamUserClassificationLayer(nn.Module):
    def __init__(self, encoder: ElectraModel):
        super().__init__()

        self.encoder = encoder

        # Classification network optimized for imbalanced datasets.
        # Input dimension is 1536 (CLS + mean pooling), not the encoder's 768.
        self.dense1 = nn.Linear(1536, 512)
        self.layernorm1 = nn.LayerNorm(512)
        self.gelu1 = nn.GELU()
        self.dropout1 = nn.Dropout(0.4)

        self.dense2 = nn.Linear(512, 256)
        self.layernorm2 = nn.LayerNorm(256)
        self.gelu2 = nn.GELU()
        self.dropout2 = nn.Dropout(0.3)

    def forward(self, input_ids, attention_mask=None, token_type_ids=None):
        # Fall back to attending over every token when no mask is supplied
        if attention_mask is None:
            attention_mask = torch.ones_like(input_ids)

        # Get encoder outputs
        outputs = self.encoder(
            input_ids=input_ids,
            attention_mask=attention_mask,
            token_type_ids=token_type_ids,
            output_attentions=True
        )

        # CLS token representation
        cls_output = outputs.last_hidden_state[:, 0, :]  # [batch, 768]

        # Mean pooling with proper attention masking
        token_embeddings = outputs.last_hidden_state  # [batch, seq_len, 768]
        input_mask_expanded = attention_mask.unsqueeze(-1).expand(token_embeddings.size()).float()
        sum_embeddings = torch.sum(token_embeddings * input_mask_expanded, 1)
        sum_mask = torch.clamp(input_mask_expanded.sum(1), min=1e-9)
        mean_pooled = sum_embeddings / sum_mask  # [batch, 768]

        # Concatenate CLS + mean pooling
        combined_output = torch.cat([cls_output, mean_pooled], dim=1)  # [batch, 1536]

        # Pass through classification network
        x = self.dense1(combined_output)
        x = self.layernorm1(x)
        x = self.gelu1(x)
        x = self.dropout1(x)

        x = self.dense2(x)
        x = self.layernorm2(x)
        x = self.gelu2(x)
        x = self.dropout2(x)

        return x

class SpamUserClassifier(nn.Module):
    def __init__(self, pretrained_model_name="beomi/KcELECTRA-base"):
        super().__init__()

        self.encoder = ElectraModel.from_pretrained(pretrained_model_name)

        # Freeze first 2 layers for the imbalanced dataset scenario
        for i, layer in enumerate(self.encoder.encoder.layer):
            if i < 2:
                for param in layer.parameters():
                    param.requires_grad = False

        # The author-name and comment-content branches share the same encoder
        self.nameLayer = SpamUserClassificationLayer(self.encoder)
        self.contentLayer = SpamUserClassificationLayer(self.encoder)

        # 512 = two concatenated 256-dim branch outputs
        self.dense = nn.Linear(512, 256)
        self.layernorm = nn.LayerNorm(256)
        self.gelu = nn.GELU()
        self.dropout = nn.Dropout(0.3)

        self.output_layer = nn.Linear(256, 1)
        self.sigmoid = nn.Sigmoid()

    def forward(self, name_input_ids, content_input_ids, name_attention_mask=None, name_token_type_ids=None,
                content_attention_mask=None, content_token_type_ids=None, return_logits=False, return_probs=True):

        namePrediction = self.nameLayer(name_input_ids, name_attention_mask, name_token_type_ids)
        contentPrediction = self.contentLayer(content_input_ids, content_attention_mask, content_token_type_ids)

        # Fuse the two branches and pass through the classification head
        x = self.dense(torch.cat([namePrediction, contentPrediction], dim=1))
        x = self.layernorm(x)
        x = self.gelu(x)
        x = self.dropout(x)

        logits = self.output_layer(x)

        if return_logits:
            return logits
        else:
            # Apply sigmoid and return probabilities or hard predictions
            probs = self.sigmoid(logits)
            # Class predictions: 0 (not bot) or 1 (bot); the 0.9 cutoff
            # matches classify_comment's default threshold below
            return probs if return_probs else (probs > 0.9).long().squeeze(-1)

# Load the model
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = SpamUserClassifier()
model.load_state_dict(torch.load("model.pth", map_location=device))
model.to(device)
model.eval()

# Example inference
def classify_comment(author_name, comment_text, threshold=0.9):
    # Tokenize author name
    name_encoding = tokenizer(
        author_name,
        truncation=True,
        padding="max_length",
        max_length=128,
        return_tensors="pt"
    )
    name_input_ids = name_encoding["input_ids"].to(device)
    name_attention_mask = name_encoding["attention_mask"].to(device)

    # Tokenize content
    content_encoding = tokenizer(
        comment_text,
        truncation=True,
        padding="max_length",
        max_length=128,
        return_tensors="pt"
    )
    content_input_ids = content_encoding["input_ids"].to(device)
    content_attention_mask = content_encoding["attention_mask"].to(device)

    # Get prediction
    with torch.no_grad():
        probs = model(
            name_input_ids=name_input_ids,
            content_input_ids=content_input_ids,
            name_attention_mask=name_attention_mask,
            content_attention_mask=content_attention_mask,
            return_logits=False,
            return_probs=True
        )

    # Get probability and prediction
    probability = probs.item()
    is_bot = probability > threshold

    return {
        "probability": probability,
        "is_bot": is_bot
    }

# Example usage
result = classify_comment(
    author_name="SpamBot2023",
    comment_text="Check out my channel for free gift cards!"
)
print(f"Bot probability: {result['probability']:.4f}")
print(f"Is bot comment: {result['is_bot']}")
```
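The snippet above loads `model.pth` from the working directory. If the checkpoint is published under that filename in this model repository (an assumption; check the repo's file list), it can be fetched with `huggingface_hub`:

```python
from huggingface_hub import hf_hub_download

# Assumes the weights are stored as "model.pth" in this repo;
# adjust repo_id/filename if the layout differs.
weights_path = hf_hub_download(repo_id="MisileLab/noMoreSpamYT", filename="model.pth")
model.load_state_dict(torch.load(weights_path, map_location=device))
```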
## Limitations

- Primarily optimized for Korean YouTube comments
- May have reduced performance on other languages or platforms
- Cannot detect sophisticated bots that closely mimic human writing patterns
- Limited to text-based features (doesn't consider user history or behavior patterns)
## Citation

If you use this model in your research, please cite:

```bibtex
@misc{noMoreSpamYT,
  author = {MisileLab},
  title = {noMoreSpamYT: YouTube Bot Comment Detection System},
  year = {2025},
  publisher = {Hugging Face},
  howpublished = {\url{https://huggingface.co/MisileLab/noMoreSpamYT}}
}
```
## Contact

For questions, issues, or feedback, please open an issue on the [GitHub repository](https://github.com/MisileLab/h3/tree/main/projects/dsb/vivian).