---
license: mit
language:
- en
metrics:
- accuracy
- f1
base_model:
- google-bert/bert-base-uncased
pipeline_tag: text-classification
tags:
- text-classification
- ai-detection
- academic-text
- ai-generated-text-detection
model-index:
- name: bert-ai-text-detector
results:
- task:
type: text-classification
name: AI-Generated Text Detection
dataset:
name: Custom Academic Text Dataset
type: custom
metrics:
- type: accuracy
value: 0.9957
- type: f1
value: 0.9958
- type: precision
value: 0.9923
- type: recall
value: 0.9994
---
# BERT-based AI-Generated Academic Text Detector
A fine-tuned BERT model for detecting AI-generated academic text, achieving **99.57% accuracy** on paragraph-level test samples.
## Online Demo
🌐 **Try the model online**: [https://followsci.com/ai-detection](https://followsci.com/ai-detection)
Free web interface with real-time detection, no installation or API key required.
## Model Details
### Model Description
- **Model Type**: BERT-base-uncased fine-tuned for binary text classification
- **Architecture**: BERT-base-uncased (110M parameters)
- **Task**: Binary classification (Human-written vs AI-generated text)
- **Input**: Academic text paragraphs (up to 512 tokens)
- **Output**: Binary label (0 = Human-written, 1 = AI-generated) with confidence scores
### Training Information
- **Training Samples**: 1,487,400 paragraph-level samples
- **Validation Samples**: 185,930 paragraph-level samples
- **Test Samples**: 185,930 paragraph-level samples
- **Total Dataset**: 1,859,260 paragraphs
- **Training Data**:
- Human-written: Academic papers from arXiv
- AI-generated: Text generated by various large language models (GPT, Claude, etc.)
## Performance
### Test Set Results
| Metric | Value |
|--------|-------|
| **Accuracy** | **99.57%** |
| **F1-Score** | **99.58%** |
| Precision | 99.23% |
| Recall | 99.94% |
| False Positive Rate | 0.82% |
| False Negative Rate | 0.06% |
### Confusion Matrix (Test Set)
| | Predicted: Human | Predicted: AI |
|---|---|---|
| **Actual: Human** | 89,740 (TN) | 740 (FP) |
| **Actual: AI** | 60 (FN) | 95,390 (TP) |
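The headline metrics follow directly from these counts. As a quick sanity check, they can be recomputed from the confusion matrix above:

```python
# Recompute the reported metrics from the confusion-matrix counts above.
tn, fp, fn, tp = 89_740, 740, 60, 95_390

accuracy  = (tp + tn) / (tp + tn + fp + fn)                 # 0.9957
precision = tp / (tp + fp)                                  # 0.9923
recall    = tp / (tp + fn)                                  # 0.9994
f1        = 2 * precision * recall / (precision + recall)   # 0.9958
fpr       = fp / (fp + tn)                                  # 0.0082
fnr       = fn / (fn + tp)                                  # 0.0006
```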
**Inference Speed:** ~20,900 samples/second on RTX 3090 (batch size 64)
## Usage
### Quick Start
```python
from transformers import BertTokenizer, BertForSequenceClassification
import torch
# Load model and tokenizer
model_name = "followsci/bert-ai-text-detector"
tokenizer = BertTokenizer.from_pretrained(model_name)
model = BertForSequenceClassification.from_pretrained(model_name)
model.eval()
# Detect AI text
text = "Your academic paragraph here..."
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
with torch.no_grad():
    outputs = model(**inputs)
    probs = torch.nn.functional.softmax(outputs.logits, dim=-1)

ai_prob = probs[0][1].item() * 100
human_prob = probs[0][0].item() * 100

print(f"AI-generated probability: {ai_prob:.1f}%")
print(f"Human-written probability: {human_prob:.1f}%")

if ai_prob > 50:
    print("Prediction: AI-generated")
else:
    print("Prediction: Human-written")
```
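To get closer to the throughput figure quoted above, run inference on a GPU. A minimal sketch; the device-selection lines are an addition to the original snippet, not part of it:

```python
# Optional: move the model and inputs to a GPU when one is available.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model.to(device)

inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
inputs = {k: v.to(device) for k, v in inputs.items()}

with torch.no_grad():
    probs = torch.nn.functional.softmax(model(**inputs).logits, dim=-1)
```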
### Batch Processing
```python
texts = [
    "First paragraph...",
    "Second paragraph...",
    # ... more texts
]

inputs = tokenizer(
    texts,
    return_tensors="pt",
    truncation=True,
    max_length=512,
    padding=True,
)

with torch.no_grad():
    outputs = model(**inputs)
    probs = torch.nn.functional.softmax(outputs.logits, dim=-1)

for i, prob in enumerate(probs):
    ai_prob = prob[1].item() * 100
    print(f"Text {i+1}: AI probability = {ai_prob:.1f}%")
```
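Tokenizing a large corpus in one call can exhaust memory, since every text is padded into a single tensor. A hedged sketch that processes texts in fixed-size chunks instead (`chunk_size=64` mirrors the batch size used for the throughput figure above; the helper name is ours, not part of the released tooling):

```python
def score_in_chunks(texts, chunk_size=64):
    """Yield the AI probability for each text, chunk_size texts at a time."""
    for start in range(0, len(texts), chunk_size):
        chunk = texts[start:start + chunk_size]
        inputs = tokenizer(chunk, return_tensors="pt", truncation=True,
                           max_length=512, padding=True)
        with torch.no_grad():
            logits = model(**inputs).logits
        yield from torch.nn.functional.softmax(logits, dim=-1)[:, 1].tolist()

for i, ai_prob in enumerate(score_in_chunks(texts)):
    print(f"Text {i+1}: AI probability = {ai_prob * 100:.1f}%")
```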
### Using with Transformers Pipeline
```python
from transformers import pipeline
classifier = pipeline(
    "text-classification",
    model="followsci/bert-ai-text-detector",
    tokenizer="followsci/bert-ai-text-detector",
)
result = classifier("Your text here...")
print(result)
```
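Unless human-readable label names are stored in the model config, the pipeline reports generic `LABEL_0`/`LABEL_1` labels. A small hedged mapping, assuming the 0 = human, 1 = AI convention stated above:

```python
# Map generic pipeline labels to this card's convention (an assumption:
# the repo config may already define human-readable names).
label_names = {"LABEL_0": "Human-written", "LABEL_1": "AI-generated"}

for r in classifier(["First text...", "Second text..."]):
    print(f"{label_names.get(r['label'], r['label'])} (score={r['score']:.4f})")
```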
## Training Details
### Training Configuration
- **Base Model**: `bert-base-uncased`
- **Batch Size**: 64
- **Learning Rate**: 5e-5 (with linear warmup)
- **Warmup Steps**: 5,000
- **Max Sequence Length**: 512
- **Optimizer**: AdamW
- **Epochs**: 3
- **Training Time**: ~11 hours (on RTX 3090)
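A hedged sketch of how these hyperparameters could map onto a Hugging Face `Trainer` setup; the dataset objects and evaluation wiring are placeholders, since the original training script is not published here:

```python
from transformers import (BertForSequenceClassification, Trainer,
                          TrainingArguments)

model = BertForSequenceClassification.from_pretrained(
    "google-bert/bert-base-uncased", num_labels=2)

args = TrainingArguments(
    output_dir="bert-ai-text-detector",
    per_device_train_batch_size=64,   # batch size from the card
    learning_rate=5e-5,               # linear decay after warmup is the default
    warmup_steps=5_000,               # warmup steps from the card
    num_train_epochs=3,               # epochs from the card
    optim="adamw_torch",              # AdamW optimizer
)

# train_dataset / eval_dataset are placeholders for the unpublished dataset.
trainer = Trainer(model=model, args=args,
                  train_dataset=train_dataset, eval_dataset=eval_dataset)
trainer.train()
```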
### Dataset Distribution
| Split | Total Samples | Human (Label 0) | AI (Label 1) |
|-------|--------------|-----------------|--------------|
| Train | 1,487,400 | 723,780 (48.7%) | 763,620 (51.3%) |
| Validation | 185,930 | 90,470 (48.7%) | 95,460 (51.3%) |
| Test | 185,930 | 90,480 (48.7%) | 95,450 (51.3%) |
## Limitations
1. **Domain Specificity**: The model is trained primarily on academic text. Performance may degrade on:
- Casual text or social media content
- Technical documentation
- Creative writing
2. **Binary Classification**: The model only distinguishes between "human" and "AI" text, without:
- Identifying which AI model generated the text
- Providing confidence intervals
- Detecting partially AI-assisted text
3. **Paragraph-Level Detection**: The model is optimized for paragraph-level samples (see the sketch after this list):
- Performance on sentence-level or full-document level may vary
- Best results achieved with structured academic paragraphs
4. **False Positives**: Approximately 0.82% false positive rate means some human-written text may be flagged as AI-generated.
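Given the paragraph-level focus, one reasonable way to score a longer document is to split it into paragraphs and classify each one separately. A hedged sketch; the blank-line splitting heuristic and helper name are assumptions, not part of the released tooling:

```python
def score_document(document):
    """Split a document on blank lines and score each paragraph separately."""
    paragraphs = [p.strip() for p in document.split("\n\n") if p.strip()]
    for para in paragraphs:
        inputs = tokenizer(para, return_tensors="pt",
                           truncation=True, max_length=512)
        with torch.no_grad():
            probs = torch.nn.functional.softmax(model(**inputs).logits, dim=-1)
        yield para[:40], probs[0][1].item()  # (preview, AI probability)

for preview, ai_prob in score_document(open("paper.txt").read()):
    print(f"{ai_prob * 100:5.1f}%  {preview}...")
```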
## Ethical Considerations
- **Use Case**: This model is intended as a tool for academic integrity and research purposes
- **Bias**: The model may reflect biases present in the training data
- **Misuse**: Should not be used as the sole criterion for academic misconduct decisions
- **Transparency**: Results should be interpreted with context and domain expertise
## License
This model is licensed under the MIT License.
## Contact
- **Email**: raffoduanedonnenfeld@gmail.com
---
<p align="center">
Made with ❤️ for Academic Integrity
</p>