File size: 1,053 Bytes
bf85809 3013b5a bf85809 3013b5a bf85809 3013b5a bf85809 3013b5a bf85809 3013b5a bf85809 3013b5a bf85809 3013b5a bf85809 3013b5a bf85809 3013b5a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 |
---
language: id
license: apache-2.0
tags:
- indobert
- text-classification
- plagiarism-detection
- indonesian
- fine-tuned
pipeline_tag: text-classification
widget:
- text: "Apa pengganti y?"
example_title: "Contoh Plagiarisme"
- text: "Bagaimana cara belajar Python?"
example_title: "Contoh Paraphrase"
---
# IndoBERT Plagiarisme Detector
Model **IndoBERT-base-p1** fine-tuned untuk **deteksi plagiarisme teks bahasa Indonesia** (3 kelas):
- `LABEL_0` → 🟢 Tidak Mirip
- `LABEL_1` → 🟡 Paraphrase
- `LABEL_2` → 🔴 Plagiarisme (literal/copy-paste)
**Input model**: Masukkan dua teks/kalimat (question1 dan question2), model akan prediksi kemiripannya.
### Performa
- Accuracy: **78.33%** (test set 300 data)
- Dataset: 3000 data balanced + augmentasi sintetik
### Cara Pakai di Python
```python
from transformers import pipeline
detector = pipeline("text-classification", model="putraharifin/tubes_deep_learning")
result = detector("Apa pengganti y?", "Apa pengganti y dong")
print(result) # {'label': 'LABEL_2', 'score': 0.99} |