Update README.md
Browse files
README.md
CHANGED
|
@@ -1,10 +1,3 @@
|
|
| 1 |
-
# 🧠 IPAD — Inverse Prompt for AI Detection
|
| 2 |
-
|
| 3 |
-
> **Fine-tuned Phi-3-Medium-128k-Instruct with LoRA using [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory)**
|
| 4 |
-
> Author: [@bellafc](https://huggingface.co/bellafc)
|
| 5 |
-
|
| 6 |
-
---
|
| 7 |
-
|
| 8 |
## 📘 Overview
|
| 9 |
|
| 10 |
Large Language Models (LLMs) have achieved human-level fluency in text generation, making it increasingly difficult to distinguish between human- and AI-authored content.
|
|
@@ -22,22 +15,6 @@ trained using [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory) for **ro
|
|
| 22 |
|
| 23 |
---
|
| 24 |
|
| 25 |
-
## ⚙️ Model Details
|
| 26 |
-
|
| 27 |
-
| Property | Description |
|
| 28 |
-
|-----------|-------------|
|
| 29 |
-
| **Base model** | [`microsoft/Phi-3-medium-128k-instruct`](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) |
|
| 30 |
-
| **Architecture** | Decoder-only Transformer |
|
| 31 |
-
| **Fine-tuning** | LoRA (rank-8, α=16, dropout=0.05) |
|
| 32 |
-
| **Context length** | 128k tokens |
|
| 33 |
-
| **Framework** | [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory) |
|
| 34 |
-
| **Task** | AI Text Detection (Discriminator) |
|
| 35 |
-
| **Language** | English |
|
| 36 |
-
| **License** | Apache 2.0 |
|
| 37 |
-
| **Author** | [@bellafc](https://huggingface.co/bellafc) |
|
| 38 |
-
|
| 39 |
-
---
|
| 40 |
-
|
| 41 |
## 🚀 Quick Usage
|
| 42 |
|
| 43 |
### 🧩 Prompt Inverter
|
|
@@ -94,4 +71,18 @@ probs = softmax(gen.scores[0], dim=-1)
|
|
| 94 |
yes_token_id = tokenizer(" yes", add_special_tokens=False).input_ids[0]
|
| 95 |
print("Generated:", generated_text)
|
| 96 |
print(f"P('yes') = {probs[0, yes_token_id].item():.4f}")
|
| 97 |
-
```
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
## 📘 Overview
|
| 2 |
|
| 3 |
Large Language Models (LLMs) have achieved human-level fluency in text generation, making it increasingly difficult to distinguish between human- and AI-authored content.
|
|
|
|
| 15 |
|
| 16 |
---
|
| 17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 18 |
## 🚀 Quick Usage
|
| 19 |
|
| 20 |
### 🧩 Prompt Inverter
|
|
|
|
| 71 |
yes_token_id = tokenizer(" yes", add_special_tokens=False).input_ids[0]
|
| 72 |
print("Generated:", generated_text)
|
| 73 |
print(f"P('yes') = {probs[0, yes_token_id].item():.4f}")
|
| 74 |
+
```
|
| 75 |
+
|
| 76 |
+
---
|
| 77 |
+
|
| 78 |
+
## ⚙️ Model Details
|
| 79 |
+
|
| 80 |
+
| Property | Description |
|
| 81 |
+
|-----------|-------------|
|
| 82 |
+
| **Base model** | [`microsoft/Phi-3-medium-128k-instruct`](https://huggingface.co/microsoft/Phi-3-medium-128k-instruct) |
|
| 83 |
+
| **Context length** | 128k tokens |
|
| 84 |
+
| **Framework** | [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory) |
|
| 85 |
+
| **Task** | AI Text Detection (Discriminator) |
|
| 86 |
+
| **Language** | English |
|
| 87 |
+
| **License** | Apache 2.0 |
|
| 88 |
+
| **Author** | [@bellafc](https://huggingface.co/bellafc) |
|