Add model card for REVEAL
Browse filesThis PR adds a comprehensive model card for REVEAL. It includes:
- Links to the paper: [Reasoning-Aware AIGC Detection via Alignment and Reinforcement](https://huggingface.co/papers/2604.19172).
- Link to the project page: https://aka.ms/reveal.
- Link to the Github repository: https://github.com/microsoft/AnthropomorphicIntelligence.
- Relevant metadata for `library_name: transformers` and `pipeline_tag: text-classification`.
README.md
ADDED
|
@@ -0,0 +1,25 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
library_name: transformers
|
| 3 |
+
pipeline_tag: text-classification
|
| 4 |
+
---
|
| 5 |
+
|
| 6 |
+
# REVEAL: Reasoning-Aware AIGC Detection via Alignment and Reinforcement
|
| 7 |
+
|
| 8 |
+
REVEAL is a detection framework designed to identify AI-generated content (AIGC) by generating interpretable reasoning chains before final classification. This approach enhances both the accuracy and the transparency of the detection process.
|
| 9 |
+
|
| 10 |
+
- **Paper:** [Reasoning-Aware AIGC Detection via Alignment and Reinforcement](https://huggingface.co/papers/2604.19172)
|
| 11 |
+
- **Project Page:** [https://aka.ms/reveal](https://aka.ms/reveal)
|
| 12 |
+
- **Repository:** [https://github.com/microsoft/AnthropomorphicIntelligence](https://github.com/microsoft/AnthropomorphicIntelligence)
|
| 13 |
+
|
| 14 |
+
## Introduction
|
| 15 |
+
|
| 16 |
+
The REVEAL framework addresses the challenges of reliable AIGC detection as Large Language Models (LLMs) continue to evolve. It utilizes a two-stage training strategy:
|
| 17 |
+
1. **Supervised Fine-tuning (SFT):** Establishes the model's reasoning capabilities.
|
| 18 |
+
2. **Reinforcement Learning (RL):** Refines accuracy and logical consistency while reducing hallucinations in the reasoning process.
|
| 19 |
+
|
| 20 |
+
The model is trained on **AIGC-text-bank**, a comprehensive multi-domain dataset encompassing diverse LLM sources and authorship scenarios.
|
| 21 |
+
|
| 22 |
+
## Model Details
|
| 23 |
+
- **Architecture**: Based on the `qwen3` model type.
|
| 24 |
+
- **Task**: Text Classification (AIGC Detection).
|
| 25 |
+
- **Training**: Fine-tuned on the AIGC-text-bank dataset using reasoning-aware alignment and reinforcement learning.
|