Add model card and metadata for VRAG-DFD

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +33 -0
README.md ADDED
@@ -0,0 +1,33 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ pipeline_tag: image-text-to-text
4
+ ---
5
+
6
+ # VRAG-DFD: Verifiable Retrieval-Augmentation for MLLM-based Deepfake Detection
7
+
8
+ [**Paper**](https://huggingface.co/papers/2604.13660) | [**Official GitHub**](https://github.com/abigcatcat/VRAG-DFD)
9
+
10
+ VRAG-DFD is a framework that introduces Verifiable Retrieval-Augmented Generation (RAG) into the Deepfake Detection (DFD) domain. By combining professional forensic knowledge retrieval with Reinforcement Learning (Group Relative Policy Optimization - GRPO), it empowers Multi-modal Large Language Models (MLLMs) to perform expert-level forensic analysis with critical reasoning.
11
+
12
+ ## Overview
13
+
14
+ In Deepfake Detection (DFD) tasks, existing MLLM-based methods often lack professional forgery knowledge. VRAG-DFD addresses this by:
15
+ - **Accurate Dynamic Retrieval**: Providing high-quality associated forgery knowledge via a Forensic Knowledge Database (FKD).
16
+ - **Critical Reasoning**: Using the Forensic Chain-of-Thought (F-CoT) dataset and RL to help the model distinguish between visual evidence and potentially noisy retrieval information.
17
+ - **Three-Stage Training**: A progressive pipeline consisting of Visual Alignment, Forensic SFT, and Critical RL (GRPO).
18
+
19
+ The model is based on the Qwen2.5-VL architecture and achieves state-of-the-art performance on DFD generalization testing.
20
+
21
+ ## Citation
22
+
23
+ If you use our dataset, code or find VRAG-DFD useful, please cite our paper:
24
+
25
+ ```bibtex
26
+ @article{vragdfd2025,
27
+ title={VRAG-DFD: Verifiable Retrieval-Augmentation for MLLM-based Deepfake Detection},
28
+ author={Hui Han and Shunli Wang and Yandan Zhao and Taiping Yao and Shouhong Ding},
29
+ journal={arXiv preprint},
30
+ year={2026},
31
+ note={Available soon}
32
+ }
33
+ ```