Safetensors
BWGZK commited on
Commit
f3f9382
·
verified ·
1 Parent(s): 63cfbe2

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +56 -0
README.md ADDED
@@ -0,0 +1,56 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - violetcliff/SmartHome-Bench
5
+ base_model:
6
+ - Qwen/Qwen2.5-VL-7B-Instruct
7
+ ---
8
+
9
+ # DeepIntuit
10
+
11
+ ## Model Description
12
+
13
+ **DeepIntuit** is a reasoning-enhanced video understanding model designed for **open-instance video classification**. Instead of directly mapping visual features to labels, the model learns to generate **intrinsic reasoning traces** that guide the final classification decision, improving robustness under large intra-class variation.
14
+
15
+ The model is introduced in:
16
+
17
+ **From Imitation to Intuition: Intrinsic Reasoning for Open-Instance Video Classification**
18
+ 📄 Paper: [https://arxiv.org/abs/2603.10300](https://arxiv.org/abs/2603.10300)
19
+ 💻 Code: [https://github.com/BWGZK-keke/DeepIntuit](https://github.com/BWGZK-keke/DeepIntuit)
20
+
21
+ ---
22
+
23
+ ## Training Pipeline
24
+
25
+ DeepIntuit is trained through a three-stage pipeline:
26
+
27
+ 1. **Cold Start Alignment**
28
+ Supervised training to initialize structured reasoning generation.
29
+
30
+ 2. **Reasoning Refinement (GRPO)**
31
+ Reinforcement learning improves reasoning quality and prediction consistency.
32
+
33
+ 3. **Intuitive Calibration**
34
+ A lightweight classifier is trained on generated reasoning traces for stable prediction.
35
+
36
+ ---
37
+
38
+ ## Intended Use
39
+
40
+ DeepIntuit is designed for research on:
41
+
42
+ * video understanding
43
+ * open-instance video classification
44
+ * reasoning-enhanced multimodal learning
45
+ * safety-sensitive video analysis
46
+
47
+
48
+ ## Citation
49
+
50
+ ```bibtex
51
+ @article{zhang2026deepintuit,
52
+ title={From Imitation to Intuition: Intrinsic Reasoning for Open-Instance Video Classification},
53
+ author={Zhang, Ke and Zhao, Xiangchen and Tian, Yunjie and Zheng, Jiayu and Patel, Vishal M and Fu, Di},
54
+ year={2026}
55
+ }
56
+ ```