Update model card with pipeline tag and project links

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +37 -7
README.md CHANGED
@@ -1,16 +1,46 @@
1
  ---
2
- license: apache-2.0
3
- language:
4
- - en
5
- - zh
6
  base_model:
7
  - Qwen/Qwen2.5-VL-7B-Instruct
8
  datasets:
9
  - Gnonymous/Web-CogDataset
 
 
 
 
 
10
  ---
11
 
12
- This model was the Web-CogReasoner model mentioned in the paper [Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents](https://huggingface.co/papers/2508.01858).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
 
14
- Web-CogReasoner is trained using our [Web-CogDataset](https://huggingface.co/datasets/Gnonymous/Web-CogDataset).
15
 
16
- It achieves 84.4 @ Web-CogBench, 86.3 @ VisualWebBench, 30.2% @ WebVoyager, 17.0% and 10.1% @ Online Multimodal-Mind2Web Cross-Tasks and Cross-Webs
 
 
 
 
 
 
 
 
1
  ---
 
 
 
 
2
  base_model:
3
  - Qwen/Qwen2.5-VL-7B-Instruct
4
  datasets:
5
  - Gnonymous/Web-CogDataset
6
+ language:
7
+ - en
8
+ - zh
9
+ license: apache-2.0
10
+ pipeline_tag: image-text-to-text
11
  ---
12
 
13
+ # Web-CogReasoner
14
+
15
+ [**Web-CogReasoner**](https://huggingface.co/papers/2508.01858) is a knowledge-driven multimodal agent designed for cognitive reasoning in web environments. It introduces a paradigm shift by systematically building agent capabilities through a two-stage training process: knowledge content learning (Factual, Conceptual) and cognitive processes (Procedural).
16
+
17
+ - **Paper:** [Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents](https://huggingface.co/papers/2508.01858)
18
+ - **Project Page:** [https://eohan.me/Web-CogReasoner](https://eohan.me/Web-CogReasoner)
19
+ - **Repository:** [https://github.com/Gnonymous/Web-CogReasoner](https://github.com/Gnonymous/Web-CogReasoner)
20
+
21
+ Web-CogReasoner is trained using the [Web-CogDataset](https://huggingface.co/datasets/Gnonymous/Web-CogDataset) and employs a novel knowledge-driven Chain-of-Thought (CoT) reasoning framework to generalize to unseen web tasks.
22
+
23
+ ## Performance
24
+
25
+ Web-CogReasoner demonstrates significant superiority over existing models across various benchmarks:
26
+
27
+ | Benchmark | Score |
28
+ | :--- | :---: |
29
+ | Web-CogBench | 84.4 |
30
+ | VisualWebBench | 86.3 |
31
+ | WebVoyager | 30.2% |
32
+ | Online Multimodal-Mind2Web (Cross-Tasks) | 17.0% |
33
+ | Online Multimodal-Mind2Web (Cross-Webs) | 10.1% |
34
+
35
+ ## Citation
36
 
37
+ If you find this work helpful, please cite the following paper:
38
 
39
+ ```bibtex
40
+ @article{guo2025web,
41
+ title={Web-CogReasoner: Towards Knowledge-Induced Cognitive Reasoning for Web Agents},
42
+ author={Guo, Yuhan and Guo, Cong and Sun, Aiwen and He, Hongliang and Yang, Xinyu and Lu, Yue and Zhang, Yingji and Guo, Xuntao and Zhang, Dong and Liu, Jianzhuang and others},
43
+ journal={arXiv preprint arXiv:2508.01858},
44
+ year={2025}
45
+ }
46
+ ```