GGUF
conversational
Kaguya-19 commited on
Commit
6c1786d
·
verified ·
1 Parent(s): a5ba9ad

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. AgentCPM-Report-Q4_K_M.gguf +3 -0
  3. README.md +327 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ AgentCPM-Report-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
AgentCPM-Report-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:986f1822c6f9e27503d4cc63656fcbc7f1efd551a6f0153a495a13925087dd8e
3
+ size 4965525920
README.md CHANGED
@@ -1,3 +1,330 @@
1
  ---
2
  license: apache-2.0
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
  ---
4
+ ---
5
+ license: apache-2.0
6
+ ---
7
+ # AgentCPM-Report: Gemini-2.5-pro-DeepResearch Level Local DeepResearch
8
+
9
+ <p align="center">
10
+ <a href='https://huggingface.co/openbmb/AgentCPM-Report'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-AgentCPM--Report-yellow'>
11
+ <a href='https://huggingface.co/openbmb/AgentCPM-Report-GGUF'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-AgentCPM--Report--GGUF-yellow'>
12
+ <a href='https://github.com/OpenBMB/UltraRAG'><img src='https://img.shields.io/badge/GitHub-UltraRAG-blue?logo=github'>
13
+ </p>
14
+
15
+ ## Links
16
+ - [AgentCPM-Report](https://huggingface.co/openbmb/AgentCPM-Report) The Gemini-2.5-pro-DeepResearch Level Local DeepResearch Model
17
+ - [AgentCPM-Report-GGUF](https://huggingface.co/openbmb/AgentCPM-Report-GGUF) The GGUF version
18
+ - [AgentCPM](https://github.com/OpenBMB/AgentCPM) Our code for AgentCPM Series
19
+ - [UltraRAG](https://github.com/OpenBMB/UltraRAG) The low code RAG Framework
20
+
21
+ ## News
22
+ - [2026-01-20] 🚀🚀🚀 We open-sourced AgentCPM-Report built on MiniCPM4.1-8B, capable of matching top closed-source commercial systems like Gemini-2.5-pro-DeepResearch in report generation.
23
+
24
+ ## Overview
25
+ AgentCPM-Report is an open-source large language model agent jointly developed by [THUNLP](https://nlp.csai.tsinghua.edu.cn), Renmin University of China [RUCBM](https://github.com/RUCBM), and [ModelBest](https://modelbest.cn/en). It is based on the [MiniCPM4.1](https://github.com/OpenBMB/MiniCPM4.1) 8B-parameter base model. It accepts user instructions as input and autonomously generates long-form reports. Key highlights:
26
+
27
+ - **Strong advantages in insight and comprehensiveness**: The first 8B edge-side model to surpass closed-source DeepResearch systems on deep research report generation tasks, redefining the performance ceiling for small-scale agent systems—especially achieving SOTA results on the Insight metric.
28
+ - **Lightweight and local deployment**: Supports agile local deployment. With frameworks like UltraRAG, it enables large-scale knowledge base construction and can generate reports that are even more professional and in-depth than large models. Lightweight models plus local knowledge bases make it feasible to deploy a deep-research report writing system on a personal computer, laying the foundation for report writing based on personal privacy data or private-domain data.
29
+
30
+ ## Demo Cases
31
+ `YouTube link or Bilibili link for the video`
32
+
33
+ ## Quick Start
34
+ ### Docker Deployment
35
+ We provide a minimal one-click `docker-compose` deployment integrated with UltraRAG, including the RAG framework UltraRAG2.0, the model inference framework llama.cpp, and the vector database milvus. If you want CPU inference, we also provide a gpu-based version —just switch `docker-compose.cpu.yml` to `docker-compose.yml`.
36
+
37
+ ``` bash
38
+ git clone git@github.com:OpenBMB/UltraRAG.git
39
+ cd UltraRAG
40
+ git checkout agentcpm-report-demo
41
+ cd agentcpm-report-demo
42
+ cp env.example .env
43
+ docker-compose -f docker-compose.cpu.yml up -d --build
44
+ docker-compose -f docker-compose.cpu.yml logs -f ultrarag-ui
45
+ ```
46
+ The first startup pulls images, downloads the model, and configures the environment, which takes about 30 minutes.
47
+ Then open `http://localhost:5050`. If you can see the UI, your deployment is successful.
48
+ Follow the UI instructions to upload local files, chunk them, and build indexes; then in the Chat section, select AgentCPM-Report in the pipeline to start your workflow.
49
+
50
+ (Optional) You can import [Wiki2024](https://modelscope.cn/datasets/UltraRAG/UltraRAG_Benchmark/tree/master/corpus/wiki24) as the writing database.
51
+
52
+ You can read more tutorials about AgentCPM-Report in the [documentation](https://ultrarag.openbmb.cn/pages/cn/pipeline/agentcpm-report).
53
+
54
+
55
+ ## Evaluation
56
+ <table align="center">
57
+ <thead>
58
+ <tr>
59
+ <th align="center">DeepResearch Bench</th>
60
+ <th align="center">Overall</th>
61
+ <th align="center">Comprehensiveness</th>
62
+ <th align="center">Insight</th>
63
+ <th align="center">Instruction Following</th>
64
+ <th align="center">Readability</th>
65
+ </tr>
66
+ </thead>
67
+ <tbody>
68
+ <tr>
69
+ <td align="center">Doubao-research</td>
70
+ <td align="center">44.34</td>
71
+ <td align="center">44.84</td>
72
+ <td align="center">40.56</td>
73
+ <td align="center">47.95</td>
74
+ <td align="center">44.69</td>
75
+ </tr>
76
+ <tr>
77
+ <td align="center">Claude-research</td>
78
+ <td align="center">45</td>
79
+ <td align="center">45.34</td>
80
+ <td align="center">42.79</td>
81
+ <td align="center">47.58</td>
82
+ <td align="center">44.66</td>
83
+ </tr>
84
+ <tr>
85
+ <td align="center">OpenAI-deepresearch</td>
86
+ <td align="center">46.45</td>
87
+ <td align="center">46.46</td>
88
+ <td align="center">43.73</td>
89
+ <td align="center">49.39</td>
90
+ <td align="center">47.22</td>
91
+ </tr>
92
+ <tr>
93
+ <td align="center">Gemini-2.5-Pro-deepresearch</td>
94
+ <td align="center">49.71</td>
95
+ <td align="center">49.51</td>
96
+ <td align="center">49.45</td>
97
+ <td align="center">50.12</td>
98
+ <td align="center">50</td>
99
+ </tr>
100
+ <tr>
101
+ <td align="center">WebWeaver(Qwen3-30B-A3B)</td>
102
+ <td align="center">46.77</td>
103
+ <td align="center">45.15</td>
104
+ <td align="center">45.78</td>
105
+ <td align="center">49.21</td>
106
+ <td align="center">47.34</td>
107
+ </tr>
108
+ <tr>
109
+ <td align="center">WebWeaver(Claude-Sonnet-4)</td>
110
+ <td align="center">50.58</td>
111
+ <td align="center">51.45</td>
112
+ <td align="center">50.02</td>
113
+ <td align="center">50.81</td>
114
+ <td align="center">49.79</td>
115
+ </tr>
116
+ <tr>
117
+ <td align="center">Enterprise-DR(Gemini-2.5-Pro)</td>
118
+ <td align="center">49.86</td>
119
+ <td align="center">49.01</td>
120
+ <td align="center">50.28</td>
121
+ <td align="center">50.03</td>
122
+ <td align="center">49.98</td>
123
+ </tr>
124
+ <tr>
125
+ <td align="center">RhinoInsigh(Gemini-2.5-Pro)</td>
126
+ <td align="center">50.92</td>
127
+ <td align="center">50.51</td>
128
+ <td align="center">51.45</td>
129
+ <td align="center">51.72</td>
130
+ <td align="center">50</td>
131
+ </tr>
132
+ <tr>
133
+ <td align="center">AgentCPM-Report</td>
134
+ <td align="center">50.11</td>
135
+ <td align="center">50.54</td>
136
+ <td align="center">52.64</td>
137
+ <td align="center">48.87</td>
138
+ <td align="center">44.17</td>
139
+ </tr>
140
+ </tbody>
141
+ </table>
142
+
143
+ <table align="center">
144
+ <thead>
145
+ <tr>
146
+ <th align="center">DeepResearch Gym</th>
147
+ <th align="center">Avg.</th>
148
+ <th align="center">Clarity</th>
149
+ <th align="center">Depth</th>
150
+ <th align="center">Balance</th>
151
+ <th align="center">Breadth</th>
152
+ <th align="center">Support</th>
153
+ <th align="center">Insightfulness</th>
154
+ </tr>
155
+ </thead>
156
+ <tbody>
157
+ <tr>
158
+ <td align="center">Doubao-research</td>
159
+ <td align="center">84.46</td>
160
+ <td align="center">68.85</td>
161
+ <td align="center">93.12</td>
162
+ <td align="center">83.96</td>
163
+ <td align="center">93.33</td>
164
+ <td align="center">84.38</td>
165
+ <td align="center">83.12</td>
166
+ </tr>
167
+ <tr>
168
+ <td align="center">Claude-research</td>
169
+ <td align="center">80.25</td>
170
+ <td align="center">86.67</td>
171
+ <td align="center">96.88</td>
172
+ <td align="center">84.41</td>
173
+ <td align="center">96.56</td>
174
+ <td align="center">26.77</td>
175
+ <td align="center">90.22</td>
176
+ </tr>
177
+ <tr>
178
+ <td align="center">OpenAI-deepresearch</td>
179
+ <td align="center">91.27</td>
180
+ <td align="center">84.90</td>
181
+ <td align="center">98.10</td>
182
+ <td align="center">89.80</td>
183
+ <td align="center">97.40</td>
184
+ <td align="center">88.40</td>
185
+ <td align="center">89.00</td>
186
+ </tr>
187
+ <tr>
188
+ <td align="center">Gemini-2.5-pro-deepresearch</td>
189
+ <td align="center">96.02</td>
190
+ <td align="center">90.71</td>
191
+ <td align="center">99.90</td>
192
+ <td align="center">93.37</td>
193
+ <td align="center">99.69</td>
194
+ <td align="center">95.00</td>
195
+ <td align="center">97.45</td>
196
+ </tr>
197
+ <tr>
198
+ <td align="center">WebWeaver (Qwen3-30b-a3b)</td>
199
+ <td align="center">77.27</td>
200
+ <td align="center">71.88</td>
201
+ <td align="center">85.51</td>
202
+ <td align="center">75.80</td>
203
+ <td align="center">84.78</td>
204
+ <td align="center">63.77</td>
205
+ <td align="center">81.88</td>
206
+ </tr>
207
+ <tr>
208
+ <td align="center">WebWeaver (Claude-sonnet-4)</td>
209
+ <td align="center">96.77</td>
210
+ <td align="center">90.50</td>
211
+ <td align="center">99.87</td>
212
+ <td align="center">94.30</td>
213
+ <td align="center">100.00</td>
214
+ <td align="center">98.73</td>
215
+ <td align="center">97.22</td>
216
+ </tr>
217
+ <tr>
218
+ <td align="center">AgentCPM-Report</td>
219
+ <td align="center">98.48</td>
220
+ <td align="center">95.1</td>
221
+ <td align="center">100.0</td>
222
+ <td align="center">98.5</td>
223
+ <td align="center">100.0</td>
224
+ <td align="center">97.3</td>
225
+ <td align="center">100.0</td>
226
+ </tr>
227
+ </tbody>
228
+ </table>
229
+
230
+ <table align="center">
231
+ <thead>
232
+ <tr>
233
+ <th align="center">DeepConsult</th>
234
+ <th align="center">Avg.</th>
235
+ <th align="center">Win</th>
236
+ <th align="center">Tie</th>
237
+ <th align="center">Lose</th>
238
+ </tr>
239
+ </thead>
240
+ <tbody>
241
+ <tr>
242
+ <td align="center">Doubao-research</td>
243
+ <td align="center">5.42</td>
244
+ <td align="center">29.95</td>
245
+ <td align="center">40.35</td>
246
+ <td align="center">29.7</td>
247
+ </tr>
248
+ <tr>
249
+ <td align="center">Claude-research</td>
250
+ <td align="center">4.6</td>
251
+ <td align="center">25</td>
252
+ <td align="center">38.89</td>
253
+ <td align="center">36.11</td>
254
+ </tr>
255
+ <tr>
256
+ <td align="center">OpenAI-deepresearch</td>
257
+ <td align="center">5</td>
258
+ <td align="center">0</td>
259
+ <td align="center">100</td>
260
+ <td align="center">0</td>
261
+ </tr>
262
+ <tr>
263
+ <td align="center">Gemini-2.5-Pro-deepresearch</td>
264
+ <td align="center">6.7</td>
265
+ <td align="center">61.27</td>
266
+ <td align="center">31.13</td>
267
+ <td align="center">7.6</td>
268
+ </tr>
269
+ <tr>
270
+ <td align="center">WebWeaver(Qwen3-30B-A3B)</td>
271
+ <td align="center">4.57</td>
272
+ <td align="center">28.65</td>
273
+ <td align="center">34.9</td>
274
+ <td align="center">36.46</td>
275
+ </tr>
276
+ <tr>
277
+ <td align="center">WebWeaver(Claude-Sonnet-4)</td>
278
+ <td align="center">6.96</td>
279
+ <td align="center">66.86</td>
280
+ <td align="center">10.47</td>
281
+ <td align="center">22.67</td>
282
+ </tr>
283
+ <tr>
284
+ <td align="center">Enterprise-DR(Gemini-2.5-Pro)</td>
285
+ <td align="center">6.82</td>
286
+ <td align="center">71.57</td>
287
+ <td align="center">19.12</td>
288
+ <td align="center">9.31</td>
289
+ </tr>
290
+ <tr>
291
+ <td align="center">RhinoInsigh(Gemini-2.5-Pro)</td>
292
+ <td align="center">6.82</td>
293
+ <td align="center">68.51</td>
294
+ <td align="center">11.02</td>
295
+ <td align="center">20.47</td>
296
+ </tr>
297
+ <tr>
298
+ <td align="center">AgentCPM-Report</td>
299
+ <td align="center">6.6</td>
300
+ <td align="center">57.6</td>
301
+ <td align="center">13.73</td>
302
+ <td align="center">28.68</td>
303
+ </tr>
304
+ </tbody>
305
+ </table>
306
+
307
+ Our evaluation datasets include DeepResearch Bench, DeepConsult, and DeepResearch Gym. The writing-time knowledge base includes about 2.7 million [Arxiv papers](https://www.kaggle.com/api/v1/datasets/download/Cornell-University/arxiv) and about 200,000 internal webpage summaries.
308
+
309
+ ## Acknowledgements
310
+ This project would not be possible without the support and contributions of the open-source community. During development, we referred to and used multiple excellent open-source frameworks, models, and data resources, including [verl](https://github.com/volcengine/verl), [UltraRAG](https://github.com/OpenBMB/UltraRAG), [MiniCPM4.1](https://github.com/OpenBMB/MiniCPM4.1), and [SurveyGo](https://surveygo.modelbest.cn/).
311
+
312
+ ## Contributions
313
+ Project leads: Yishan Li, Wentong Chen
314
+
315
+ Contributors: Yishan Li, Wentong Chen, Yukun Yan, Mingwei Li, Sen Mei, Xiaorong Wang, Kunpeng Liu, Xin Cong, Shuo Wang, Zhong Zhang, Yaxi Lu, Zhenghao Liu, Yankai Lin, Zhiyuan Liu, Maosong Sun
316
+
317
+ Advisors: Yukun Yan, Yankai Lin, Zhiyuan Liu, Maosong Sun
318
+
319
+ ## Citation
320
+
321
+ If **AgentCPM-Report** is helpful for your research, please cite it as follows:
322
+
323
+ ```bibtex
324
+ @software{AgentCPMReport2026,
325
+ title = {AgentCPM-Report: Gemini-2.5-pro-DeepResearch Level Local DeepResearch},
326
+ author = {Yishan Li, Wentong Chen, Yukun Yan, Mingwei Li, Sen Mei, Xiaorong Wang, Kunpeng Liu, Xin Cong, Shuo Wang, Zhong Zhang, Yaxi Lu, Zhenghao Liu, Yankai Lin, Zhiyuan Liu, Maosong Sun},
327
+ year = {2026},
328
+ url = {https://github.com/OpenBMB/AgentCPM}
329
+ }
330
+ ```