nielsr HF Staff committed on
Commit
e2465c7
·
verified ·
1 Parent(s): 2c22344

Improve model card: Add Fin-PRM paper link, pipeline tag, library name, and tags

This PR significantly improves the model card for the `DianJin-R1-7B` model by:

- **Linking to the presenting paper**: Adding a prominent link to "[Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models](https://huggingface.co/papers/2508.15202)" at the top of the card.
- **Adding BibTeX**: Including the BibTeX citation for the Fin-PRM paper in the `Citation` section.
- **Adding `pipeline_tag: text-generation`**: This ensures the model is discoverable under the appropriate task on the Hugging Face Hub.
- **Adding `library_name: transformers`**: This enables the automated "How to use in Transformers" widget, as the existing `Quickstart` code snippet explicitly uses the `transformers` library (`AutoModelForCausalLM`, `AutoTokenizer`).
- **Adding relevant `tags`**: Including `financial`, `qwen`, and `llm` for better searchability and categorization.

These changes enhance the model's visibility and user experience on the Hugging Face Hub.
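
As a rough illustration of what the new metadata block contains, the sketch below parses the YAML front matter that this PR adds to `README.md`. The `parse_front_matter` helper and the `README_AFTER` string are hypothetical names introduced here for illustration; they are not part of the repository or of any Hugging Face library, and the parser only handles the simple `key: value` and `- item` forms used in this card.

```python
def parse_front_matter(readme_text: str) -> dict:
    """Parse the simple YAML front matter at the top of a model card.

    Hypothetical helper: handles only `key: value` pairs and `- item`
    lists, which is all this card uses. It is not a general YAML parser.
    """
    lines = readme_text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}  # no front matter block at the top of the file
    metadata = {}
    current_key = None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":  # closing delimiter ends the block
            break
        if stripped.startswith("- ") and isinstance(metadata.get(current_key), list):
            # List item belonging to the most recent key (e.g. `tags`).
            metadata[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            current_key = key.strip()
            # An empty value means a list follows on the next lines.
            metadata[current_key] = value.strip() or []
    return metadata


# Front matter as it appears on the new side of this diff.
README_AFTER = """---
license: mit
pipeline_tag: text-generation
library_name: transformers
tags:
- financial
- qwen
- llm
---

## DianJin-R1-7B
"""

card_metadata = parse_front_matter(README_AFTER)
print(card_metadata)
```

The Hub reads these fields from the card itself, so a check like this is only useful as a quick local sanity test before opening a PR.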

Files changed (1)
  1. README.md +149 -1
README.md CHANGED
@@ -1,9 +1,17 @@
1
  ---
2
  license: mit
 
 
 
 
 
 
3
  ---
4
 
5
  ## DianJin-R1-7B
6
 
 
 
7
  <div align="center">
8
  <img alt="image" src="https://raw.githubusercontent.com/aliyun/qwen-dianjin/refs/heads/master/images/dianjin_logo.png">
9
  <p align="center">
@@ -43,7 +51,9 @@ model = AutoModelForCausalLM.from_pretrained(
43
  )
44
  tokenizer = AutoTokenizer.from_pretrained(model_name)
45
 
46
- prompt = "假设你是一位金融行业专家,请回答下列问题。\n在宏观分析中,描述在既定利率水平下产品市场达到均衡状态的曲线是什么?\n请一步步思考。"
 
 
47
  messages = [
48
  {"role": "system", "content": "You are Qwen, created by Alibaba Cloud. You are a helpful assistant."},
49
  {"role": "user", "content": prompt}
@@ -66,3 +76,141 @@ generated_ids = [
66
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
67
  ```
68
 
1
  ---
2
  license: mit
3
+ pipeline_tag: text-generation
4
+ library_name: transformers
5
+ tags:
6
+ - financial
7
+ - qwen
8
+ - llm
9
  ---
10
 
11
  ## DianJin-R1-7B
12
 
13
+ This repository hosts the **DianJin-R1-7B** model, a model for financial reasoning, further explored in the context of [Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models](https://huggingface.co/papers/2508.15202).
14
+
15
  <div align="center">
16
  <img alt="image" src="https://raw.githubusercontent.com/aliyun/qwen-dianjin/refs/heads/master/images/dianjin_logo.png">
17
  <p align="center">
 
51
  )
52
  tokenizer = AutoTokenizer.from_pretrained(model_name)
53
 
54
+ prompt = ("假设你是一位金融行业专家,请回答下列问题。\n"
55
+           "在宏观分析中,描述在既定利率水平下产品市场达到均衡状态的曲线是什么?\n"
56
+           "请一步步思考。")
57
  messages = [
58
  {"role": "system", "content": "You are Qwen, created by Alibaba Cloud. You are a helpful assistant."},
59
  {"role": "user", "content": prompt}
 
76
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
77
  ```
78
 
79
+ ## 🚀 News
80
+ - **2025.08.08** 🔥🔥🔥 "[Evaluating, Synthesizing, and Enhancing for Customer Support Conversation](https://arxiv.org/abs/2508.04423)" has been published and open-sourced!
81
+ - **2025.05.22** 🔥🔥🔥 "[M<sup>3</sup>FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset](https://arxiv.org/abs/2506.02510)" has been officially accepted by ACL-2025!
82
+ - **2025.04.23** The [DianJin-R1](DianJin-R1/README.md) series is now open source! This release includes the DianJin-R1-Data dataset and two models: DianJin-R1-7B and DianJin-R1-32B. See our technical report "[DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models](https://arxiv.org/abs/2504.15716)" for details.
83
+ - **2025.01.06** The [CFLUE](https://github.com/aliyun/cflue) dataset has been fully open-sourced and is now available for download! 🚀🚀🚀
84
+ - **2024.05.16** The paper "[Benchmarking Large Language Models on CFLUE - A Chinese Financial Language Understanding Evaluation Dataset](https://arxiv.org/abs/2405.10542)" has been officially accepted by ACL-2024! 🚀🚀🚀
85
+
86
+ The **data** and **models** that have been released so far are as follows:
87
+
88
+ <table style="width: 100%; text-align: center;">
89
+ <tr>
90
+ <td></td>
91
+ <th>ModelScope</th>
92
+ <th>HuggingFace</th>
93
+ <th>Paper</th>
94
+ </tr>
95
+ <tr>
96
+ <th>CSC</th>
97
+ <td><a href="https://www.modelscope.cn/datasets/tongyi_dianjin/DianJin-CSC-Data">CSC</a></td>
98
+ <td><a href="https://huggingface.co/datasets/DianJin/DianJin-CSC-Data">CSC</a></td>
99
+ <td><a href="https://arxiv.org/abs/2508.04423">Paper</a></td>
100
+ </tr>
101
+ <tr>
102
+ <th>M<sup>3</sup>FinMeeting</th>
103
+ <td colspan="2">Releasing Soon</td>
104
+ <td><a href="https://arxiv.org/abs/2506.02510">ACL-2025</a></td>
105
+ </tr>
106
+ <tr>
107
+ <th rowspan="3">DianJin-R1</th>
108
+ <td><a href="https://www.modelscope.cn/models/tongyi_dianjin/DianJin-R1-32B">DianJin-R1-32B</a></td>
109
+ <td><a href="https://huggingface.co/DianJin/DianJin-R1-32B">DianJin-R1-32B</a></td>
110
+ <td rowspan="3"><a href="https://arxiv.org/abs/2504.15716">Technical Report</a></td>
111
+ </tr>
112
+ <tr>
113
+ <td><a href="https://www.modelscope.cn/models/tongyi_dianjin/DianJin-R1-7B">DianJin-R1-7B</a></td>
114
+ <td><a href="https://huggingface.co/DianJin/DianJin-R1-7B">DianJin-R1-7B</a></td>
115
+ </tr>
116
+ <tr>
117
+ <td><a href="https://www.modelscope.cn/datasets/tongyi_dianjin/DianJin-R1-Data">DianJin-R1-Data</a></td>
118
+ <td><a href="https://huggingface.co/datasets/DianJin/DianJin-R1-Data">DianJin-R1-Data</a></td>
119
+ </tr>
120
+ <tr>
121
+ <th>CFLUE</th>
122
+ <td><a href="https://modelscope.cn/datasets/tongyi_dianjin/CFLUE">CFLUE</a></td>
123
+ <td><a href="https://huggingface.co/datasets/DianJin/CFLUE">CFLUE</a></td>
124
+ <td><a href="https://arxiv.org/abs/2405.10542">ACL-2024</a></td>
125
+ </tr>
126
+ </table>
127
+
128
+ ## 📝 Introduction
129
+
130
+ Welcome to Qwen DianJin 👋
131
+
132
+ Tongyi DianJin is a financial intelligence solution platform built by Alibaba Cloud,
133
+ dedicated to providing financial business developers with a convenient artificial intelligence application development environment.
134
+ Beyond releasing advanced large language models (LLMs) and large multimodal models (LMMs), the platform serves as a financial assistant that integrates a range of artificial intelligence technologies.
135
+ Through our platform, you can explore and experience innovative applications related to artificial general intelligence (AGI), driving development and innovation in the financial sector.
136
+
137
+ We invite you to explore the platform and join us on the journey toward intelligent finance!
138
+
139
+ ## ✨ Features
140
+
141
+ ### 💡 Intelligent Applications
142
+
143
+ Provide standardized API capabilities for financial scenarios, such as research report summarization, information extraction from news, and intent recognition for financial customer service.
144
+
145
+ - ✅ Financial Services: Such as credit card repayment reminders, mobile banking navigation, renewal prompts, marketing material generation, etc.
146
+ - ✅ Investment Research & News: Such as research report summarization, information extraction, financial translation, trading metrics Q&A, etc.
147
+ - ✅ Operational Data Query: Such as operational metrics Q&A, anomaly alerts, and other intelligent operational capabilities.
148
+ - ...
149
+
150
+ ### 💡 Open Platform
151
+
152
+ Equip developers with a suite of financial APIs and tools, making it easy to integrate and extend functionality.
153
+
154
+ - ✅ Document Q&A: Optimized document parsing and recall ranking strategies, providing knowledge base Q&A capabilities tailored for financial scenarios.
155
+ - ✅ Metrics Q&A: Answers questions about metrics and plots them, improving the understanding of financial expertise.
156
+ - ✅ Multi-Agent System: Includes configuration and orchestration of various types of nodes, supporting more personalized configurations based on the capabilities provided by DianJin.
157
+ - ...
158
+
159
+ ## 🔖 Citation
160
+
161
+ If you find our work helpful, please consider citing it.
162
+
163
+ ```bibtex
164
+ @article{finprm2025,
165
+ title={Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models},
166
+ author={Zhu, Jie and Dou, Huaixia and Li, Junhui and Guo, Lifan and Chen, Feng and Zhang, Chi},
167
+ journal={arXiv preprint arXiv:2508.15202},
168
+ year={2025},
169
+ url={https://arxiv.org/abs/2508.15202}
170
+ }
171
+
172
+ @article{csc,
173
+ title = {Evaluating, Synthesizing, and Enhancing for Customer Support Conversation},
174
+ author = {Jie Zhu and Huaixia Dou and Junhui Li and Lifan Guo and Feng Chen and Chi Zhang and Fang Kong},
175
+ journal = {arXiv preprint arXiv:2508.04423},
176
+ year = "2025"
177
+ }
178
+
179
+ @inproceedings{m3finmeeting,
180
+ title = "M$^{3}$FinMeeting: A Multilingual, Multi-Sector, and Multi-Task Financial Meeting Understanding Evaluation Dataset",
181
+ author = "Jie Zhu and Junhui Li and Yalong Wen and Xiandong Li and Lifan Guo and Feng Chen",
182
+ booktitle = "Findings of ACL",
183
+ year = "2025"
184
+ }
185
+
186
+ @article{dianjin-r1,
187
+ title = {DianJin-R1: Evaluating and Enhancing Financial Reasoning in Large Language Models},
188
+ author = {Jie Zhu and Qian Chen and Huaixia Dou and Junhui Li and Lifan Guo and Feng Chen and Chi Zhang},
189
+ journal = {arXiv preprint arXiv:2504.15716},
190
+ year = "2025"
191
+ }
192
+
193
+ @inproceedings{cflue,
194
+ title = "Benchmarking Large Language Models on CFLUE - A Chinese Financial Language Understanding Evaluation Dataset",
195
+ author = "Jie Zhu and Junhui Li and Yalong Wen and Lifan Guo",
196
+ booktitle = "Findings of ACL",
197
+ year = "2024",
198
+ pages = "5673--5693",
199
+ }
200
+ ```
201
+
202
+ ## 🤝 Contact Us
203
+
204
+ Thank you very much for your interest in the Tongyi DianJin series!
205
+ If you would like to leave a message for our research or product team, contact us at our official email, CFLUE@alibabacloud.com, or scan the code below to join our DingTalk group.
206
+ Our team is committed to providing you with assistance and support.
207
+
208
+ <img src="images/dianjin_dingding.png" alt="DianJin DingTalk group code" style="width: 200px;">
209
+
210
+
211
+ ## ⚠️ Disclaimer
212
+
213
+ We assume no legal liability for the use of the DianJin open-source model and data. Users are responsible for independently assessing and assuming any potential risks associated with using the DianJin model or data, and should always exercise caution.
214
+ We recommend that users independently verify and analyze the model's outputs, and make informed decisions based on their specific needs and real-world scenarios.
215
+ By providing open-source data and models, we aim to offer valuable tools for academic research and industry applications, promoting advancements in artificial intelligence technology within data analysis, financial innovation, and other related fields.
216
+ We encourage users to fully leverage their creativity, deeply explore the potential of the DianJin model, expand its application scenarios, and collectively drive progress and practical implementation of AI technologies across various domains.