Safetensors
English
qwen2

Enhance model card: Add pipeline tag, library name, paper and code links

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +33 -6
README.md CHANGED
@@ -1,16 +1,22 @@
1
  ---
2
- license: mit
 
3
  datasets:
4
  - AceSearcher/Search-SFT
5
  - AceSearcher/Search-RFT-Prompts
6
  language:
7
  - en
8
- base_model:
9
- - Qwen/Qwen2.5-1.5B-Instruct
 
10
  ---
 
11
  ## Introduction
12
  Here is the checkpoint used in the paper **AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play**. It uses `Qwen-2.5-Instruct-1.5B` as the backbone.
13
 
 
 
 
14
  ## Model Usage
15
  For question decomposition on QA tasks:
16
  ```
@@ -109,14 +115,35 @@ Wrap your answer with <answer> and </answer> tags."""
109
 
110
  For Decomposition for document-level financial reasoning tasks:
111
  ```
112
- decompose_prompt = """You have the following passages and table:\nPassages:\n{passage}\nPlease break down the question '{question}' into multiple specific sub-questions that address individual components of the original question, with the table and passages as the reference. Use ### to mark the start of each sub-question."""
 
 
 
 
 
 
 
 
 
113
 
114
- qa_prompt = """You have the following passages and table:\nPassages:\n{passage}\nFor the question '{question}', here is a referenced breakdown:\n{decompose}.\n\nWrite a Python program to solve the question. Store the final result in the variable ans."""
115
 
116
 
117
  question = "What would the change in furniture and fixtures between 2018 and 2019 be if furniture and fixtures were $5,000 thousand in 2018 instead? (in thousand)"
118
 
119
- context_text = "\n|||December 31,||\n||Useful Life|2019|2018|\n|Computer equipment and software|3 \u2013 5 years|$57,474|$52,055|\n|Furniture and fixtures|7 years|6,096|4,367|\n|Leasehold improvements|2 \u2013 6 years|22,800|9,987|\n|Renovation in progress|n/a|8|1,984|\n|Build-to-suit property|25 years|\u2014|51,058|\n|Total property and equipment, gross||86,378|119,451|\n|Less: accumulated depreciation and amortization||(49,852)|(42,197)|\n|Total property and equipment, net||$36,526|$77,254|\n 7. OTHER BALANCE SHEET AMOUNTS The components of property and equipment, net is as follows (in thousands): Depreciation expense for the years ended December 31, 2019, 2018, and 2017 was $11.8 million, $10.2 million, and $10.3 million, respectively.\n"
 
 
 
 
 
 
 
 
 
 
 
 
120
 
121
  decompose_prompt = decompose_prompt.replace("{passage}" , context_text)
122
  decompose_prompt = decompose_prompt.replace("{question}", question)
 
1
  ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-1.5B-Instruct
4
  datasets:
5
  - AceSearcher/Search-SFT
6
  - AceSearcher/Search-RFT-Prompts
7
  language:
8
  - en
9
+ license: mit
10
+ pipeline_tag: text-generation
11
+ library_name: transformers
12
  ---
13
+
14
  ## Introduction
15
  Here is the checkpoint used in the paper **AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play**. It uses `Qwen-2.5-Instruct-1.5B` as the backbone.
16
 
17
+ **Paper:** [AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play](https://huggingface.co/papers/2509.24193)
18
+ **Code:** https://github.com/ritaranx/AceSearcher/
19
+
20
  ## Model Usage
21
  For question decomposition on QA tasks:
22
  ```
 
115
 
116
  For Decomposition for document-level financial reasoning tasks:
117
  ```
118
+ decompose_prompt = """You have the following passages and table:
119
+ Passages:
120
+ {passage}
121
+ Please break down the question '{question}' into multiple specific sub-questions that address individual components of the original question, with the table and passages as the reference. Use ### to mark the start of each sub-question."""
122
+
123
+ qa_prompt = """You have the following passages and table:
124
+ Passages:
125
+ {passage}
126
+ For the question '{question}', here is a referenced breakdown:
127
+ {decompose}.
128
 
129
+ Write a Python program to solve the question. Store the final result in the variable ans."""
130
 
131
 
132
  question = "What would the change in furniture and fixtures between 2018 and 2019 be if furniture and fixtures were $5,000 thousand in 2018 instead? (in thousand)"
133
 
134
+ context_text = "
135
+ |||December 31,||
136
+ ||Useful Life|2019|2018|
137
+ |Computer equipment and software|3 \u2013 5 years|$57,474|$52,055|
138
+ |Furniture and fixtures|7 years|6,096|4,367|
139
+ |Leasehold improvements|2 \u2013 6 years|22,800|9,987|
140
+ |Renovation in progress|n/a|8|1,984|
141
+ |Build-to-suit property|25 years|\u2014|51,058|
142
+ |Total property and equipment, gross||86,378|119,451|
143
+ |Less: accumulated depreciation and amortization||(49,852)|(42,197)|
144
+ |Total property and equipment, net||$36,526|$77,254|
145
+ 7. OTHER BALANCE SHEET AMOUNTS The components of property and equipment, net is as follows (in thousands): Depreciation expense for the years ended December 31, 2019, 2018, and 2017 was $11.8 million, $10.2 million, and $10.3 million, respectively.
146
+ "
147
 
148
  decompose_prompt = decompose_prompt.replace("{passage}" , context_text)
149
  decompose_prompt = decompose_prompt.replace("{question}", question)