AceSearcher
/

AceSearcher-1.5B

@@ -1,16 +1,22 @@
 ---
-license: mit
 datasets:
 - AceSearcher/Search-SFT
 - AceSearcher/Search-RFT-Prompts
 language:
 - en
-base_model:
-- Qwen/Qwen2.5-1.5B-Instruct
 ---
 ## Introduction
 Here is the checkpoint used in the paper **AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play**. It uses `Qwen-2.5-Instruct-1.5B` as the backbone.
 ## Model Usage
 For question decomposition on QA tasks:
 ```
@@ -109,14 +115,35 @@ Wrap your answer with <answer> and </answer> tags."""
 For Decomposition for document-level financial reasoning tasks:
 ```
-decompose_prompt = """You have the following passages and table:\nPassages:\n{passage}\nPlease break down the question '{question}' into multiple specific sub-questions that address individual components of the original question, with the table and passages as the reference. Use ### to mark the start of each sub-question."""
-qa_prompt = """You have the following passages and table:\nPassages:\n{passage}\nFor the question '{question}', here is a referenced breakdown:\n{decompose}.\n\nWrite a Python program to solve the question. Store the final result in the variable ans."""
 question = "What would the change in furniture and fixtures between 2018 and 2019 be if furniture and fixtures were $5,000 thousand in 2018 instead? (in thousand)"
-context_text = "\n|||December 31,||\n||Useful Life|2019|2018|\n|Computer equipment and software|3 \u2013 5 years|$57,474|$52,055|\n|Furniture and fixtures|7 years|6,096|4,367|\n|Leasehold improvements|2 \u2013 6 years|22,800|9,987|\n|Renovation in progress|n/a|8|1,984|\n|Build-to-suit property|25 years|\u2014|51,058|\n|Total property and equipment, gross||86,378|119,451|\n|Less: accumulated depreciation and amortization||(49,852)|(42,197)|\n|Total property and equipment, net||$36,526|$77,254|\n 7. OTHER BALANCE SHEET AMOUNTS The components of property and equipment, net is as follows (in thousands): Depreciation expense for the years ended December 31, 2019, 2018, and 2017 was $11.8 million, $10.2 million, and $10.3 million, respectively.\n"
 decompose_prompt = decompose_prompt.replace("{passage}" , context_text)
 decompose_prompt = decompose_prompt.replace("{question}", question)

 ---
+base_model:
+- Qwen/Qwen2.5-1.5B-Instruct
 datasets:
 - AceSearcher/Search-SFT
 - AceSearcher/Search-RFT-Prompts
 language:
 - en
+license: mit
+pipeline_tag: text-generation
+library_name: transformers
 ---
 ## Introduction
 Here is the checkpoint used in the paper **AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play**. It uses `Qwen-2.5-Instruct-1.5B` as the backbone.
+**Paper:** [AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play](https://huggingface.co/papers/2509.24193)
+**Code:** https://github.com/ritaranx/AceSearcher/
 ## Model Usage
 For question decomposition on QA tasks:
 ```
 For Decomposition for document-level financial reasoning tasks:
 ```
+decompose_prompt = """You have the following passages and table:
+Passages:
+{passage}
+Please break down the question '{question}' into multiple specific sub-questions that address individual components of the original question, with the table and passages as the reference. Use ### to mark the start of each sub-question."""
+qa_prompt = """You have the following passages and table:
+Passages:
+{passage}
+For the question '{question}', here is a referenced breakdown:
+{decompose}.
+Write a Python program to solve the question. Store the final result in the variable ans."""
 question = "What would the change in furniture and fixtures between 2018 and 2019 be if furniture and fixtures were $5,000 thousand in 2018 instead? (in thousand)"
+context_text = "
+|||December 31,||
+||Useful Life|2019|2018|
+|Computer equipment and software|3 \u2013 5 years|$57,474|$52,055|
+|Furniture and fixtures|7 years|6,096|4,367|
+|Leasehold improvements|2 \u2013 6 years|22,800|9,987|
+|Renovation in progress|n/a|8|1,984|
+|Build-to-suit property|25 years|\u2014|51,058|
+|Total property and equipment, gross||86,378|119,451|
+|Less: accumulated depreciation and amortization||(49,852)|(42,197)|
+|Total property and equipment, net||$36,526|$77,254|
+ 7. OTHER BALANCE SHEET AMOUNTS The components of property and equipment, net is as follows (in thousands): Depreciation expense for the years ended December 31, 2019, 2018, and 2017 was $11.8 million, $10.2 million, and $10.3 million, respectively.
+"
 decompose_prompt = decompose_prompt.replace("{passage}" , context_text)
 decompose_prompt = decompose_prompt.replace("{question}", question)