adaamko committed on
Commit
9d38282
·
verified ·
1 Parent(s): f97498e

Add extractive alternative section, restore library_name

  1. README.md +26 -21
README.md CHANGED
@@ -1,18 +1,18 @@
  ---
- base_model: Qwen/Qwen3.5-2B
- datasets:
- - KRLabsOrg/tool-output-extraction-swebench
+ license: apache-2.0
  language:
  - en
- license: apache-2.0
- pipeline_tag: text-generation
- library_name: transformers
  tags:
  - code
  - tool-output
  - pruning
  - coding-agents
  - extraction
+ datasets:
+ - KRLabsOrg/tool-output-extraction-swebench
+ base_model: Qwen/Qwen3.5-2B
+ pipeline_tag: text-generation
+ library_name: transformers
  thumbnail: https://raw.githubusercontent.com/KRLabsOrg/squeez/main/assets/squeez_mascot.png
  ---

@@ -116,18 +116,11 @@ messages = [
  "Do not rewrite, summarize, or invent lines."
  )},
  {"role": "user", "content": (
- "<query>
- Find the failing authentication test
- </query>
- "
- "<tool_output>
- "
- "PASSED tests/test_login.py::test_valid_credentials
- "
- "FAILED tests/test_login.py::test_token_refresh - AssertionError: expected 200 got 401
- "
- "PASSED tests/test_login.py::test_logout
- "
+ "<query>\nFind the failing authentication test\n</query>\n"
+ "<tool_output>\n"
+ "PASSED tests/test_login.py::test_valid_credentials\n"
+ "FAILED tests/test_login.py::test_token_refresh - AssertionError: expected 200 got 401\n"
+ "PASSED tests/test_login.py::test_logout\n"
  "</tool_output>"
  )},
  ]
@@ -149,8 +142,7 @@ print(response)

  **Input** — Chat messages with system prompt:
  - System: extraction instructions (see above)
- - User: `<query>{task}</query>
- <tool_output>{raw_output}</tool_output>`
+ - User: `<query>{task}</query>\n<tool_output>{raw_output}</tool_output>`

  **Output** — Verbatim lines in XML tags:
  ```
@@ -191,6 +183,19 @@ Examples:
  - cat src/auth/middleware.py | squeez "find the referer validation logic"
  ```

+ ## Smaller extractive alternative
+
+ If you don't need a generative model, swap in
+ [`KRLabsOrg/verbatim-rag-modern-bert-v2`](https://huggingface.co/KRLabsOrg/verbatim-rag-modern-bert-v2)
+ (150M ModernBERT span model). Same CLI:
+
+ ```bash
+ export SQUEEZ_LOCAL_MODEL=KRLabsOrg/verbatim-rag-modern-bert-v2
+ pytest -q 2>&1 | squeez "find the failing test"
+ ```
+
+ Head-to-head numbers are on the [verbatim-rag-modern-bert-v2 model card](https://huggingface.co/KRLabsOrg/verbatim-rag-modern-bert-v2).
+
  ## Limitations

  - Best on software engineering tool output; not designed for general-purpose summarization
@@ -214,4 +219,4 @@ Apache 2.0
  primaryClass={cs.SE},
  url={https://arxiv.org/abs/2604.04979},
  }
- ```
+ ```
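
The second hunk of this commit replaces string literals that contained raw line breaks (not valid in a Python `"..."` literal) with explicit `\n` escapes. As a rough illustration of the resulting user-message layout, the sketch below builds the same string from a task and a raw tool output. `build_user_message` is a hypothetical helper name, not part of squeez or transformers, and the placement of newlines inside the tags is an assumption taken from the README example rather than a documented requirement.

```python
def build_user_message(task: str, raw_output: str) -> str:
    """Wrap a task and raw tool output in <query>/<tool_output> tags.

    Hypothetical helper: the layout mirrors the README example in this
    commit (a newline after each opening tag and before each closing tag).
    """
    # Drop trailing newlines so exactly one separates the body from the closing tag.
    body = raw_output.rstrip("\n")
    return f"<query>\n{task}\n</query>\n<tool_output>\n{body}\n</tool_output>"


# Same sample data as the README example touched by this commit.
raw = (
    "PASSED tests/test_login.py::test_valid_credentials\n"
    "FAILED tests/test_login.py::test_token_refresh - AssertionError: expected 200 got 401\n"
    "PASSED tests/test_login.py::test_logout\n"
)
message = build_user_message("Find the failing authentication test", raw)
print(message)
```

The returned string can be used as the `content` of the `user` entry in `messages`; the system prompt is left as the README gives it.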