Update README.md
Browse files
README.md
CHANGED
|
@@ -171,7 +171,7 @@ We report results derived from the Agentless scaffold. Departing from the origin
|
|
| 171 |
"sphinx-doc__sphinx-8475"
|
| 172 |
|
| 173 |
### TAU-bench methodology
|
| 174 |
-
We evaluate TAU-Bench with the average passrate of 5 samples for each query, with GPT-4.1 as user model and without any custom tools. The maximum number of interaction steps is
|
| 175 |
We prepend a general principle to the policy prompt.
|
| 176 |
#### General
|
| 177 |
- In each round, you need to carefully examine the tools provided to you to determine if any can be used.
|
|
|
|
| 171 |
"sphinx-doc__sphinx-8475"
|
| 172 |
|
| 173 |
### TAU-bench methodology
|
| 174 |
+
We evaluate TAU-Bench with the average passrate of 5 samples for each query, with GPT-4.1 as user model and without any custom tools. The maximum number of interaction steps is 40.
|
| 175 |
We prepend a general principle to the policy prompt.
|
| 176 |
#### General
|
| 177 |
- In each round, you need to carefully examine the tools provided to you to determine if any can be used.
|