realolipop commited on
Commit
32c3b05
·
verified ·
1 Parent(s): 363f471

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -171,7 +171,7 @@ We report results derived from the Agentless scaffold. Departing from the origin
171
  "sphinx-doc__sphinx-8475"
172
 
173
  ### TAU-bench methodology
174
- We evaluate TAU-Bench with the average passrate of 5 samples for each query, with GPT-4.1 as user model and without any custom tools. The maximum number of interaction steps is 40.
175
  Our general system prompt is:
176
  ```
177
  - In each round, you need to carefully examine the tools provided to you to determine if any can be used.
 
171
  "sphinx-doc__sphinx-8475"
172
 
173
  ### TAU-bench methodology
174
+ We evaluate TAU-Bench with GPT-4.1 as user model and without any custom tools. The maximum number of interaction steps is 40.
175
  Our general system prompt is:
176
  ```
177
  - In each round, you need to carefully examine the tools provided to you to determine if any can be used.