You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[Eval,Arch] Update GPTQ eval and add headless_mode for Controller (#2994)
* update and polish gptq eval
* fix typo
* Update evaluation/gpqa/README.md
Co-authored-by: Graham Neubig <[email protected]>
* Update evaluation/gpqa/run_infer.py
Co-authored-by: Graham Neubig <[email protected]>
* add headless mode to all appropriate agent controller call
* delegate set to error when in headless mode
* try to deduplicate a bit
* make headless_mode default to True and only change it to false for AgentSession
---------
Co-authored-by: Graham Neubig <[email protected]>
<insert correct answer here, must be one of A, B, C, D> (Please dont use any additional characters. Just the letter of the correct answer (A/B/C/D).)
50
+
||FINAL_ANSWER>>
51
+
""".strip()
52
+
53
+
54
+
defgpqa_codeact_user_response(
55
+
state: State,
56
+
encapsulate_solution: bool=False,
57
+
try_parse: Callable[[Action], str] |None=None,
58
+
) ->str:
59
+
msg= (
60
+
'Please continue working on the task on whatever approach you think is suitable.\n'
61
+
'Feel free to use all tools for calculations and solving the problem, and web-search for finding relevant facts during the process if needed\n'
62
+
'If you have finished reporting the answer in the expected format, (and only once that is done), please run the following command to submit: <execute_bash> exit </execute_bash>.\n'
63
+
'Again you are being told a million times to first report the answer in the requested format (see again below for reference) before exiting. DO NOT EXIT WITHOUT REPORTING THE ANSWER FIRST.\n'
64
+
'That is, when you have decided on the answer report in the following format:\n'
65
+
f'{ACTION_FORMAT}\n'
66
+
'<execute_bash> exit </execute_bash>\n'
67
+
'IMPORTANT: YOU SHOULD NEVER ASK FOR HUMAN HELP TO SOLVE THIS TASK.\n'
'CodeActAgent': '\n\n SUPER IMPORTANT: When you think you have solved the question, first report it back to the user in the requested format. Only once that is done, in the next turn, please run the following command: <execute_bash> exit </execute_bash>.\n'
- SUPER IMPORTANT: When you have reported the answer to the user in the requested format, (and only once that is done) in the next turn, please run the following command: <execute_bash> exit </execute_bash>.
216
+
- Again you are being told a million times to first report the answer in the requested format (see again below for reference) before exiting. DO NOT EXIT WITHOUT REPORTING THE ANSWER FIRST.
217
+
That is, when you have decided on the answer report in the following format:
218
+
219
+
{ACTION_FORMAT}
220
+
<execute_bash> exit </execute_bash>
221
+
222
+
Again do not quit without reporting the answer first.
223
+
Ok now its time to start solving the question. Good luck!
224
+
"""
187
225
188
226
# Here's how you can run the agent (similar to the `main` function) and get the final task state
0 commit comments