Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix: Bump max_iterations when resuming due to throttling #3410

Merged
merged 17 commits into from
Aug 20, 2024

Conversation

tofarr
Copy link
Collaborator

@tofarr tofarr commented Aug 15, 2024

This is intended to address #3323

There was a bug where when you reached the max_iterations, the UI would keep asking you to resume after every single request. This resets the iteration count. (100 by default - so you get 100 more requests before the session pauses again)

Testing

I didn't want to make 100 requests each time, so I temporarily updated the max_iterations in my local config.toml from 100 to 2. ;)

Sample Output

I asked the agent to write a couple of poems - it now resets the iteration count when resuming, so that you can hit max_iterations (In my case 3) before the agent stops again:
image
image
image

@tofarr tofarr marked this pull request as ready for review August 15, 2024 20:18
@@ -225,6 +225,8 @@ async def set_agent_state_to(self, new_state: AgentState):
):
# user intends to interrupt traffic control and let the task resume temporarily
self.state.traffic_control_state = TrafficControlState.PAUSED
# Reset the iterations (So user will not be immediately asked again)
self.state.iteration = 0
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think this place in the code is also hit when max_budget_per_task is exceeded, and the user wants to continue. In that case, to reset iterations doesn't seem right.

Copy link
Collaborator Author

@tofarr tofarr Aug 15, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would something like the following make more sense:

            # Reset the iterations (So user will not be immediately asked again)
            if self.max_budget_per_task is None or self.state.metrics.accumulated_cost < self.max_budget_per_task:
                self.state.iteration = 0

Or do you think it would make sense to expand the max_iterations instead (Though this could result in balooning of task size)? I suppose we could even have an explicit reset iterations action.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It should work. It could use a bit of testing, I guess I'd suggest also a test with a delegate agent getting this iterations issue, just in case, but it looks good to me. Cc: @li-boxuan

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Expanding max_iterations seems to make more sense than resetting self.state.iteration. Resetting self.state.iteration will make backend output look weird.

Another issue, as pointed out by @enyst , is we also need to take care of budget control. Maybe we should also expand the budget (rather than simply resetting it).

After all these, you could delete all TrafficControlState related code.

@tofarr tofarr changed the title Fix: Reset iteration count when resuming due to throttling Fix: Bump max_iterations when resuming due to throttling Aug 16, 2024
@tofarr tofarr requested review from enyst and li-boxuan August 16, 2024 14:35
if self.state.iteration >= self.state.max_iterations:
self.state.max_iterations *= 2
if self.state.metrics.accumulated_cost >= self.max_budget_per_task:
self.max_budget_per_task *= 2
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This piece of code alone doesn't work in a multi-agent context. This essentially expands the current agent/controller's limit, but when it returns to its parent, the old limit would be hit immediately. It would still work but a little bit confusing/surprising to users. The backend logs would look weird, too.

I would suggest take a look at code in if delegate_done section. You probably need to do something similar to "propagate back" the max_iterations and max_budget_per_task.

@tobitege
Copy link
Collaborator

tobitege commented Aug 17, 2024

@tofarr I hope you don't mind that I tried fixing the lint error and main merge, but then mypy also tried to reformat the file again and caused that as an error... 🤯 That thing annoys me.

@tobitege
Copy link
Collaborator

Instead of multiplying the max_iterations, now just add the initial max_iterations to it (the param that originally got passed into AgentController).
Tested locally and works.

@tobitege tobitege requested review from li-boxuan and enyst August 19, 2024 18:45
Copy link
Collaborator

@tobitege tobitege left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

At least from my point of view, this looks good 😁
Thanks, Tim!

@enyst
Copy link
Collaborator

enyst commented Aug 19, 2024

@li-boxuan has pointed out here a weird consequence with delegates: if the user asks for, lets say, searching for info on the web, an agent will delegate to another, and if this 'child' agent, the browsing agent, hits max_iterations, then:

  • the user can choose to continue, at, lets say, 100/100
  • the browsing agent does more stuff
  • it ends, and control returns to the 'parent' agent
  • which will stop immediately with error, because the parent state wasn't updated, so the user sees it's at 178/100 or 102/100, whichever it got to.

If that's the case indeed, it will look odd.

I would suggest to replicate it and see. The user can say continue again, I assume, I just suggest to give it a try and see how it feels like, and how you think about it.

@tofarr tofarr merged commit f5aa111 into main Aug 20, 2024
@tofarr tofarr deleted the fix-reset-iterations-on-resume branch August 20, 2024 12:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants