✨ Replace max_iterations with max_llm_calls #628
base: main
Conversation
fabianvf commented Feb 6, 2025
- Add a counter to the model provider to optionally limit the number of requests that can be made.
- Remove all references to max_iterations (except in the rpc_server params; we can remove that in a few releases).
```diff
@@ -22,7 +22,17 @@
 LOG = get_logger(__name__)


 class LLMCallBudgetReached(Exception):
     def __init__(
         self, message: str = "The defined LLM call budget has been reached"
```
Should we say the budget that was set?
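A minimal sketch of what that suggestion could look like: the exception accepts the configured budget and interpolates it into the message. The `budget` parameter and attribute are assumptions for illustration, not the PR's actual signature.

```python
class LLMCallBudgetReached(Exception):
    """Raised when the configured number of LLM calls has been exhausted."""

    def __init__(self, budget: int = -1):
        # Include the configured budget in the message so logs show
        # which limit was hit (hypothetical shape, per the review comment).
        message = f"The defined LLM call budget ({budget}) has been reached"
        super().__init__(message)
        self.budget = budget
```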
```diff
 class ModelProvider:

     llm_call_budget: int = -1
```
Is this approach going to cause issues with Jonah's async PR?
I don't think so. Each LLM call can still check whether the budget has been reached before actually making the call. Asyncio is cooperatively multitasked.
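The reasoning above can be sketched as follows. This is a hypothetical outline, not the PR's implementation: the names (`invoke`, `llm_calls_made`, `_call_model`) are assumptions. The key point is that the check-then-increment has no `await` between its two steps, so under asyncio's cooperative scheduling no other task can interleave and corrupt the count.

```python
import asyncio


class LLMCallBudgetReached(Exception):
    """Raised when the configured number of LLM calls is exhausted."""


class ModelProvider:
    def __init__(self, llm_call_budget: int = -1):
        self.llm_call_budget = llm_call_budget  # -1 disables the limit
        self.llm_calls_made = 0

    async def invoke(self, prompt: str) -> str:
        # No await point between the check and the increment, so
        # cooperative asyncio tasks cannot interleave here.
        if 0 <= self.llm_call_budget <= self.llm_calls_made:
            raise LLMCallBudgetReached()
        self.llm_calls_made += 1
        return await self._call_model(prompt)

    async def _call_model(self, prompt: str) -> str:
        # Stand-in for the real model request.
        return f"response to: {prompt}"
```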
I don't know. Wouldn't multiple calls to get_code_plan_solution cause the budget for ALL calls in the system to be reset? (AFAICT the model provider is shared.)
Are we supporting multiple calls? The whole system kind of breaks down in that case anyway, doesn't it?
It is, but my other thought was that it might be nice for this not to be one more thing we have to remember if/when we do support multiple requests.
But maybe we'll have to re-architect the initialization of task managers/agents at that point anyway.
```diff
@@ -463,7 +456,9 @@ class GetCodeplanAgentSolutionParams(BaseModel):
     file_path: Path
     incidents: list[ExtendedIncident]

     # Deprecated in favor of llm_call_budget
```
https://docs.pydantic.dev/latest/concepts/fields/#deprecated-fields You can make this a bona fide deprecated field if you want
Or you could alias the other field to the new one
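The aliasing idea could look something like this with Pydantic v2's `AliasChoices`: the old `max_iterations` key is accepted as an input alias that populates the new `llm_call_budget` field. This wiring is a sketch of the suggestion, not code from the PR.

```python
from pydantic import AliasChoices, BaseModel, Field


class GetCodeplanAgentSolutionParams(BaseModel):
    # Accept both the new name and the deprecated max_iterations key
    # as input; both populate llm_call_budget.
    llm_call_budget: int = Field(
        default=-1,
        validation_alias=AliasChoices("llm_call_budget", "max_iterations"),
    )
```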
+1 that's a good idea