
Simplify max_output_tokens handling in LLM classes #9296


Merged: 4 commits from simplify-max-output-tokens into main on Jun 23, 2025

Conversation

@neubig (Contributor) commented on Jun 23, 2025

Summary

This PR simplifies the handling of max_output_tokens in OpenHands LLM classes by removing complex default-setting logic and keeping the value as None when not explicitly set.

Changes Made

Core Changes

  • Removed complex default logic: Eliminated the complex logic in init_model_info() that attempted to determine default values for max_output_tokens based on various conditions
  • Simplified parameter handling: Now if max_output_tokens is None, it remains None instead of being set to computed defaults
  • Conditional parameter inclusion: Updated completion function calls in all LLM classes to include max_output_tokens only when it is not None (see the sketch after this list)
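
A minimal sketch of that conditional-inclusion pattern, assuming a config object with model and max_output_tokens attributes (the function and variable names are illustrative, not copied from openhands/llm/llm.py):

    def build_completion_kwargs(config, messages):
        # Parameters that are always forwarded to the completion call.
        kwargs = {'model': config.model, 'messages': messages}
        # Include max_tokens only when it was explicitly configured;
        # when it is None, omit it so the provider applies its own default.
        if config.max_output_tokens is not None:
            kwargs['max_tokens'] = config.max_output_tokens
        return kwargs

A later commit in this PR simplifies this further by dropping the conditional and passing the value straight through, since litellm.completion accepts None (see the commit messages and the sketch below them).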

Files Modified

  • openhands/llm/llm.py: Removed lines 489-507 containing complex default determination logic, added conditional parameter inclusion
  • openhands/llm/async_llm.py: Added conditional max_tokens parameter inclusion
  • openhands/llm/streaming_llm.py: Added conditional max_tokens parameter inclusion
  • tests/unit/test_llm.py: Updated tests to expect None values instead of computed defaults (see the test sketch after this list)
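
To illustrate the updated expectation, a test along these lines would now assert None rather than a computed default; the import paths and constructor usage below are assumptions for this sketch, not lines taken from tests/unit/test_llm.py:

    from openhands.core.config import LLMConfig  # assumed import path
    from openhands.llm.llm import LLM            # module listed above

    def test_max_output_tokens_stays_none():
        # No explicit max_output_tokens in the config...
        llm = LLM(LLMConfig(model='gpt-4o'))
        # ...so the value stays None instead of a model-specific default.
        assert llm.config.max_output_tokens is None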

Testing

  • ✅ All 32 LLM unit tests pass
  • ✅ Pre-commit hooks pass (ruff, mypy, formatting)
  • ✅ Verified headless mode functionality works with the changes

Benefits

  1. Simplified codebase: Removes unnecessary complexity in determining defaults
  2. Clearer behavior: More predictable behavior when max_output_tokens is not set
  3. Better maintainability: Less complex logic to maintain and debug
  4. Consistent handling: Uniform approach across all LLM implementations

Backward Compatibility

This change maintains backward compatibility - the LLM providers will use their own defaults when max_output_tokens is not provided, which is the expected behavior.

References

  • Addresses user request to simplify max_output_tokens handling
  • Follows the principle of keeping None as None instead of complex default computation

Fixes #9227



To run this PR locally, use the following command:

docker run -it --rm \
  -p 3000:3000 \
  -v /var/run/docker.sock:/var/run/docker.sock \
  --add-host host.docker.internal:host-gateway \
  -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:89ec682-nikolaik \
  --name openhands-app-89ec682 \
  docker.all-hands.dev/all-hands-ai/openhands:89ec682

- Remove complex default-setting logic in init_model_info()
- Keep max_output_tokens as None if not explicitly set
- Use conditional parameter inclusion in completion calls
- Update tests to expect None values instead of computed defaults

This simplifies the codebase by removing unnecessary complexity in determining max_output_tokens defaults.

Remove unnecessary conditional dictionary building. Since litellm.completion accepts Optional[int] = None for both max_completion_tokens and max_tokens parameters, we can simply pass None directly instead of conditionally including the parameters. This makes the code much cleaner and easier to understand.

The code is self-explanatory and doesn't need verbose comments.

The assertions are clear without explanatory comments.
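
As a sketch of that pass-None-directly approach (the wrapper function and config attributes here are assumed for illustration), the configured value can simply be forwarded, because litellm.completion declares these parameters as Optional[int] = None and defers to the provider when they are None:

    import litellm

    def call_completion(config, messages):
        # config.max_output_tokens may be None; litellm.completion accepts
        # Optional[int] for max_completion_tokens and max_tokens, so None
        # simply means "use the provider's own output limit".
        return litellm.completion(
            model=config.model,
            messages=messages,
            max_completion_tokens=config.max_output_tokens,
        )
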
@neubig neubig requested a review from xingyaoww June 23, 2025 03:08
@neubig neubig added the needs-review label (The PR author would like someone to review.) on Jun 23, 2025
Review thread on the removed default logic in openhands/llm/llm.py:

        model in self.config.model
        for model in ['claude-3-7-sonnet', 'claude-3.7-sonnet']
    ):
        self.config.max_output_tokens = 64000  # litellm set max to 128k, but that requires a header to be set
Collaborator:

I think at least this one needs a quick test with 3.7. I’m not at my computer, but I can do that in a bit

Collaborator:

I confirm it works well. I think this was a litellm issue at some point in time, it must have been fixed, and we forgot this code here.

@enyst (Collaborator) commented on Jun 23, 2025

Re: the complex logic here.
Some of it is really old and I've wanted to clean it up before. Some of it is newer and was solving an issue (comment inline).

@enyst enyst (Collaborator) left a comment:

I got LLMs to tell me stories, I also tried to look up the history of this code, and I don't find a reason not to clean it out. So let's go with this. If there are issues, we'll find out. 😅

@enyst enyst removed the needs-review label on Jun 23, 2025
@neubig neubig merged commit 1e33624 into main Jun 23, 2025
28 checks passed
@neubig neubig deleted the simplify-max-output-tokens branch June 23, 2025 10:48
@neubig (Contributor, Author) commented on Jun 23, 2025

Thank you!

Labels: run-eval-50 (Runs evaluation with 50 instances)
Projects: None yet
Development: Successfully merging this pull request may close: [Bug]: Context length is hit when asking simple question: what day is today?
3 participants