max_retries and backoff config not applied; improve retry logic for quota errors #857

louisgthier · 2025-04-13T15:41:52Z

In libs/genai/langchain_google_genai/chat_models.py, the max_retries parameter appears to be unused — the retry mechanism always defaults to 2 retries regardless of the value passed.

Additionally, the following backoff-related parameters are currently hardcoded:

multiplier = 2
min_seconds = 1
max_seconds = 60

It would be helpful to make these configurable so that users can adjust the retry behavior as needed.

Lastly, for cases where the API returns a quota-exceeded error and provides a recommended retry delay, it would be good to parse that value and wait accordingly, if the suggested delay is less than the configured timeout. This would allow for more efficient handling of rate limits.

The text was updated successfully, but these errors were encountered:

lkuligin · 2025-04-16T12:41:04Z

would you be open to send a PR for this, please?

lkuligin added enhancement New feature or request good first issue Good for newcomers labels Apr 16, 2025

bajajku linked a pull request May 9, 2025 that will close this issue

genai: Enhance retry mechanism in ChatGoogleGenerativeAI with customizable parameter #915

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

max_retries and backoff config not applied; improve retry logic for quota errors #857

max_retries and backoff config not applied; improve retry logic for quota errors #857

louisgthier commented Apr 13, 2025

lkuligin commented Apr 16, 2025

max_retries and backoff config not applied; improve retry logic for quota errors #857

max_retries and backoff config not applied; improve retry logic for quota errors #857

Comments

louisgthier commented Apr 13, 2025

lkuligin commented Apr 16, 2025