
Use the configured OpenAI Base URL for Automations #1065


Merged: 1 commit into khoj-ai:master on Jan 11, 2025

Conversation

@arcuru (Contributor) commented Jan 10, 2025

This change makes Automations (and possibly other entrypoints) use the configured OpenAI-compatible server if one has been set. Without this change, they try to use the hardcoded OpenAI provider.

All the other calls in this file use a similar method to pass in the base URL.

I have not been able to manually test this because the Docker image is taking an extremely long time to build locally.
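
For context, this is roughly how a base URL selects an OpenAI-compatible server with the OpenAI Python client (a hedged sketch; Khoj's wrapper code is not shown here, and make_client is an illustrative name):

```python
from openai import OpenAI

# Hedged illustration: how an OpenAI-compatible server is selected via base_url.
# If base_url is None, the client defaults to https://api.openai.com/v1 --
# the "hardcoded OpenAI provider" behavior this PR avoids for Automations.
def make_client(api_key: str, api_base_url: str | None = None) -> OpenAI:
    return OpenAI(api_key=api_key, base_url=api_base_url)

# e.g. a LiteLLM proxy: make_client("sk-...", "http://localhost:4000/v1")
```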

@debanjum (Member) left a comment

Good catch! Didn't realize we weren't passing api_base_url to the send_message_to_model_wrapper_sync method
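
A hedged sketch of the fix's shape (only the function name and the api_base_url parameter appear in this thread; everything else below is an assumption, with a stub standing in for Khoj's actual wrapper):

```python
# Stub standing in for Khoj's actual wrapper (assumed signature).
def send_message_to_model_wrapper_sync(message, model, api_base_url=None):
    ...  # falls back to the hardcoded OpenAI endpoint if api_base_url is None

# Before: api_base_url omitted -> hardcoded OpenAI provider.
send_message_to_model_wrapper_sync(message="query", model="gpt-4o")

# After: forward the configured OpenAI-compatible server URL,
# matching how the other calls in this file pass it in.
send_message_to_model_wrapper_sync(
    message="query", model="gpt-4o", api_base_url="http://localhost:4000/v1"
)
```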

@arcuru (Contributor, Author) commented Jan 10, 2025

I was able to build this locally for testing and hit at least one more issue related to sending the LLM calls to an OpenAI-compatible server, which may also need to be fixed for this scenario.

I think the root of the problem shows up in these logs:

```
server-1    | [18:46:28.889975] DEBUG    khoj.processor.conversation.utils:       utils.py:537
server-1    |                            Fallback to default chat model
server-1    |                            tokenizer: gpt-4o.
server-1    |                            Configure tokenizer for model:
server-1    |                            local/chat/phi4:14b in Khoj settings to
server-1    |                            improve context stuffing.
```

It appears that the name "gpt-4o" is hardcoded as the default tokenizer, and this call failed even after I created a "gpt-4o" chat model in Khoj, added a "gpt-4o" endpoint on my LiteLLM server, and even configured my OpenAI key for the gpt-4o model directly. Setting a tokenizer for that model in server/admin/ had no effect.

There are further errors in the logs after this, but they seem to come from chunking while running the query to create the automation. While it's good that this fails immediately, I really hope you're not just feeding the output of the selected GUI options into the LLM to convert it into a cron job template.

TBH, I've hit so many GUI and setup bugs while trying to self-host Khoj that I'm not going to spend more time on it. I filed the bugs that probably also impact your cloud service so you can fix them for your paying customers.

@debanjum (Member)

Hey @arcuru, it's unfortunate that you've hit multiple issues setting up automations for self-hosted Khoj. I wonder if you're hitting the same timeout issues as #1035 (comment). Anyway, let me look into self-hosted Khoj + OpenAI API proxy + Automation setups and see what can be improved for a less annoying experience.

Until then I've answered some of your concerns below:

> It appears that the name "gpt-4o" is hardcoded as the default tokenizer, and this call failed even after I created a "gpt-4o" chat model in Khoj, added a "gpt-4o" endpoint on my LiteLLM server, and even configured my OpenAI key for the gpt-4o model directly. Setting a tokenizer for that model in server/admin/ had no effect.

This isn't an error, just a debug log (notice the [18:46:28.889975] DEBUG prefix) indicating that the prompt size calculation will be less accurate. We fall back to the gpt-4o tokenizer if we can't infer the tokenizer to use for the current chat model, e.g. in scenarios like this where the OpenAI API is being used with non-OpenAI models.
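
For illustration, the fallback roughly takes this shape (an assumed sketch using tiktoken; the actual logic lives in khoj/processor/conversation/utils.py and may differ):

```python
import tiktoken

# Assumed shape of the fallback described above, not Khoj's verbatim code.
def get_tokenizer(chat_model_name: str, configured_encoding: str | None = None):
    if configured_encoding:
        # An explicitly configured tokenizer (e.g. "cl100k_base") wins.
        return tiktoken.get_encoding(configured_encoding)
    try:
        # Known OpenAI model names ("gpt-4o") resolve directly.
        return tiktoken.encoding_for_model(chat_model_name)
    except KeyError:
        # Unknown models like "local/chat/phi4:14b" trigger the DEBUG log
        # above and fall back to the gpt-4o tokenizer for token counting.
        return tiktoken.encoding_for_model("gpt-4o")
```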

This only actually becomes a problem if the max prompt size set for the chat model is close to the actual max prompt size of the model and your chat history starts hitting that limit. You could set max prompt size = 10K for phi4, given it has a 14K context window (which is small by modern standards).
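
To make that concrete, here's an illustrative history-truncation loop (not Khoj's actual code) showing why headroom between max prompt size and the real context window matters when the fallback tokenizer may miscount:

```python
def truncate_history(messages: list[str], tokenizer, max_prompt_size: int) -> list[str]:
    # Keep the most recent messages that fit within max_prompt_size tokens.
    kept, used = [], 0
    for message in reversed(messages):
        n_tokens = len(tokenizer.encode(message))
        if used + n_tokens > max_prompt_size:
            break
        kept.append(message)
        used += n_tokens
    return list(reversed(kept))

# With max_prompt_size=10_000 for a ~14K-context model, there is headroom
# even if the fallback gpt-4o tokenizer miscounts phi4 tokens somewhat.
```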

Not sure why you're seeing this when the chat model is set to gpt-4o, though.

> I really hope you're not just feeding the output of the selected GUI options into the LLM to convert it into a cron job template.

The LLM isn't used to set the crontime when you've explicitly specified it via the GUI. We used to have the LLM set the cron job schedule when we allowed creating an automation directly from chat (e.g. you tell Khoj in chat "Share synthetic biology news every Tuesday at 9pm").

The LLM is used to convert your original query into an automation query and an email subject. So if you say "Notify me if it's going to rain today", it converts that into the chat query "Is it going to rain today?" and the email subject "Rain Notification". The result of the chat query "Is it going to rain today?" is then compared against your original query ("Notify me ...") to decide whether a notification email should be sent.
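
Sketched in pseudocode-style Python (every name below is illustrative, inferred from the description above rather than taken from the codebase):

```python
def run_automation(original_query: str) -> None:
    # 1. The LLM rewrites the user's request into a standalone chat query
    #    and an email subject (the crontime comes straight from the GUI).
    chat_query = "Is it going to rain today?"   # derived by the LLM
    subject = "Rain Notification"               # derived by the LLM

    # 2. Execute the chat query like a normal Khoj chat request.
    result = execute_chat_query(chat_query)

    # 3. Compare the result against the original request to decide
    #    whether a notification email is warranted at all.
    if condition_met(original_query, result):
        send_email(subject=subject, body=result)
```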

Nonetheless, appreciate the PR and feedback!

@debanjum merged commit 6e0c767 into khoj-ai:master on Jan 11, 2025 (6 checks passed).
@AlipAbdullah commented Jan 14, 2025 via email
