Skip to content

Hosted models perform better than local BYOM ones #43121

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
awaescher opened this issue Jan 6, 2025 · 2 comments
Open

Hosted models perform better than local BYOM ones #43121

awaescher opened this issue Jan 6, 2025 · 2 comments

Comments

@awaescher
Copy link

Hosted models do a better job summarizing pages than local BYOM ones. The UI behaves differently too. I guess this has anything to do how the website content is presented to the model internally. This seems to apply for each slash tool, not just summarization.

See the following screenshots for a comparison of the same question towards the same model - the only difference is whether it's a hosted (left) or local BYOM (right) one:

Image

@awaescher
Copy link
Author

I guess version1.74 fixed this

Image

@awaescher
Copy link
Author

awaescher commented Jan 16, 2025

Awesome to see this is fixed, however I noticed that on some pages the local models cannot summarize the website. My best guess is it depends on the website content length:

Most websites I tried won't work with local models

This StackOverflow site can be summarized ...
https://meta.stackoverflow.com/questions/432632/isnt-this-question-a-bad-review-audit

... while this can't:
https://meta.stackoverflow.com/questions/432657/upcoming-experiment-for-commenting

Image

More examples

https://news.ycombinator.com
https://huggingface.co/enterprise
https://blog.codinghorror.com/the-field-of-dreams-strategy/
https://blog.codinghorror.com/stay-gold-america
https://andrewlock.net/creating-a-source-generator-part-1-creating-an-incremental-source-generator

(by the way, I tried increasing the context length in the BYOM settings without success)


The issue title is still a good match because hosted models like llama3.1:8b can summarize all the websites shown above even though it may state rightfully that the context is too large:

Image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: No status
Development

No branches or pull requests

2 participants