You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Jul 4, 2025. It is now read-only.
Hostfix: remove not needed params from load_model (#2209)
* refactor: remove --pooling flag from model loading
The --pooling flag was removed as the mean pooling functionality not needed in chat models. This fixes the regression
* feat(local-engine): add ctx_len parameter support
Adds support for the ctx_len parameter by appending --ctx-size with its value. Removed outdated parameter mappings from the kParamsMap to reflect current implementation details and ensure consistency.
* feat: add conditional model parameters based on path
When the model path contains both "jan" and "nano" (case-insensitive), automatically add
speculative decoding parameters to adjust generation behavior. This improves
flexibility by enabling environment-specific configurations without manual
parameter tuning. Also includes necessary headers for string manipulation and
fixes whitespace in ctx_len handling.
* chore: remove redundant comment
The comment was redundant as the code's purpose is clear without it, improving readability.
0 commit comments