Skip to content

feat: allow fit to take additional eval data in linear and ensemble models #1096

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 14 commits into from
Oct 29, 2024

Conversation

shobsi
Copy link
Contributor

@shobsi shobsi commented Oct 21, 2024

The additional eval data would be used to measure the fitted model and attach the measurement to the underlying BQML model, which can be used as benchmark for the model consumers in BigQuery Studio and otherwise.

Thank you for opening a Pull Request! Before submitting your PR, there are a few things you can do to make sure it goes smoothly:

  • Make sure to open an issue as a bug/issue before writing your code! That way we can discuss the change, evaluate designs, and agree on the general idea
  • Ensure the tests and linter pass
  • Code coverage does not decrease (if any source code was changed)
  • Appropriate docs were updated
    • LinearRegression: screenshot/4iDNvucVfsEEZiq,
    • LogisticRegression: screenshot/AqvzuVt4RBPjpw2
    • RandomForestClassifier: screenshot/AGxZJoyRNYdZWY4
    • RandomForestRegressor: screenshot/BuvCqmsp7UJ9Cdp
    • XGBClassifier: screenshot/BY7Eb7Ar22zsZBF
    • XGBRegressor: screenshot/66ey2Ci4xWPPU3w

Fixes internal issue 372344226 🦕

The additional eval data would be used to measure the fitted model and
attach the measurement to the underlying BQML model, which can be used
as benchmark for the model consumers in BigQuery Studio and otherwise.
@shobsi shobsi requested review from a team as code owners October 21, 2024 08:23
@shobsi shobsi requested a review from sycai October 21, 2024 08:23
@product-auto-label product-auto-label bot added size: l Pull request size is large. api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. labels Oct 21, 2024
@shobsi shobsi marked this pull request as draft October 21, 2024 08:23
@shobsi shobsi requested review from jiaxunwu and TrevorBergeron and removed request for sycai October 23, 2024 19:29
@shobsi shobsi changed the title feat: allow fit to take additional eval data in linear models feat: allow fit to take additional eval data in linear and ensemble models Oct 23, 2024
@shobsi shobsi marked this pull request as ready for review October 23, 2024 21:47
@shobsi shobsi merged commit 254875c into main Oct 29, 2024
23 checks passed
@shobsi shobsi deleted the shobs-ml-split-fit branch October 29, 2024 00:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
api: bigquery Issues related to the googleapis/python-bigquery-dataframes API. size: l Pull request size is large.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants