perf(weave): address data loading perf issue on eval compare #4192

chance-wnb · 2025-04-19T00:27:50Z

Description

This addresses a part of the evaluation comparison performance issue. No the customer scenario will no longer crash.

UI wise, there is no noticeable behavior change.

Here I attach a video(internal) to explain the change.

Testing

This PR is manually tested against the customer scenario and locally.

circle-job-mirror · 2025-04-19T00:31:18Z

Preview this PR with FeatureBee: https://beta.wandb.ai/?betaVersion=e0ff91d73a74f2bddded908c73577a5c11c57ed8

gtarpenning

I played around with it and its really nice, I did notice one small bug which is that if you click the next example button rapidly, we get into a state where the query fires but the loading state if permanently stuck. I think this has to do with the mutation counter but i'm not sure. A very edge case, but because we don't have a way of going to a specific example (which I think we should eventually add btw), I could imagine a user spamming the next example button to get to the one they know is interesting. We might be able to get away with just loading 5 ahead or something 🤷🏻

...se3/pages/CompareEvaluationsPage/sections/ExampleCompareSection/exampleCompareSectionUtil.ts

chance-wnb · 2025-04-21T20:21:17Z

I played around with it and its really nice, I did notice one small bug which is that if you click the next example button rapidly, we get into a state where the query fires but the loading state if permanently stuck. I think this has to do with the mutation counter but i'm not sure. A very edge case, but because we don't have a way of going to a specific example (which I think we should eventually add btw), I could imagine a user spamming the next example button to get to the one they know is interesting. We might be able to get away with just loading 5 ahead or something 🤷🏻

Great job on discovering this bug, it is a solid bug. This is because in the loadRowDataIntoCache() I was passing cachedRowData.current which is the at-the-time value of the cache, which does not reflect the latest value during race conditions. My fix is to change the parameter to pass the refs (the boxed object), and the issue went away.

gtarpenning

lgtm, love the code removal. I think the cache handling is neat. I trust that the mutation counter is common practice, and after watching your video it makes sense; @tssweeney might be curious about it!

chance-wnb · 2025-04-21T20:59:02Z

lgtm, love the code removal. I think the cache handling is neat. I trust that the mutation counter is common practice, and after watching your video it makes sense; @tssweeney might be curious about it!

In regard to the mutation counter thingy, I now feel that semantically giving it a better name seems to justify its usage better. now I renamed it to cacheVersion.

tssweeney · 2025-04-21T23:15:13Z

...ePanelComponents/Home/Browse3/pages/wfReactInterface/tsDataModelHooksEvaluationComparison.ts

@@ -499,30 +499,6 @@ const fetchEvaluationComparisonResults = async (
      });
    });

-  // 3.5 Populate the inputs


Given this, you are no long populating inputs in

export type EvaluationComparisonResults = { // Inputs are the intersection of all inputs used in the evaluations. // Note, we are able to "merge" the same input digest even if it is // used in different evaluations. inputs: { [rowDigest: string]: DatasetRow; };

as such, I would either:
a) populate this field using other forms (ex. the predict_and_score input) or
b) remove this field from the type definition and force resolution of any typing issues caused from this

i think (b) corresponds with this PR

Thanks for catching this. It was an oversight, it should be removed.

tssweeney · 2025-04-21T23:20:47Z

...se3/pages/CompareEvaluationsPage/sections/ExampleCompareSection/exampleCompareSectionUtil.ts

+              inputDigest: predictAndScoreRes.rowDigest,
+              inputRef: predictAndScoreRes.exampleRef,
+              output: flattenObjectPreservingWeaveTypes({output}),
+              scores: Object.fromEntries(


in @andrewtruong's upcoming PR, we can't count that the data comes from a dataset (gasp!) it might be good to at least leave a comment here as this would be a possible location to record the raw predict_and_score inputs as the presumed data row.

tssweeney · 2025-04-21T23:22:54Z

...rowse3/pages/CompareEvaluationsPage/sections/ExampleCompareSection/ExampleCompareSection.tsx

@@ -225,6 +227,16 @@ export const ExampleCompareSection: React.FC<{
    return filteredRows[targetIndex];
  }, [filteredRows, targetIndex]);

+  const {targetRowValue, loading: loadingInputValue} = useExampleCompareData(


my reading of the implementation of useExampleCompareData is that filteredRows is only used to get the index using filteredRows[targetIndex]. It might be cleaner to just pass filteredRows[targetIndex] to useExampleCompareData, simplifying the params and internal logic of useExampleCompareData

filteredRows should be needed to invalidate the cache in case the filter condition changes.

tssweeney · 2025-04-22T02:05:43Z

...se3/pages/CompareEvaluationsPage/sections/ExampleCompareSection/exampleCompareSectionUtil.ts

+    // Including `cacheVersion` in the dependency array ensures the memo recalculates
+    // when it changes, even though it's not directly used in the calculation.
+    // eslint-disable-next-line react-hooks/exhaustive-deps
+  }, [cacheVersion, filteredRows, targetIndex]);


I am not the biggest fan of this pattern (in particular, explicitly depending on cacheVersion that is not used in the hook). But I would agree it is hard to get around this one. Approving, but do think there might be a cleaner way.

chance-wnb requested review from a team as code owners April 19, 2025 00:27

chance-wnb requested a review from gtarpenning April 19, 2025 00:27

chance-wnb force-pushed the chance/eval_perf branch from 46f2202 to 5bc5915 Compare April 19, 2025 00:29

chance-wnb force-pushed the chance/eval_perf branch 4 times, most recently from c581e64 to 66241f8 Compare April 19, 2025 18:36

gtarpenning reviewed Apr 21, 2025

View reviewed changes

chance-wnb force-pushed the chance/eval_perf branch 2 times, most recently from 98ba681 to 27b30c2 Compare April 21, 2025 20:14

chance-wnb requested a review from gtarpenning April 21, 2025 20:26

gtarpenning approved these changes Apr 21, 2025

View reviewed changes

chance-wnb requested a review from tssweeney April 21, 2025 20:33

chance-wnb force-pushed the chance/eval_perf branch from 27b30c2 to c7f0529 Compare April 21, 2025 20:57

chance-wnb force-pushed the chance/eval_perf branch from c7f0529 to d29f6de Compare April 21, 2025 21:00

tssweeney reviewed Apr 21, 2025

View reviewed changes

tssweeney reviewed Apr 22, 2025

View reviewed changes

tssweeney approved these changes Apr 22, 2025

View reviewed changes

perf(weve): address data loading perf issue and eval compare

0d75513

chance-wnb force-pushed the chance/eval_perf branch from d29f6de to 0d75513 Compare April 22, 2025 03:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(weave): address data loading perf issue on eval compare #4192

perf(weave): address data loading perf issue on eval compare #4192

chance-wnb commented Apr 19, 2025 •

edited

Loading

circle-job-mirror bot commented Apr 19, 2025 •

edited

Loading

gtarpenning left a comment

chance-wnb commented Apr 21, 2025 •

edited

Loading

gtarpenning left a comment

chance-wnb commented Apr 21, 2025

tssweeney Apr 21, 2025

tssweeney Apr 21, 2025

chance-wnb Apr 22, 2025

tssweeney Apr 21, 2025

tssweeney Apr 21, 2025

chance-wnb Apr 21, 2025

tssweeney Apr 22, 2025

perf(weave): address data loading perf issue on eval compare #4192

Are you sure you want to change the base?

perf(weave): address data loading perf issue on eval compare #4192

Conversation

chance-wnb commented Apr 19, 2025 • edited Loading

Description

Testing

circle-job-mirror bot commented Apr 19, 2025 • edited Loading

gtarpenning left a comment

Choose a reason for hiding this comment

chance-wnb commented Apr 21, 2025 • edited Loading

gtarpenning left a comment

Choose a reason for hiding this comment

chance-wnb commented Apr 21, 2025

tssweeney Apr 21, 2025

Choose a reason for hiding this comment

tssweeney Apr 21, 2025

Choose a reason for hiding this comment

chance-wnb Apr 22, 2025

Choose a reason for hiding this comment

tssweeney Apr 21, 2025

Choose a reason for hiding this comment

tssweeney Apr 21, 2025

Choose a reason for hiding this comment

chance-wnb Apr 21, 2025

Choose a reason for hiding this comment

tssweeney Apr 22, 2025

Choose a reason for hiding this comment

chance-wnb commented Apr 19, 2025 •

edited

Loading

circle-job-mirror bot commented Apr 19, 2025 •

edited

Loading

chance-wnb commented Apr 21, 2025 •

edited

Loading