Reduce binary size of refine functions #1095

tfeher · 2025-07-09T17:29:20Z

The refine functions that work with GPU data use IVF-Flat under the hood to perform the refinement operation. This PR adds extern template declarations for ivfflat_interleaved_scan and uses these in the refine functions. This way we avoid recompiling the IVF-Flat search kernels, and save binary size.
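As a minimal sketch of the extern template mechanism described above (not the actual cuVS code): `scan_sum` and `refine_like_caller` are hypothetical names standing in for `ivfflat_interleaved_scan` and a refine function. In a real build the `extern template` declaration would live in a header and the explicit instantiation in a single source file. Both are shown in one file here so the example compiles on its own.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for an expensive templated search routine.
template <typename T, typename AccT>
AccT scan_sum(const T* data, std::size_t n)
{
  AccT acc{};
  for (std::size_t i = 0; i < n; ++i) {
    acc += static_cast<AccT>(data[i]);
  }
  return acc;
}

// Header side: explicit instantiation declaration. Translation units that
// see this line do not implicitly instantiate scan_sum<float, float>; they
// emit a reference to the symbol instead.
extern template float scan_sum<float, float>(const float*, std::size_t);

// Source side: explicit instantiation definition. Exactly one translation
// unit provides it, so the code is compiled and stored only once.
template float scan_sum<float, float>(const float*, std::size_t);

// Hypothetical caller, playing the role of a refine function that reuses
// the already instantiated routine.
float refine_like_caller(const std::vector<float>& v)
{
  return scan_sum<float, float>(v.data(), v.size());
}
```

If a caller uses a type combination that is declared `extern` but never explicitly instantiated anywhere, the link step fails with an undefined symbol, so the declared and instantiated type lists have to match.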

tfeher · 2025-07-09T17:37:43Z

The ivfflat_interleaved_scan function is expected to be the largest contributor to binary size for refine_device. We still have a few other kernel calls in refine_device. In a separate PR I will check whether we can get rid of those.

tfeher · 2025-07-09T18:30:49Z

The binary size is nicely reduced, but tests fail due to undefined symbols. ~~It worked locally,~~ I will look into it.

| filename | compile time | binary size |
| --- | --- | --- |
| refine_device_half_float.cu.o | 112.149 s | 2.185 MB |
| refine_device_float_float.cu.o | 111.532 s | 2.263 MB |
| refine_device_uint8_t_float.cu.o | 111.155 s | 2.183 MB |
| refine_device_int8_t_float.cu.o | 110.015 s | 2.183 MB |

tfeher · 2025-07-14T18:16:21Z

The error is related to the filter type used for instantiating the search kernels. I am looking into the details.

tfeher added 2 commits July 9, 2025 19:22

Use already instantiated ivf_flat search kernels for refinement

d19c868

Add headers for data types

3fa8b68

tfeher requested a review from a team as a code owner July 9, 2025 17:29

tfeher self-assigned this Jul 9, 2025

github-actions bot added the cpp label Jul 9, 2025

tfeher added the improvement and non-breaking labels and removed the cpp label Jul 9, 2025

github-actions bot added the cpp label Jul 9, 2025

cjnolet added this to Vector Search, ML, & Data Mining Release Board Jul 10, 2025

github-project-automation bot moved this to Todo in Vector Search, ML, & Data Mining Release Board Jul 10, 2025

cjnolet moved this from Todo to In Progress in Vector Search, ML, & Data Mining Release Board Jul 10, 2025

Merge branch 'branch-25.08' into refine_binary_size

643f3b4
