Overrides rewrite in PointRangeQuery to optimize AllDocs/NoDocs cases #14609

ebradshaw · 2025-05-04T23:13:55Z

Overrides rewrite in PointRangeQuery to handle cases where the query range either fully contains or fully excludes all documents within the shard.

Often, particularly with time-based partitioning, a range query overlaps several indexes. Many of these indexes hold timestamp values that are fully contained by the query range, in which case the query can be rewritten to a MatchAllDocsQuery or a FieldExistsQuery. Conversely, many indexes fall entirely outside the requested time range and can be fully excluded, in which case the query can be rewritten to a MatchNoDocsQuery.
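The decision described above can be sketched for a single dimension as follows. This is only an illustration under stated assumptions, not the code in this PR: the class RangeRewriteSketch, the enum Decision, and the plain long bounds are hypothetical stand-ins for the packed byte[] values Lucene uses, and the rule that FieldExistsQuery is preferred when only some documents have a value for the field is an assumption of the sketch.

```java
/** Hypothetical sketch, not Lucene code: the shard-level rewrite decision for one dimension. */
class RangeRewriteSketch {

  /** Which query the range query could be rewritten to (names are illustrative). */
  enum Decision { MATCH_ALL, FIELD_EXISTS, MATCH_NONE, KEEP_RANGE_QUERY }

  /**
   * @param qLow inclusive lower bound of the query range
   * @param qHigh inclusive upper bound of the query range
   * @param min smallest indexed value in the shard
   * @param max largest indexed value in the shard
   * @param docCount number of documents that have a value for the field
   * @param maxDoc total number of documents in the shard
   */
  static Decision decide(long qLow, long qHigh, long min, long max, int docCount, int maxDoc) {
    // The query range lies entirely above or entirely below the indexed values.
    if (qLow > max || qHigh < min) {
      return Decision.MATCH_NONE;
    }
    // The query range covers every indexed value.
    if (qLow <= min && qHigh >= max) {
      // Assumption: if some documents lack the field, only those with a value match.
      return docCount == maxDoc ? Decision.MATCH_ALL : Decision.FIELD_EXISTS;
    }
    // Partial overlap: the range query has to be evaluated as usual.
    return Decision.KEEP_RANGE_QUERY;
  }
}
```

For example, a query for [0, 100] against a shard whose values span [10, 20] is fully containing, while a query for [30, 40] against the same shard is fully excluded.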

While a similar optimization exists at the leaf level in the createWeight function, rewriting at the shard level enables other optimizations downstream.

Please let me know if this has been ruled out in the past for other reasons or if the implementation misses anything. Thanks.


jpountz

I left some minor comments but the change looks good to me in general. Thank you!

jpountz · 2025-05-05T20:48:51Z

lucene/core/src/java/org/apache/lucene/search/PointRangeQuery.java

 * @lucene.experimental
+ * @see PointValues


I don't think we have a rule around it, but the lucene.experimental / lucene.internal tag is usually the last one, so let's not swap these two lines?

jpountz · 2025-05-05T20:56:55Z

lucene/core/src/java/org/apache/lucene/search/PointRangeQuery.java

+      }
+    } else if (fullyExcludedCount == dims) {
+      return new MatchNoDocsQuery();
+    }


I wonder if we can somehow reuse the existing relate(byte[], byte[]) method?


jainankitk · 2025-05-07T23:12:38Z

lucene/core/src/java/org/apache/lucene/search/PointRangeQuery.java

+        fullyContainedCount++;
+      } else if (qLow.compareTo(gMax) > 0 || qHigh.compareTo(gMin) < 0) {
+        fullyExcludedCount++;
+      }


Can we return super.rewrite(searcher) as soon as any dimension is neither fully contained nor fully excluded?

jainankitk · 2025-05-07T23:27:02Z

lucene/core/src/java/org/apache/lucene/search/PointRangeQuery.java

+      }
+    } else if (fullyExcludedCount == dims) {
+      return new MatchNoDocsQuery();
+    }


+1 to using relate(byte[], byte[]). Actually, looking at the relate(byte[], byte[]) method, I realized that we don't need fullyExcludedCount == dims to rewrite into MatchNoDocsQuery. Any single dimension being completely outside the range is a sufficient condition for the query to be rewritten as MatchNoDocsQuery.
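To make the asymmetry concrete, here is a rough sketch of a per-dimension comparison over packed values. It is not the existing relate(byte[], byte[]) method: the class MultiDimRelateSketch, the method relateSketch, its parameter list, and the Relation names are all hypothetical, and the packed layout (numDims slices of bytesPerDim bytes, compared as unsigned bytes) is an assumption of the example.

```java
import java.util.Arrays;

/** Hypothetical sketch, not Lucene code: relating a query box to the indexed min/max box. */
class MultiDimRelateSketch {

  /** Illustrative names; Lucene's own Relation enum is not used here. */
  enum Relation { INSIDE, OUTSIDE, CROSSES }

  static Relation relateSketch(
      byte[] qLow, byte[] qHigh, byte[] min, byte[] max, int numDims, int bytesPerDim) {
    boolean allContained = true;
    for (int dim = 0; dim < numDims; dim++) {
      int from = dim * bytesPerDim;
      int to = from + bytesPerDim;
      // One disjoint dimension is enough: no indexed point can fall inside the query box.
      if (Arrays.compareUnsigned(qLow, from, to, max, from, to) > 0
          || Arrays.compareUnsigned(qHigh, from, to, min, from, to) < 0) {
        return Relation.OUTSIDE;
      }
      // Containment must hold in every dimension, so remember any dimension that falls short.
      if (Arrays.compareUnsigned(qLow, from, to, min, from, to) > 0
          || Arrays.compareUnsigned(qHigh, from, to, max, from, to) < 0) {
        allContained = false;
      }
    }
    return allContained ? Relation.INSIDE : Relation.CROSSES;
  }
}
```

Under these assumptions, OUTSIDE would correspond to the MatchNoDocsQuery rewrite, INSIDE to the MatchAllDocsQuery / FieldExistsQuery rewrite, and CROSSES to falling back to super.rewrite(searcher).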

Overrides rewrite in PointRangeQuery to handle cases where the query …

fee0383

…range either fully contains or fully excludes all documents within the shard.

github-project-automation bot added this to OpenSearch Lucene & Core Performance Tracking May 4, 2025

github-project-automation bot moved this to Open in OpenSearch Lucene & Core Performance Tracking May 4, 2025

github-actions bot added the module:core/search label May 4, 2025

ebradshaw added 2 commits May 4, 2025 19:18

Fixes wildcarded imports

72aa631

Null check on values

052a961

