GH-130415: Narrow `str` to `""` based on boolean tests #130476

fluhus · 2025-02-22T22:42:41Z

Narrow a string's value to `""` when an `if` test on it evaluates to false.

@brandtbucher

Issue: Better constant narrowing in the JIT optimizer #130415
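
For context, the narrowing relies on a basic property of exact `str` objects: the only falsy string is the empty one. A minimal illustration in plain Python follows. The function name `describe` is hypothetical and not part of the PR; it only shows the source-level pattern that the JIT optimizer could exploit.

```python
def describe(s: str) -> str:
    # For an exact str, bool(s) is False only when s == "".
    if not s:
        # On this branch, an optimizer that already knows s is a str
        # may treat s as the constant "".
        assert s == ""
        return "empty"
    return "non-empty"
```

Note that a `str` subclass could override `__bool__`, so the reasoning applies only once the value is known to be exactly `str`.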

ghost · 2025-02-22T22:42:44Z

All commit authors signed the Contributor License Agreement.

bedevere-app · 2025-02-22T22:42:45Z

Most changes to Python require a NEWS entry. Add one using the blurb_it web app or the blurb command-line tool.

If this change has little impact on Python users, wait for a maintainer to apply the skip news label instead.

Python/optimizer_bytecodes.c

Lib/test/test_capi/test_opt.py

bedevere-app · 2025-02-22T22:49:36Z

A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated.

Once you have made the requested changes, please leave a comment on this pull request containing the phrase I have made the requested changes; please review again. I will then notify any core developers who have left a review that you're ready for them to take another look at this pull request.

fluhus · 2025-02-22T23:08:12Z

Added the requested corrections. Thanks, @brandtbucher!

markshannon

Thanks for the contribution.
Unfortunately, I think there is a critical flaw in this approach as it could result in mis-optimizations in the future.

This would be a useful optimization, so if you're willing to pursue this further, it would be appreciated.

markshannon · 2025-02-24T16:16:08Z

Lib/test/test_capi/test_opt.py

+                dummy = "aaa"
+                # Hopefully the optimizer can't guess what the value is.
+                # empty is always "", but we can only prove that it's a string:
+                empty = dummy[:0]


I can easily see the optimizer turning `"aaa"[:0]` into `""`.
`empty` doesn't need to be a constant; we just need it to be mostly `""`, for profiling.
Use something like `empty = "a"[:(n % 1000) == 0]`.

brandtbucher · 2025-03-03 (edited)

Since we check the actual path taken as part of the test, we need the value to always be `""`, not just mostly `""`. So maybe:

    false = i == TIER2_THRESHOLD
    empty = "X"[:false]

The optimizer can't prove `false` is `False`, so it's good enough for our purposes.
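
The trick above works because `bool` is a subclass of `int`, so `"X"[:False]` is the same as `"X"[:0]`. A small standalone sketch of the idea follows. Here `THRESHOLD` is a hypothetical stand-in for the test suite's `TIER2_THRESHOLD`, and `run` is an illustrative helper, not the actual test.

```python
THRESHOLD = 100  # hypothetical stand-in for TIER2_THRESHOLD


def run(n):
    seen = set()
    for i in range(n):
        # i never equals n inside range(n), so this is always False,
        # but only as a runtime fact, not as a compile-time constant.
        false = i == n
        # Slicing with False behaves like slicing with 0.
        empty = "X"[:false]
        seen.add(empty)
    return seen
```

Every iteration produces `""`, yet nothing in the loop body is a literal empty string that an optimizer could fold away.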

markshannon · 2025-02-24T16:24:31Z

Python/optimizer_bytecodes.c

+            // *can't* narrow res, since that would cause the guard to be
+            // removed and the narrowed value to be invalid:
+            if (next_opcode == _GUARD_IS_FALSE_POP) {
+                sym_set_const(value, Py_GetConstant(Py_CONSTANT_EMPTY_STR));


This is strictly incorrect. We don't know that `value` is `""` until after the `_GUARD_IS_FALSE_POP`.
This matters because when we start attaching type information to side exits, as we probably will in 3.15, this could lead us to infer that `value` is `""` on both branches, which would be wrong.

There are two possible fixes for this:

1. Combine `TO_BOOL_STR` and `_GUARD_IS_FALSE_POP`/`_GUARD_IS_TRUE_POP` into a single (super)instruction, then optimize that.
2. Annotate the bool value resulting from the `TO_BOOL` with its input, then in `_GUARD_IS_FALSE_POP` convert the input value to `TO_BOOL`.

I prefer the second option, although it may be more work, as it is more flexible and can be extended more easily.

brandtbucher · 2025-02-24

Yeah, @Fidget-Spinner and I suggested something like the latter on the issue (new symbols like JitBoolOf(JitOptSymbol *source, bool inverted) and JitEqualTo(JitOptSymbol *lhs, JitOptSymbol *rhs, bool inverted)). That's probably the direction we're headed in longer term.

However, I don't think we should let perfect be the enemy of good here. We have nice, working optimizations in these PRs; just because we might sink info onto side exits in the future probably shouldn't prevent us from making changes like this now for 3.14, which are perfectly correct for the current optimizer (which doesn't sink anything).

I'm inclined to land these changes and other similar ones for ==/!= now, and make the symbolic representation of derived boolean values more complex later as an improvement (it will also be able to handle more uncommon cases like x = y == 42; if x: ...). I'm really worried that if we try to "future-proof" optimizations based on what we could do six months from now, it will prevent actual improvements in the near term.

But I'll defer to you here. If having value be narrowed one uop too early in the instruction stream is enough to block this PR, I can work with these new contributors on the more complex solution. But as-is, this has no bugs and works as intended. We don't sink value info onto side exits, so it's correct.

bedevere-app · 2025-02-24T16:30:01Z

A Python core developer has requested some changes be made to your pull request before we can consider merging it. If you could please address their requests along with any other requests in other reviews from core developers that would be appreciated.

Once you have made the requested changes, please leave a comment on this pull request containing the phrase I have made the requested changes; please review again. I will then notify any core developers who have left a review that you're ready for them to take another look at this pull request.

fluhus requested a review from Fidget-Spinner as a code owner February 22, 2025 22:42

bedevere-app bot added the awaiting review label Feb 22, 2025

bedevere-app bot mentioned this pull request Feb 22, 2025

Better constant narrowing in the JIT optimizer #130415

Open

17 tasks

brandtbucher self-assigned this Feb 22, 2025

brandtbucher added the performance (Performance or resource usage), interpreter-core (Objects, Python, Grammar, and Parser dirs), and topic-JIT labels Feb 22, 2025

brandtbucher requested changes Feb 22, 2025

Python/optimizer_bytecodes.c (outdated)

Lib/test/test_capi/test_opt.py (outdated)

bedevere-app bot added awaiting changes and removed awaiting review labels Feb 22, 2025

brandtbucher approved these changes Feb 22, 2025

bedevere-app bot added awaiting merge and removed awaiting changes labels Feb 22, 2025

markshannon requested changes Feb 24, 2025

bedevere-app bot added awaiting changes and removed awaiting merge labels Feb 24, 2025

markshannon mentioned this pull request Feb 24, 2025

GH-130415: Optimize JIT path for _TO_BOOL_INT branching #130477

Closed

fluhus and others added 7 commits March 2, 2025 16:14

Add failing regression test for _TO_BOOL_STR

80c1c6c

Improve test to outsmart JIT

23695d6

Add optimization path to _TO_BOOL_STR

fcf4116

📜🤖 Added by blurb_it.

e23b84a

Correct res type and change f var to empty

ab263fe

Add Amit Lavon to ACKS

d5bfcdc

Assign truthiness to empty strings in JIT

601392d

fluhus force-pushed the hack-night2 branch from 89317ab to 601392d Compare March 3, 2025 01:16

Improve un-proveable empty string in JIT test

58884ab

brandtbucher approved these changes Mar 3, 2025

bedevere-app bot added awaiting merge and removed awaiting changes labels Mar 3, 2025

brandtbucher changed the title ~~GH-130415: Add JIT optimization path for _TO_BOOL_STR~~ GH-130415: Narrow str to "" based on boolean tests Mar 3, 2025

Merge branch 'main' into hack-night2

905d3ef

brandtbucher merged commit 691354c into python:main Mar 4, 2025
58 checks passed

bedevere-app bot removed the awaiting merge label Mar 4, 2025

fluhus deleted the hack-night2 branch March 21, 2025 20:23

seehwan pushed a commit to seehwan/cpython that referenced this pull request Apr 16, 2025

pythonGH-130415: Narrow str to "" based on boolean tests (pythonGH-13…

2d5f7cd

…0476)
