Skip to content

Change the storage of frame to use threadLocal rather than Dict #21993

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Mar 4, 2022

Conversation

potiuk
Copy link
Member

@potiuk potiuk commented Mar 4, 2022

There is a very probable WeakKeyDict bug in Python standard
library (to be confirmed and investigated further) that
manifests itself in a very rare failure of the
test_stacktrace_on_failure_starts_with_task_execute_method

This turned out to be related to an unexpected behaviour
(and most likely a bug - to be confirmed) of WeakKeyDict
when you have potentially two different objects with the
same equals and hash values added to the same
WeakKeyDict as keys.

More info on similar report (but raised for a bit different
reason) bug in Python can be found here:

https://bugs.python.org/issue44140

While we are still investigating the root cause and possibly
submit a bug to Python, this PR changes the mechanism
to store the frame in a Thread Local variable rather than
WeakRefDict.


^ Add meaningful description above

Read the Pull Request Guidelines for more information.
In case of fundamental code change, Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in UPDATING.md.

@potiuk potiuk requested review from kaxil, XD-DENG and ashb as code owners March 4, 2022 12:54
@potiuk potiuk requested review from malthe and uranusjr March 4, 2022 12:55
@potiuk potiuk force-pushed the fix-flaky-stackrace-behaviour branch from d04ea11 to 23c2ccc Compare March 4, 2022 13:33
@malthe malthe self-requested a review March 4, 2022 13:39
@potiuk potiuk closed this Mar 4, 2022
@potiuk potiuk reopened this Mar 4, 2022
There is a very probable WeakKeyDict bug in Python standard
library (to be confirmed and investigated further) that
manifests itself in a very rare failure of the
test_stacktrace_on_failure_starts_with_task_execute_method

This turned out to be related to an unexpected behaviour
(and most likely a bug - to be confirmed) of WeakKeyDict
when you have potentially two different objects with the
same `equals` and `hash` values added to the same
WeakKeyDict as keys.

More info on similar report (but raised for a bit different
reason) bug in Python can be found here:

https://bugs.python.org/issue44140

While we are still investigating the root cause and possibly
submit a bug to Python, this PR changes the mechanism
to store the frame in a Thread Local variable rather than
WeakRefDict.
@potiuk potiuk force-pushed the fix-flaky-stackrace-behaviour branch from 23c2ccc to d418f54 Compare March 4, 2022 15:24
@malthe malthe self-requested a review March 4, 2022 15:28
@github-actions
Copy link

github-actions bot commented Mar 4, 2022

The PR most likely needs to run full matrix of tests because it modifies parts of the core of Airflow. However, committers might decide to merge it quickly and take the risk. If they don't merge it quickly - please rebase it to the latest main at your convenience, or amend the last commit of the PR, and push it with --force-with-lease.

@github-actions github-actions bot added the full tests needed We need to run full set of tests for this PR to merge label Mar 4, 2022
@potiuk potiuk closed this Mar 4, 2022
@potiuk potiuk reopened this Mar 4, 2022
@potiuk
Copy link
Member Author

potiuk commented Mar 4, 2022

Looks like a docs-build inventory fetching problem only. Close/reopen to rebuild to be 100% sure.

@potiuk potiuk merged commit 1949f5d into apache:main Mar 4, 2022
@potiuk potiuk deleted the fix-flaky-stackrace-behaviour branch March 4, 2022 17:46
@ephraimbuddy ephraimbuddy added this to the Airflow 2.2.5 milestone Mar 23, 2022
@ephraimbuddy ephraimbuddy added the changelog:skip Changes that should be skipped from the changelog (CI, tests, etc..) label Mar 23, 2022
ephraimbuddy pushed a commit that referenced this pull request Mar 23, 2022
There is a very probable WeakKeyDict bug in Python standard
library (to be confirmed and investigated further) that
manifests itself in a very rare failure of the
test_stacktrace_on_failure_starts_with_task_execute_method

This turned out to be related to an unexpected behaviour
(and most likely a bug - to be confirmed) of WeakKeyDict
when you have potentially two different objects with the
same `equals` and `hash` values added to the same
WeakKeyDict as keys.

More info on similar report (but raised for a bit different
reason) bug in Python can be found here:

https://bugs.python.org/issue44140

We submitted a PR to fix the problem found
python/cpython#31685

(cherry picked from commit 1949f5d)
ephraimbuddy pushed a commit that referenced this pull request Mar 24, 2022
There is a very probable WeakKeyDict bug in Python standard
library (to be confirmed and investigated further) that
manifests itself in a very rare failure of the
test_stacktrace_on_failure_starts_with_task_execute_method

This turned out to be related to an unexpected behaviour
(and most likely a bug - to be confirmed) of WeakKeyDict
when you have potentially two different objects with the
same `equals` and `hash` values added to the same
WeakKeyDict as keys.

More info on similar report (but raised for a bit different
reason) bug in Python can be found here:

https://bugs.python.org/issue44140

We submitted a PR to fix the problem found
python/cpython#31685

(cherry picked from commit 1949f5d)
ephraimbuddy pushed a commit that referenced this pull request Mar 26, 2022
There is a very probable WeakKeyDict bug in Python standard
library (to be confirmed and investigated further) that
manifests itself in a very rare failure of the
test_stacktrace_on_failure_starts_with_task_execute_method

This turned out to be related to an unexpected behaviour
(and most likely a bug - to be confirmed) of WeakKeyDict
when you have potentially two different objects with the
same `equals` and `hash` values added to the same
WeakKeyDict as keys.

More info on similar report (but raised for a bit different
reason) bug in Python can be found here:

https://bugs.python.org/issue44140

We submitted a PR to fix the problem found
python/cpython#31685

(cherry picked from commit 1949f5d)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
changelog:skip Changes that should be skipped from the changelog (CI, tests, etc..) full tests needed We need to run full set of tests for this PR to merge
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants