
Speedup import and add regression check for import time #238


Merged
29 commits merged into janosh:main on Nov 2, 2024

Conversation

DanielYang59
Collaborator

@DanielYang59 DanielYang59 commented Oct 20, 2024

Summary

Profile command: python -X importtime -c "import pymatviz" 2> pmv.log && tuna pmv.log

@DanielYang59 DanielYang59 added ux User experience pkg Package labels Oct 20, 2024
@DanielYang59 DanielYang59 self-assigned this Oct 20, 2024
@DanielYang59
Collaborator Author

DanielYang59 commented Oct 20, 2024

I have a bad feeling about this PR; I think I might prefer to wait until the import fix in monty gets merged:

  • It badly pollutes the code base by lazily importing almost everything related to pymatgen, because any import involving from monty.json import xxx could potentially slow down the import (if torch is installed).

Meanwhile, for some reason, every time I lazily import one package, another package seems to pop up from nowhere and keep the total import time roughly the same. I think the reason is that the same costly packages end up imported somewhere in the end, and iterative refactoring just breaks them into smaller chunks (before the refactor, they were all imported in the first module that needed them; then we keep relocating the currently most expensive piece elsewhere), so the most expensive chunk keeps shrinking while the total import time stays roughly unchanged.

Profile on main branch (without torch):

(screenshot: tuna import-time profile on main)

The tip of this branch:

(screenshot: tuna import-time profile at the tip of this branch)

@janosh
Owner

janosh commented Oct 20, 2024

Thanks a lot for digging into this, really appreciated! I suspect you're right, we're dependent on upstream PRs being merged to achieve significant improvements here.

@janosh
Owner

janosh commented Oct 22, 2024

i think it would be good to add performance regression tests to this PR and run them both on main and in this branch to compare. we should have them to catch future slowdowns before they enter main. i posted some half-baked import time tests in #209 (comment) which might help. no worries if you're busy, i'll try and get to it myself eventually

@DanielYang59
Collaborator Author

DanielYang59 commented Oct 23, 2024

I will do this :) I was also planning to add a runtime unit test (perhaps for core parts of pymatgen too); right now we only test the results, and runtime is not covered at all. Thanks again for bringing up this idea!

@DanielYang59
Collaborator Author

DanielYang59 commented Oct 23, 2024

Regarding the actual test methodology, I haven't found much about best practices for import time tests. Your method certainly works; what about spawning a subprocess (is there any pitfall to this, like subprocess overhead maybe?), something like:

import subprocess
import sys
import time


def measure_import_time(import_command: str) -> float:
    """Time how long a fresh Python subprocess takes to run `import_command`."""
    start_time = time.time()
    subprocess.run([sys.executable, "-c", import_command], check=True)
    return time.time() - start_time

I hope I'm correct here; I suspect there's a slight pitfall with the sys.modules method though, because it only pops the outer module. Say we have monty, which imports torch indirectly/internally: if we just pop monty, the actually expensive torch would still remain in the import cache, so:

  • if name == module_name or name.startswith(f"{module_name}.") doesn't seem to capture indirect imports?
  • If we were to profile several modules at the same time (say pmv.a and pmv.b both import the same module), the shared internal module would still be cached.
  • It's a bit hard to average across several runs?
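The first pitfall can be sketched like this (a minimal illustration using a stdlib module, not the actual test code):

```python
import importlib
import sys
import time


def timed_reimport(module_name: str) -> float:
    """Naively re-time an import by popping only the top-level module.

    Submodules and transitive dependencies (e.g. torch pulled in via monty)
    stay in sys.modules, so the measured time is far too optimistic.
    """
    sys.modules.pop(module_name, None)
    start = time.perf_counter()
    importlib.import_module(module_name)
    return time.perf_counter() - start


first = timed_reimport("json")   # json's own dependencies may already be warm
second = timed_reimport("json")  # everything except "json" itself is cached
```

A subprocess sidesteps this entirely, since each run starts with an empty import cache.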

Is there any advantage to using relative time (time.perf_counter()) over real-world time (time.time())? The latter seems more comprehensible to me (though at the end of the day we still need to collect the reference times from CI).

@janosh
Owner

janosh commented Oct 23, 2024

Your method certainly works, what about spawning a subprocess (is there any pitfall about this, like subprocess overhead maybe?), something like:

i'm not sure but i think subprocess should be fine since afaik it runs in a completely isolated process with separate import caching

Is there any advantage to use relative time (time.perf_counter()) over real-world time (time.time())?

yes, have a look at this video which explains the virtues of time.perf_counter
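In short (context, not from the video): time.time() reads the wall clock, which can jump when the system clock is adjusted (e.g. NTP sync), while time.perf_counter() is monotonic and uses the highest-resolution timer available, making it the right tool for measuring intervals:

```python
import time

start_wall = time.time()          # wall clock: can jump if the system clock is adjusted
start_perf = time.perf_counter()  # monotonic, high resolution: never goes backwards

time.sleep(0.05)

wall_elapsed = time.time() - start_wall
perf_elapsed = time.perf_counter() - start_perf
# perf_elapsed is guaranteed non-negative; wall_elapsed usually agrees,
# but only perf_counter is safe for benchmarking.
```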

@DanielYang59
Collaborator Author

DanielYang59 commented Oct 23, 2024

Your method certainly works, what about spawning a subprocess (is there any pitfall about this, like subprocess overhead maybe?), something like:

i'm not sure but i think subprocess should be fine since afaik it runs in a completely isolated process with separate import caching

Thanks! Let's use subprocess for now; this should make things much easier (if there is no pitfall).

Is there any advantage to use relative time (time.perf_counter()) over real-world time (time.time())?

yes, have a look at this video which explains the virtues of time.perf_counter

Thanks for sharing, I'll have a look later, but I certainly trust your judgement.

By the way, how do we regenerate the .pytest-split-durations record? (Not now; after we finish everything. The current import is still very slow and might need more tuning.)

@DanielYang59
Collaborator Author

DanielYang59 commented Oct 23, 2024

(screenshot: tuna import-time profile before the change)

Currently, about 12% of the import time is spent importing plotly.figure_factory in ptable.plotly, where it's only used once across the entire code base; I have lazily imported it in ed48b13.

After that:
(screenshot: tuna import-time profile after deferring plotly.figure_factory)
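The lazy-import pattern here moves the import from module level into the only function that needs it, so `import pymatviz` no longer pays its cost. A sketch, with a hypothetical function name and a stdlib stand-in for plotly.figure_factory:

```python
def make_figure(data: dict) -> str:
    # Deferred import: the cost is paid on first call, not at package import.
    # In the actual change, `plotly.figure_factory` is imported inside the
    # one function in pymatviz.ptable.plotly that uses it.
    import json  # stand-in for `import plotly.figure_factory as ff`

    return json.dumps(data)
```

The first call pays the import cost once; later calls hit sys.modules and are effectively free.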

@DanielYang59 DanielYang59 changed the title Speedup import Speedup import and add regression check for import time Oct 23, 2024
@janosh
Owner

janosh commented Oct 23, 2024

By the way, how do we regenerate the .pytest-split-durations record (not now, after we finish everything, current import is still very slow and might need more tuning).

the command for that is

pytest --store-durations --durations-path tests/.pytest-split-durations

i should have documented that somewhere. probably best in test.yml where we set test-cmd: ...

@DanielYang59
Collaborator Author

DanielYang59 commented Oct 23, 2024

Thanks a ton for letting me know!

What I don't really understand is how we'd generate this runtime record via GitHub CI runners (in contrast to running that command locally)?

Especially for pymatgen where:

  • it's huge, and I don't have the resources to run all jobs locally
  • tasks are split (I haven't tried myself; perhaps the results would be combined automatically?)

I have a feeling that we may not want this test to run on every commit (I don't expect we'd introduce changes that vary the import time very often), as it would slow everything down. Perhaps we only run it when merging into main?

For this PR, I currently can't think of further improvements to make, as what's left is pretty much core packages like scipy, pandas and matplotlib. Do we perhaps want to merge it (after I modify it to run on the main branch only)?
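One way to gate the test (a hypothetical sketch; the exact condition used in the PR may differ) is to check the GITHUB_REF environment variable that GitHub Actions sets:

```python
import os


def should_run_import_time_test() -> bool:
    """Gate the import time regression test so it only runs on main.

    GITHUB_REF is "refs/heads/main" when a workflow runs on the main branch;
    it's unset locally, so the slow test would be skipped there too.
    """
    return os.getenv("GITHUB_REF", "") == "refs/heads/main"
```

The test itself could then carry e.g. @pytest.mark.skipif(not should_run_import_time_test(), ...).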

@DanielYang59 DanielYang59 marked this pull request as ready for review October 23, 2024 08:29
@janosh
Owner

janosh commented Nov 2, 2024

very thankful for this PR! 👍 didn't mean to keep it parked so long

What I don't really understand is how do we generate this runtime record via GitHub CI runners (in contrast to run that command locally)?

i think generating locally should be fine. what matters shouldn't be absolute values for test run times but how long a test takes relative to the others, which will be similar in CI and locally

it's huge and I don't have that much resources to run all jobs locally

even pymatgen only takes 10 min or so to run the whole test suite locally for me. curious how long for you?

Perhaps we only run it only when merging into main?

is the test slow enough in CI that it would extend the overall time to checks complete if it ran inside its own split? i just tried test_import_time locally and got 80.52s. it will be more on GitHub, but tests already take ~2 min.
either way, i think it's fine to only run this test on main

is this PR ready to go from your side? are we waiting on a monty release to get the new startup benefits?

@janosh
Owner

janosh commented Nov 2, 2024

are we waiting on a monty release to get the new startup benefits?

ah, saw your comment in pyproject

    # TODO: pmv doesn't actually depend on monty, however latest monty
    # includes a critical import patch, remove this after pmg bump dep
    "monty>=2024.10.21",

@janosh janosh merged commit 691a632 into janosh:main Nov 2, 2024
25 checks passed
@DanielYang59 DanielYang59 deleted the speedup-import branch November 3, 2024 03:35
@DanielYang59
Collaborator Author

i think generating locally should be fine.

even pymatgen only takes 10 min or so to run the whole test suite locally for me. curious how long for you?

Yep, it should take about the same time as the pymatgen CI (the macOS runner runs on an M1 chip; each split takes around 1 min), as I'm usually coding on an M3 MacBook Air. I'm just curious, as I think running on a dedicated runner would give better scalability and consistency :)

is the test slow enough in CI that it would extend overall time to checks complete if it ran inside its own split?

The original implementation ran 10 repeats, but I reduced the repeat count to 3, so it should be fine now.

    for module_name in REF_IMPORT_TIME
}

# Print out the import times in a copyable format
print("\nCopyable import time dictionary:")
print("{")
for module_name, import_time in import_times.items():
    print(f'    "{module_name}": {import_time:.2f},')
Collaborator Author

@DanielYang59 DanielYang59 Nov 3, 2024


This part was intended to generate a directly copyable (and human-readable, for manual inspection) dict for us to copy and paste to update the time record; printing the dict directly gives a one-liner:

Copyable import time dictionary:
{'pymatviz': 2084.25, 'pymatviz.coordination': 2342.41, 'pymatviz.cumulative': 2299.73, 'pymatviz.histogram': 2443.11, 'pymatviz.phonons': 2235.57, 'pymatviz.powerups': 2172.71, 'pymatviz.ptable': 2286.77, 'pymatviz.rainclouds': 2702.03, 'pymatviz.rdf': 2331.98, 'pymatviz.relevance': 2256.29, 'pymatviz.sankey': 2313.12, 'pymatviz.scatter': 2312.48, 'pymatviz.structure_viz': 2330.39, 'pymatviz.sunburst': 2395.04, 'pymatviz.uncertainty': 2317.87, 'pymatviz.xrd': 2242.09}

Owner


i saw that. the thing is, if you paste that printed dict into a python file, ruff will auto-format it onto multiple lines, so having easier-to-read code seemed more important

Collaborator Author


Fair point!

@@ -69,12 +66,12 @@ def measure_import_time_in_ms(module_name: str, count: int = 3) -> float:
    """
    total_time = 0.0

    for _ in range(count):
        start_time = time.perf_counter_ns()
Collaborator Author

@DanielYang59 DanielYang59 Nov 3, 2024


Thanks for catching this. Using perf_counter_ns was intended to avoid float precision loss (the former gives time as an int, while the latter gives a float), but admittedly for our case the error should be negligible as: 1. we're far from the ns level; 2. we're rounding it anyway.

Use perf_counter_ns() to avoid the precision loss caused by the float type.
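Pieced together from the diff hunks above, the helper presumably looks roughly like this (a sketch; details may differ from the merged code):

```python
import subprocess
import sys
import time


def measure_import_time_in_ms(module_name: str, count: int = 3) -> float:
    """Average the import time of `module_name` over `count` fresh subprocesses.

    perf_counter_ns returns an int, sidestepping float precision loss
    (though at the millisecond scale the difference is negligible).
    """
    total_time = 0.0
    for _ in range(count):
        start_time = time.perf_counter_ns()
        subprocess.run([sys.executable, "-c", f"import {module_name}"], check=True)
        total_time += time.perf_counter_ns() - start_time
    return total_time / count / 1e6  # ns -> ms


t = measure_import_time_in_ms("json", count=1)
```

Each subprocess starts with an empty import cache, so repeated runs measure genuine cold-start imports.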

janosh added a commit that referenced this pull request Mar 28, 2025
* Add test framework to monitor module import times with regression tests
* Use time.perf_counter for accurate timing
* Implement lazy imports across multiple modules to improve performance:
  - scipy
  - plotly.figure_factory
  - sklearn
  - pymatgen (Structure, NearNeighbors, PhononDos, PhononBands, Composition)
* Add reference import times for all core modules
* Configure tests to run only on main branch
* Add grace and hard thresholds for import time regression

---------

Co-authored-by: Janosh Riebesell <[email protected]>
Successfully merging this pull request may close these issues.

pymatviz import cost way too high