pt: fix se_a type_one_side performance degradation #3361

njzjz · 2024-02-28T23:11:37Z

The code in this PR is ugly, but applying a mask is causing performance degradation for ~3 ms/step.

When applying a mask, aten::nonzero has a high host time, as it causes host-device synchronization:

After fixing:

See pytorch/pytorch#12461 for more information.

Signed-off-by: Jinzhe Zeng <[email protected]>

codecov · 2024-02-28T23:19:06Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 75.86%. Comparing base (2a1508d) to head (aa02b18).

Additional details and impacted files

@@           Coverage Diff           @@
##            devel    #3361   +/-   ##
=======================================
  Coverage   75.85%   75.86%           
=======================================
  Files         416      416           
  Lines       34908    34914    +6     
  Branches     1614     1614           
=======================================
+ Hits        26480    26486    +6     
  Misses       7560     7560           
  Partials      868      868

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

The code in this PR is ugly, but applying a mask is causing performance degradation for ~3 ms/step. When applying a mask, `aten::nonzero` has a high host time, as it causes host-device synchronization: ![image](https://github.com/deepmodeling/deepmd-kit/assets/9496702/86b3518c-206d-410d-928e-2f605746147c) After fixing: ![image](https://github.com/deepmodeling/deepmd-kit/assets/9496702/af9e86fa-7908-4bbb-ace7-58b4602e167f) See pytorch/pytorch#12461 for more information. Signed-off-by: Jinzhe Zeng <[email protected]>

wanghan-iapcm · 2024-02-29T02:15:30Z

Is the exclude_mask has side effect on the performance?

njzjz · 2024-02-29T02:29:06Z

Is the exclude_mask has side effect on the performance?

It has a different problem. It seems the integer index is slow: pytorch/pytorch#15245 I haven't tested it.

pt: fix se_a type_one_side performance degradation

aa02b18

Signed-off-by: Jinzhe Zeng <[email protected]>

njzjz requested a review from wanghan-iapcm February 28, 2024 23:11

github-actions bot added the Python label Feb 28, 2024

wanghan-iapcm approved these changes Feb 29, 2024

View reviewed changes

wanghan-iapcm added this pull request to the merge queue Feb 29, 2024

wanghan-iapcm removed this pull request from the merge queue due to a manual request Feb 29, 2024

wanghan-iapcm added this pull request to the merge queue Feb 29, 2024

Merged via the queue into deepmodeling:devel with commit 48c8818 Feb 29, 2024

njzjz mentioned this pull request Apr 2, 2024

[TYPO] #3635

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pt: fix se_a type_one_side performance degradation #3361

pt: fix se_a type_one_side performance degradation #3361

njzjz commented Feb 28, 2024 •

edited

Loading

codecov bot commented Feb 28, 2024 •

edited

Loading

wanghan-iapcm commented Feb 29, 2024

njzjz commented Feb 29, 2024

pt: fix se_a type_one_side performance degradation #3361

pt: fix se_a type_one_side performance degradation #3361

Conversation

njzjz commented Feb 28, 2024 • edited Loading

codecov bot commented Feb 28, 2024 • edited Loading

Codecov Report

wanghan-iapcm commented Feb 29, 2024

njzjz commented Feb 29, 2024

njzjz commented Feb 28, 2024 •

edited

Loading

codecov bot commented Feb 28, 2024 •

edited

Loading