pre-commit: PR141010 #2365

dtcxzyw · 2025-05-22T05:47:27Z

Link: llvm/llvm-project#141010
Requested by: @dtcxzyw

dtcxzyw · 2025-05-22T06:10:00Z

Diff mode

runner: ariselab-64c-docker
baseline: llvm/llvm-project@bcdce98
patch: llvm/llvm-project#141010
sha256: d0c06aeb883e926f0ff9e4305a253f1deb614e480f24314f92340bb9715f0801
commit: e7d9d88

30 files changed, 289 insertions(+), 294 deletions(-)

Improvements:
  licm.NumHoisted 5628412 -> 5628416 +0.00%
Regressions:
  licm.NumMovedCalls 35453 -> 35449 -0.01%
  globalsmodref-aa.NumNoMemFunctions 812624 -> 812619 -0.00%
  globalsmodref-aa.NumReadMemFunctions 1242242 -> 1242237 -0.00%
  instcombine.NumDeadInst 45389339 -> 45389329 -0.00%
  instcombine.NumCombined 132017079 -> 132017069 -0.00%

1 1 bench/darktable/optimized/blend_gui.ll
2 2 bench/ffmpeg/optimized/af_adeclick.ll
4 4 bench/ffmpeg/optimized/agm.ll
3 3 bench/flac/optimized/lpc.ll
9 10 bench/ncnn/optimized/roialign.ll
9 10 bench/ncnn/optimized/roialign_x86.ll

github-actions · 2025-05-22T06:17:32Z

The patch primarily removes the nsz (no signed zeros) flag from several select instructions in LLVM IR across multiple benchmarks. Here are the major changes:

Removal of nsz from select Instructions:
In multiple functions across blend_gui.ll, af_adeclick.ll, agm.ll, and lpc.ll, the nsz fast-math flag has been removed from select instructions that involve floating-point comparisons. This change aligns with the principle that select inherits fast-math flags from associated floating-point operations but may not require all flags to be explicitly propagated.
Simplification of FP Comparison Logic in ROIAlign Functions:
In roialign.ll and roialign_x86.ll, calls to @llvm.maxnum.f32 for clamping negative values to zero were replaced with explicit fcmp ole followed by a select. This transformation avoids use of legacy max intrinsics and gives more explicit control over comparison semantics, possibly enabling better optimization or consistency.
Minor PHI Node Adjustments:
Some PHI nodes in loop headers within roialign.ll and roialign_x86.ll had their incoming block labels updated due to changes in control flow structure after replacing intrinsic usage. These are mostly mechanical updates and don't affect logic.
Cleanup of Unused Declarations:
The declarations of @llvm.maxnum.f32 in roialign.ll and roialign_x86.ll were removed since they are no longer used after the intrinsic calls were replaced with fcmp + select.
Control Flow Adjustment in Loop Exit Blocks:
A few exit blocks in loops had their branch targets slightly adjusted in roialign_x86.ll, indicating minor restructuring of the CFG likely due to the removal of intrinsic-based patterns and replacement with basic FP comparison logic.

Summary:
This patch improves semantic clarity and potentially enables better optimization by replacing uses of @llvm.maxnum.f32 with fcmp + select, and by cleaning up unnecessary fast-math flags. It also removes unused function declarations and adjusts control flow and PHIs accordingly.

model: qwen-plus-latest
CompletionUsage(completion_tokens=469, prompt_tokens=6607, total_tokens=7076, completion_tokens_details=None, prompt_tokens_details=None)

pre-commit: PR141010

8b7d553

github-actions bot mentioned this pull request May 22, 2025

Task submission #1312

Open

github-actions bot added 2 commits May 22, 2025 06:09

pre-commit: Update

db91b3f

pre-commit: Remap

e7d9d88

dtcxzyw added regression reviewed labels May 22, 2025

dtcxzyw closed this May 22, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

pre-commit: PR141010 #2365

pre-commit: PR141010 #2365

Uh oh!

dtcxzyw commented May 22, 2025

Uh oh!

dtcxzyw commented May 22, 2025

Uh oh!

github-actions bot commented May 22, 2025

Uh oh!

Uh oh!

pre-commit: PR141010 #2365

pre-commit: PR141010 #2365

Uh oh!

Conversation

dtcxzyw commented May 22, 2025

Uh oh!

dtcxzyw commented May 22, 2025

Diff mode

Uh oh!

github-actions bot commented May 22, 2025

Uh oh!

Uh oh!