Merge OpenAI Triton commit `99b5e29` #4219

whitneywhtsang · 2025-05-15T19:49:52Z

This PR changes the Triton base from 86e7117 to 99b5e29 (May 13).
Pass rate: 97.77%->97.25% (#4221, #4222)

Please do not squash and merge this PR.

When `TRITON_PRINT_AUTOTUNING=1`, we expect `self.bench_time` to be populated if we did not use a cached result for the benchmarking results. There was a codepath that used cached results from the disk but did not update the flag saying we used cached results, leading to a crash when `self.bench_time` was unset.

# New contributor declaration
- [x] I am not making a trivial change, such as fixing a typo in a comment.
- [x] I have written a PR description following these [rules](https://cbea.ms/git-commit/#why-not-how).
- [x] I have run `pre-commit run --from-ref origin/main --to-ref HEAD`.
- Select one of the following.
  - [ ] I have added tests.
    - `/test` for `lit` tests
    - `/unittest` for C++ tests
    - `/python/test` for end-to-end tests
  - [x] This PR does not need a test because it should be handled by existing tests.
- Select one of the following.
  - [x] I have not added any `lit` tests.
  - [ ] The `lit` tests I have added follow these [best practices](https://mlir.llvm.org/getting_started/TestingGuide/#filecheck-best-practices), including the "tests should be minimal" section. (Usually running Python code and using the instructions it generates is not minimal.)
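The `TRITON_PRINT_AUTOTUNING` crash described above comes down to a flag that must stay in sync with whether benchmarking actually ran. The following is a minimal sketch of that pattern, not Triton's actual autotuner: `ToyAutotuner`, `run`, `_benchmark`, and the `print_autotuning` parameter (standing in for the environment variable) are all hypothetical names chosen for illustration.

```python
class ToyAutotuner:
    """Toy stand-in for an autotuner; hypothetical, not Triton's class."""

    def __init__(self, disk_cache=None):
        self.disk_cache = disk_cache if disk_cache is not None else {}
        self.memory_cache = {}

    def _benchmark(self, key):
        # Placeholder for real benchmarking; only this path sets bench_time.
        self.bench_time = 0.01
        return f"config-for-{key}"

    def run(self, key, print_autotuning=False):
        used_cached_result = True
        if key not in self.memory_cache:
            if key in self.disk_cache:
                # Disk hit: nothing was benchmarked, so bench_time is unset
                # and the flag must keep saying "cached".
                self.memory_cache[key] = self.disk_cache[key]
            else:
                used_cached_result = False
                self.memory_cache[key] = self._benchmark(key)
                self.disk_cache[key] = self.memory_cache[key]
        config = self.memory_cache[key]
        message = None
        # bench_time is only read when benchmarking really happened.
        if print_autotuning and not used_cached_result:
            message = f"autotuning {key} took {self.bench_time:.2f}s; best: {config}"
        return config, message
```

If the disk-hit branch instead left the flag claiming that benchmarking ran, the final `if` would read an attribute that was never assigned, which is the kind of failure the commit message describes.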

The newly-added autotune cache needs to bail out if given an InterpretedFunction (it has no cache key, and autotuning the interpreter is a little meaningless)
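As a rough illustration of the early bail-out described above, the sketch below skips the cache lookup when the function has no cache key. The classes `ToyJITFunction` and `ToyInterpretedFunction` and the helper `lookup_autotune_cache` are hypothetical stand-ins, not Triton's real types or API.

```python
class ToyJITFunction:
    """Hypothetical compiled-kernel wrapper that exposes a cache key."""

    def __init__(self, name):
        self.cache_key = f"key-{name}"


class ToyInterpretedFunction:
    """Hypothetical interpreter wrapper; deliberately has no cache key."""


def lookup_autotune_cache(fn, disk_cache):
    """Return a cached config, or None when the cache cannot be used."""
    if isinstance(fn, ToyInterpretedFunction):
        # Bail out before touching fn.cache_key, which does not exist here.
        return None
    return disk_cache.get(fn.cache_key)
```

Checking the type before the attribute access keeps the interpreter path from raising, and the caller can treat `None` as "no cached result".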

* verify that WarpGroupDotOp's result encoding is always NVMMA Hopper encoding
* clean up some code with this
* teach FenceInsertion to look through WarpSpecializeOp
* deduplicate fences (e.g. two dots in a loop with captured reg->shared operands)

…(#6753)

This implements a pass for converting TMA loads/stores into legacy loads/stores. This is required for supporting tensor descriptors on hardware that doesn't directly support tensor descriptors.

This does not implement:
* Host side tensor descriptors - I'll submit this in a follow up PR.
* Descriptor reduction operations.
* Interop for unsupported tensor descriptors on devices which support tensor descriptors.

This updates the (old) CUDA and HIP lowering to use this new pass. Lit tests have been added for the pass, and the CUDA tensor descriptor tests that work on hardware have been moved to the language folder since they are now supported on other hardware.

The HIP lowering is untested as I don't have access to an AMD card. I have tested the CUDA lowering on an A100 machine.
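To give a feel for what replacing a descriptor load with a legacy load involves, here is a pure-Python sketch of a 2D block load written as explicit address arithmetic plus a bounds mask. It is only an illustration under stated assumptions: the real pass rewrites compiler IR rather than Python, the function name `descriptor_load` and its parameters are hypothetical, and filling out-of-bounds elements with zero is an assumption of this sketch.

```python
def descriptor_load(buffer, shape, strides, block_shape, offsets):
    """Emulate a 2D block load with pointer math and a bounds mask.

    buffer      -- flat list standing in for device memory
    shape       -- (rows, cols) of the full tensor
    strides     -- element strides for each dimension
    block_shape -- (block_rows, block_cols) of the tile to load
    offsets     -- (row, col) of the tile's top-left corner
    """
    rows, cols = shape
    block = []
    for i in range(block_shape[0]):
        row = []
        for j in range(block_shape[1]):
            r, c = offsets[0] + i, offsets[1] + j
            # Mask: only touch memory for coordinates inside the tensor.
            in_bounds = 0 <= r < rows and 0 <= c < cols
            # Address: the same arithmetic a pointer-based load would use.
            row.append(buffer[r * strides[0] + c * strides[1]] if in_bounds else 0)
        block.append(row)
    return block


# A 3x4 row-major tensor holding 0..11.
tensor = list(range(12))
inner_tile = descriptor_load(tensor, (3, 4), (4, 1), (2, 2), (0, 0))
edge_tile = descriptor_load(tensor, (3, 4), (4, 1), (2, 2), (2, 3))
```

The descriptor bundles the shape, strides, and block shape; the fallback has to carry the same information explicitly and compute a mask for tiles that overhang the tensor edge, as `edge_tile` does.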

Signed-off-by: Whitney Tsang <[email protected]>

NikhilAPatel and others added 4 commits May 13, 2025 09:42

[frontend] Fix autotune cache lookup when interpreter enabled (#6678)

6ae57f9

The newly-added autotune cache needs to bail out if given an InterpretedFunction (it has no cache key, and autotuning the interpreter is a little meaningless)

whitneywhtsang requested review from pbchekin and anmyachev May 15, 2025 19:49

whitneywhtsang self-assigned this May 15, 2025

anmyachev approved these changes May 15, 2025

Merge commit '99b5e296ca21ba21b6c02dc48391422e53f25ffb'

cbadfc0

whitneywhtsang force-pushed the whitneywhtsang/merge branch 4 times, most recently from dd293da to 445e97a Compare May 15, 2025 22:12

Fix test failures from 99b5e29

7fa8493

Signed-off-by: Whitney Tsang <[email protected]>

whitneywhtsang force-pushed the whitneywhtsang/merge branch from 445e97a to 7fa8493 Compare May 15, 2025 22:40

whitneywhtsang marked this pull request as ready for review May 15, 2025 23:23

whitneywhtsang merged commit 85f473a into main May 15, 2025
15 checks passed

whitneywhtsang deleted the whitneywhtsang/merge branch May 15, 2025 23:23
