Skip to content

Pull requests: ROCm/triton

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add bf16 gemm pingpong with num_stages=3
#818 opened May 30, 2025 by jungpark-mlir Loading…
Bypass LDS for scale B operand for skinny gemms
#817 opened May 29, 2025 by plognjen Loading…
[DRAFT] Shared/aggregate load
#804 opened May 21, 2025 by alefimov-amd Draft
[AMD] Improve Scheduling for Async BF16 GEMM
#802 opened May 21, 2025 by raikonenfnu Loading…
7 tasks
add predicate mask for atomic_rmw ops
#799 opened May 19, 2025 by scxiao Loading…
4 of 7 tasks
Shaoclee/compare ck
#788 opened May 2, 2025 by k50112113 Loading…
5 of 7 tasks
[WIP] [StreamK]
#782 opened Apr 28, 2025 by zhanglx13 Draft
[AMD] Added bufferOps refinement
#776 opened Apr 14, 2025 by ravil-mobile Loading…
update scale dot assertion in plot_layout.py
#774 opened Apr 10, 2025 by jtang10 Loading…
Tjactions security issue
#773 opened Apr 3, 2025 by Cemberk Loading…
Update FlashAttention transV scripts
#766 opened Mar 21, 2025 by binarman Loading…
Add v2 test to paged_attention_decode
#764 opened Mar 20, 2025 by rahulbatra85 Loading…
MLA prefill, forward_normal benchmark
#750 opened Mar 7, 2025 by Chi-Chu319 Loading…
Cap warp count to 16 for devices with warp size 64
#747 opened Mar 5, 2025 by schung-amd Draft
4 of 7 tasks
Add int4 quantization support to MoE
#715 opened Jan 28, 2025 by rahulbatra85 Loading…
ProTip! Type g i on any issue or pull request to go back to the issue listing page.