Skip to content

Actions: flashinfer-ai/flashinfer

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
115 workflow run results
115 workflow run results

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

bugfix: suppress alignment warning of sampling kernels (#297)
Build FlashInfer Docs #171: Commit 1250b68 pushed by yzh119
June 11, 2024 08:47 45s main
June 11, 2024 08:47 45s
bugfix: fix wrong padded_batch_size_ (#296)
Automatically bump version and release Python wheels #140: Commit aff4cf0 pushed by yzh119
June 11, 2024 05:14 26s main
June 11, 2024 05:14 26s
bugfix: fix wrong padded_batch_size_ (#296)
Build FlashInfer Docs #170: Commit aff4cf0 pushed by yzh119
June 11, 2024 05:14 1m 11s main
June 11, 2024 05:14 1m 11s
refactor: refactor decode handler (#294)
Build FlashInfer Docs #169: Commit 60459e4 pushed by yzh119
June 10, 2024 10:33 53s main
June 10, 2024 10:33 53s
refactor: refactor decode handler (#294)
Automatically bump version and release Python wheels #139: Commit 60459e4 pushed by yzh119
June 10, 2024 10:33 24s main
June 10, 2024 10:33 24s
misc: add some notes in cmake.config (#293)
Automatically bump version and release Python wheels #138: Commit 4c5e28b pushed by yzh119
June 10, 2024 07:20 24s main
June 10, 2024 07:20 24s
misc: add some notes in cmake.config (#293)
Build FlashInfer Docs #168: Commit 4c5e28b pushed by yzh119
June 10, 2024 07:20 45s main
June 10, 2024 07:20 45s
doc: fix the math display of group gemm operator (#292)
Build FlashInfer Docs #167: Commit 4198686 pushed by yzh119
June 10, 2024 07:14 57s main
June 10, 2024 07:14 57s
doc: fix the math display of group gemm operator (#292)
Automatically bump version and release Python wheels #137: Commit 4198686 pushed by yzh119
June 10, 2024 07:14 26s main
June 10, 2024 07:14 26s
bugfix: Fix the behavior of decode cuda graph wrapper (#291)
Automatically bump version and release Python wheels #136: Commit e252c94 pushed by yzh119
June 10, 2024 07:07 36s main
June 10, 2024 07:07 36s
bugfix: Fix the behavior of decode cuda graph wrapper (#291)
Build FlashInfer Docs #166: Commit e252c94 pushed by yzh119
June 10, 2024 07:07 49s main
June 10, 2024 07:07 49s
bugfix: fix the synchronization issue in distributed operators (#290)
Automatically bump version and release Python wheels #135: Commit f13ec08 pushed by yzh119
June 9, 2024 08:49 32s main
June 9, 2024 08:49 32s
bugfix: fix the synchronization issue in distributed operators (#290)
Build FlashInfer Docs #165: Commit f13ec08 pushed by yzh119
June 9, 2024 08:49 1m 0s main
June 9, 2024 08:49 1m 0s
feat: initial support of distributed operators (#289)
Automatically bump version and release Python wheels #134: Commit 03553da pushed by yzh119
June 8, 2024 08:24 28s main
June 8, 2024 08:24 28s
feat: initial support of distributed operators (#289)
Build FlashInfer Docs #164: Commit 03553da pushed by yzh119
June 8, 2024 08:24 44s main
June 8, 2024 08:24 44s
cmake: fix DECODE_F8_DTYPES and DECODE_FP8_DTYPES discrepancy (#287)
Automatically bump version and release Python wheels #133: Commit 809abaa pushed by yzh119
June 7, 2024 08:13 32s main
June 7, 2024 08:13 32s
cmake: fix DECODE_F8_DTYPES and DECODE_FP8_DTYPES discrepancy (#287)
Build FlashInfer Docs #163: Commit 809abaa pushed by yzh119
June 7, 2024 08:13 48s main
June 7, 2024 08:13 48s
bugfix: fix the data type of aligned_alloc in handlers (#283)
Automatically bump version and release Python wheels #132: Commit 5a38066 pushed by yzh119
June 5, 2024 06:58 25s main
June 5, 2024 06:58 25s
bugfix: fix the data type of aligned_alloc in handlers (#283)
Build FlashInfer Docs #162: Commit 5a38066 pushed by yzh119
June 5, 2024 06:58 1m 1s main
June 5, 2024 06:58 1m 1s
feat: add group gemm operators (#282)
Automatically bump version and release Python wheels #131: Commit e08ba42 pushed by yzh119
June 5, 2024 03:28 30s main
June 5, 2024 03:28 30s
feat: add group gemm operators (#282)
Build FlashInfer Docs #161: Commit e08ba42 pushed by yzh119
June 5, 2024 03:28 49s main
June 5, 2024 03:28 49s
Add dtype checks for q-kv tensors (#280)
Automatically bump version and release Python wheels #130: Commit 7aadc0d pushed by yzh119
June 4, 2024 05:42 23s main
June 4, 2024 05:42 23s
Add dtype checks for q-kv tensors (#280)
Build FlashInfer Docs #160: Commit 7aadc0d pushed by yzh119
June 4, 2024 05:42 49s main
June 4, 2024 05:42 49s
bugfix: fix cudagraph-compatible prefill/decode apis (#281)
Automatically bump version and release Python wheels #129: Commit 1092e7e pushed by yzh119
June 4, 2024 05:21 25s main
June 4, 2024 05:21 25s
bugfix: fix cudagraph-compatible prefill/decode apis (#281)
Build FlashInfer Docs #159: Commit 1092e7e pushed by yzh119
June 4, 2024 05:21 55s main
June 4, 2024 05:21 55s