Skip to content

Actions: flashinfer-ai/flashinfer

Build FlashInfer Docs

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
406 workflow runs
406 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

doc: fix fp8 bmm documentation (#470)
Build FlashInfer Docs #306: Commit d357a91 pushed by yzh119
August 27, 2024 01:08 56s main
August 27, 2024 01:08 56s
bugfix: Fix sm75 kernel configuration (#449)
Build FlashInfer Docs #305: Commit 3d38d0d pushed by yzh119
August 27, 2024 00:27 1m 0s main
August 27, 2024 00:27 1m 0s
feat: support bmm fp8 (#469)
Build FlashInfer Docs #304: Commit f1c0b68 pushed by yzh119
August 26, 2024 19:31 48s main
August 26, 2024 19:31 48s
doc: fix the use of exclude-members (#468)
Build FlashInfer Docs #303: Commit 2ba3f1c pushed by yzh119
August 26, 2024 10:15 50s main
August 26, 2024 10:15 50s
misc: use the new plan/run API for unittests (#467)
Build FlashInfer Docs #302: Commit 78ec6db pushed by yzh119
August 26, 2024 06:58 58s main
August 26, 2024 06:58 58s
refactor: replace begin_forward/forward/end_forward with plan
Build FlashInfer Docs #301: Commit d940d2e pushed by yzh119
August 25, 2024 08:56 47s main
August 25, 2024 08:56 47s
docs: improve cascade inference documentation (#465)
Build FlashInfer Docs #300: Commit 957572e pushed by yzh119
August 24, 2024 03:18 51s main
August 24, 2024 03:18 51s
doc: another bunch of documentation improvement (#463)
Build FlashInfer Docs #299: Commit f40b255 pushed by yzh119
August 23, 2024 03:54 47s main
August 23, 2024 03:54 47s
feat: add MultiLevelCascadeAttentionWrapper API (#462)
Build FlashInfer Docs #298: Commit 1e37989 pushed by yzh119
August 22, 2024 11:28 48s main
August 22, 2024 11:28 48s
docs: add some documentation (#461)
Build FlashInfer Docs #297: Commit c1f576a pushed by yzh119
August 22, 2024 09:40 51s main
August 22, 2024 09:40 51s
perf: use persistent kernel for merging attention states (#459)
Build FlashInfer Docs #296: Commit be6bf5b pushed by yzh119
August 21, 2024 19:08 1m 0s main
August 21, 2024 19:08 1m 0s
bugfix: fix the python api of prefill wrapper + custom mask (#460)
Build FlashInfer Docs #295: Commit 048560d pushed by yzh119
August 21, 2024 17:21 1m 8s main
August 21, 2024 17:21 1m 8s
perf: slight optimization on fragment layout swizzle (#458)
Build FlashInfer Docs #294: Commit 7c397cb pushed by yzh119
August 21, 2024 06:03 51s main
August 21, 2024 06:03 51s
misc: less syncthreads in renorm kernels (#457)
Build FlashInfer Docs #293: Commit 85b4c77 pushed by yzh119
August 21, 2024 05:51 1m 3s main
August 21, 2024 05:51 1m 3s
misc: improve error handling of sampling kernels (#456)
Build FlashInfer Docs #292: Commit 0dce178 pushed by yzh119
August 20, 2024 11:26 59s main
August 20, 2024 11:26 59s
perf: slight optimization on f16->f8 fragment layout swizzling (#453)
Build FlashInfer Docs #291: Commit 0d61871 pushed by yzh119
August 18, 2024 06:26 51s main
August 18, 2024 06:26 51s
feat: add accept num, emit num metric for ChainSpeculativeSampling (#…
Build FlashInfer Docs #290: Commit fa38b5e pushed by yzh119
August 17, 2024 05:25 57s main
August 17, 2024 05:25 57s
docs: update README (#451)
Build FlashInfer Docs #289: Commit 86c9e55 pushed by zhyncs
August 16, 2024 12:26 1m 0s main
August 16, 2024 12:26 1m 0s
bugfix: fix the prefill/append attention kernel accuracy issue on sm7…
Build FlashInfer Docs #288: Commit 338b2f5 pushed by yzh119
August 16, 2024 00:55 46s main
August 16, 2024 00:55 46s
fix: resolve cu121 compile wired issue (#446)
Build FlashInfer Docs #287: Commit 5f0159e pushed by zhyncs
August 14, 2024 09:47 52s main
August 14, 2024 09:47 52s
ci: set MAX_JOBS to 128 (#445)
Build FlashInfer Docs #286: Commit 838d050 pushed by yzh119
August 13, 2024 23:22 47s main
August 13, 2024 23:22 47s
bugfix: suppress warning #63-D: shift count is too large (#444)
Build FlashInfer Docs #285: Commit d07b19e pushed by yzh119
August 13, 2024 16:40 58s main
August 13, 2024 16:40 58s
chore(main): release 0.1.5 (#435)
Build FlashInfer Docs #284: Commit 7470edc pushed by yzh119
August 13, 2024 10:19 54s main
August 13, 2024 10:19 54s
feat: decouple float and int workspace buffer (#442)
Build FlashInfer Docs #283: Commit a7ee566 pushed by yzh119
August 13, 2024 10:02 58s main
August 13, 2024 10:02 58s
Fix PagedPrefill python api and some typos (#441)
Build FlashInfer Docs #282: Commit 3fff008 pushed by yzh119
August 13, 2024 09:26 1m 4s main
August 13, 2024 09:26 1m 4s