Skip to content

v0.1.5

Compare
Choose a tag to compare
@github-actions github-actions released this 13 Aug 10:19
· 0 commits to 7470edc345f64f8a5229f7878da81117ee678c09 since this release
838d050

0.1.5 (2024-08-13)

Bugfix

  • Fix PagedPrefill python api and some typos (#441) (3fff008)
  • fix prefill kernels' lse result for empty kv-cache (#440) (6ac28f4)

Features

  • decouple float and int workspace buffer (#442) (a7ee566)

Performance Improvements

  • faster fp8->fp16 dequantization for pre sm_90 arch (#439) (c93f647)

Acknowledgement

We thank contributions and feedbacks from the community: @comaniac, @hnyls2002, @jianfei-wangg, @Yard1.