Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

feat: expose decoupled kv-cache to pytorch api #383

Merged
merged 3 commits into from
Jul 20, 2024
Merged

Conversation

yzh119
Copy link
Collaborator

@yzh119 yzh119 commented Jul 19, 2024

Followup of #379

@yzh119 yzh119 marked this pull request as ready for review July 20, 2024 01:23
@yzh119 yzh119 merged commit 457a0ae into main Jul 20, 2024
yzh119 added a commit that referenced this pull request Jul 20, 2024
🤖 I have created a release *beep* *boop*
---

##
[0.1.1](v0.1.0...v0.1.1)
(2024-07-20)

### Bugfix

* fix the invalid kernel configuration for architectures with small
shared memory size
([#385](#385))
([cdac57](cdac577))

### Features

* expose decoupled kv-cache to pytorch api
([#383](#383))
([457a0ae](457a0ae))


### Performance Improvements

* use stmatrix in epilogue for sm90+
([#380](#380))
([c6f20d1](c6f20d1))
---
This PR was generated with [Release
Please](https://github.com/googleapis/release-please). See
[documentation](https://github.com/googleapis/release-please#release-please).

---------

Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: yzh119 <expye@outlook.com>
@yzh119 yzh119 deleted the torch-decouple-kv branch July 24, 2024 10:38
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant