We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Hi, will it have a CPU version impl?
The text was updated successfully, but these errors were encountered:
We focus on reducing GPU memory reads/writes to speed up attention & save memory. The bottlenecks on CPUs are likely to be different.
Sorry, something went wrong.
Could have been great for the sake of having uniform code and only change the device...
Sure would have been great, someone just needs to write the code
No branches or pull requests
Hi, will it have a CPU version impl?
The text was updated successfully, but these errors were encountered: