-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Optimise OpenCL kernels #7
Labels
enhancement
New feature or request
Comments
OpenCL particle induced velocity: See commit ae683f3 Approximately 7.5% faster:
Not used:
Verified good:
To do:
NB:
|
commit d1a77dc ind_dvort is approximately 27% faster.
As above, there is some overhead in the Julia script used for benchmarking. 3% in worst case. |
commit 6d962b7 ind_dvort is 1.5% faster
|
commit 4db32f2 Changing to transpose method for dvort calculations allows reusing a cross product. 7% faster. |
# for free
to join this conversation on GitHub.
Already have an account?
# to comment
Surely the code can be faster?
The text was updated successfully, but these errors were encountered: