-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Implement helicity recycling in our CUDA/C++ #279
Comments
I would actually doubt that doing helicity recycling for gpu is a good idea since this blows up the size of the code and the memory requirement. For vectorised cpu, that is obviously an option. |
This was referenced Mar 10, 2022
Within PR #401, note that I had to introduce this fix while moving to v311 In practice
|
# for free
to join this conversation on GitHub.
Already have an account?
# to comment
This is a followup of issue #276.
In order to compare C++/CUDA and Fortran throughputs, we should make sure that they use the same algorithm. This is presently not the case: we are comparing a faster Fortran with helicity recycling to a slower C++ without helicity recycling.
In issue #276, I will follow up a better estimate of slower Fortran without helicity recycling, that can be directly compared to C++.
But what we really need to do is implement helicity recycling in the CUDA/C++. @oliviermattelaer is this something that would be complicated (an/or maybe is already underway)? Thanks
The text was updated successfully, but these errors were encountered: