FFT MPI with threads #49
Comments
I haven't tried, but I guess it should just work if you do Note that all other operations besides FFTs, such as transpositions in PencilArrays.jl, are not threaded, so using See also the FFTW docs on combining MPI and threads.
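For readers following along, here is a minimal sketch of the hybrid setup being discussed, not a tested recipe. It assumes a recent PencilFFTs.jl version where `PencilFFTPlan(pen, transform)` is available, and the grid size and the script name `hybrid_fft.jl` are made up for illustration. Only the local FFTs use threads; the transpositions stay single-threaded, as noted above.

```julia
# Hypothetical launch command (4 MPI processes, 2 threads each):
#   mpiexec -n 4 julia -t 2 hybrid_fft.jl
using MPI
using PencilFFTs
using FFTW
using Random

MPI.Init()
comm = MPI.COMM_WORLD

# Let FFTW use as many threads as Julia was started with (`-t`).
# This is set before creating the plan so that planning takes it into account.
FFTW.set_num_threads(Threads.nthreads())

# Global grid size (arbitrary illustrative values).
dims = (64, 64, 64)
pen = Pencil(dims, comm)

# 3D real-to-complex transform, decomposed over the MPI processes.
plan = PencilFFTPlan(pen, Transforms.RFFT())

u = allocate_input(plan)
rand!(u)

# Distributed forward FFT: local FFTs may be threaded, transpositions are not.
uF = plan * u
```

Whether this is faster than pure MPI depends on how much time goes into the transpositions, so it is worth benchmarking both configurations with the same total core count.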
I just assumed that threads are cheaper than MPI processes, so there would be a speed-up if we combine the two.
I guess it's worth trying! I'm very curious to know how it compares to pure MPI.
It seems the results are not what I expected.
As I mentioned, combining threads and MPI is likely slower because transpositions are not threaded, and the cost of transpositions can be comparable to or larger than that of the FFTs themselves. So I'm not really surprised by these results. You may want to see where the time is actually spent. You can use TimerOutputs.jl for this. See here for details on how to enable timers for PencilArrays / PencilFFTs functions.
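As a rough illustration of the timing approach suggested here, the sketch below enables the debug timers and prints the breakdown for one plan. It assumes that `TimerOutputs.enable_debug_timings` applies to the `PencilFFTs`, `PencilArrays` and `Transpositions` modules and that `timer(plan)` returns the `TimerOutput` attached to the plan, as described in the PencilFFTs documentation; the grid size and the number of repetitions are arbitrary.

```julia
using MPI
using PencilFFTs
using TimerOutputs
using Random

# Timers are disabled by default; enable them right after loading the packages.
TimerOutputs.enable_debug_timings(PencilFFTs)
TimerOutputs.enable_debug_timings(PencilArrays)
TimerOutputs.enable_debug_timings(Transpositions)

MPI.Init()
comm = MPI.COMM_WORLD

pen = Pencil((64, 64, 64), comm)  # illustrative grid size
plan = PencilFFTPlan(pen, Transforms.RFFT())

u = allocate_input(plan)
rand!(u)

# Repeat the transform a few times so the timings are less noisy.
for _ in 1:10
    plan * u
end

# Retrieve the timer attached to the plan and print it on the root process only.
to = PencilFFTs.timer(plan)
if MPI.Comm_rank(comm) == 0
    print_timer(to)
end
```

Comparing the FFT and transposition entries in the printed table, for pure MPI and for MPI plus threads, should show where the time goes in each case.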
See also the FFTW docs that I linked above.
Thanks!
Hi @jipolanco, I just found that for a large number of MPI processes
Hi, unfortunately I don't know what is going on there. It would be good to fix this, and for this we need to know where it's hanging exactly. How many processes are you using? Do you have some minimal code that reproduces the issue (on your machine/cluster)?
I just reported a new issue: #51.
Is it possible to combine `PencilFFTs` with the FFTW threads setting? I mean using `PencilFFTs` to run `MPI` processes, with each process performing its FFTs with multiple threads. Does it need any specific settings, or do I just launch julia with `-t` and call `FFTW.set_num_threads(Threads.nthreads())`?