fix cudasynchronize of fft_c2r_compute #63249
base: develop
Your PR was submitted successfully. Thank you for your contribution to the open-source project!
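LGTM. Could you add the two Nsight comparison screenshots to the description, and explain why you copy manually instead of using the thrust library? That would illustrate the problem directly.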
Sorry to inform you that 3f3fc88's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
… fftr2c_improve rebase
1f57460 to 36d44ea
Sorry to inform you that 36d44ea's CIs have passed for more than 7 days. To prevent PR conflicts, you need to re-run all CIs manually.
PR Category
Performance Optimization
PR Types
Performance
Description
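Fixes the problem where fft_c2r_compute stalls for a long time in CUDA synchronization (cudasynchronize) during computation. Using the thrust library causes this operator to spend a long time synchronizing while it computes.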
[Image: 20BE5FD8B1F7F76F94707EE4FC56EA80]
[Image: image]
Before the fix:
After the fix: