Add SBGEMM for arm neoversev1 #5108
Signed-off-by: Ye Tao <ye.tao@arm.com>
@martin-frbg Hi Martin, it seems my patch fails on an unrelated mips64 CI pipeline: after printing TEST 109/111 kernel_regress:skx_avx [OK], the job exceeds the maximum execution time. Could you help me understand whether this is expected, or whether my changes accidentally cause a performance issue? Some previous PRs also seem to fail after this kernel_regress:skx_avx test.
Thanks - no problem with your PR, just me lacking the time and energy to confirm and merge it.
@martin-frbg Do we have any release cadence to upgrade the version to 0.30.0?
@aditew01 0.3.30 is planned for the end of the month (please see Milestones).
- Updates changelog
- Removes pytorch/pytorch#139387
- Add prepacking for linear weights. Performance gains better realised by ideep reorder caching.
- Updates OpenBLAS build to use recent commit from develop which includes: OpenMathLib/OpenBLAS#5108
@martin-frbg That sounds great! Thanks for the clarification. I missed the milestone update, but I'll keep a lookout. :)
This PR adds an optimised bf16 GEMM (SBGEMM) kernel for Arm Neoverse V1 machines (256-bit SVE).
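For readers unfamiliar with the operation, here is a minimal scalar sketch of what an SBGEMM computes: A and B hold bfloat16 values, while the products are accumulated into and stored as float32 in C. This is not the kernel from this PR and uses no SVE instructions. The names `bf16_t`, `bf16_to_f32`, `f32_to_bf16` and `sbgemm_ref` are hypothetical, and the sketch assumes row-major storage with no transposes.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical alias: a bfloat16 value stored as its raw 16 bits. */
typedef uint16_t bf16_t;

/* bfloat16 is the upper 16 bits of an IEEE-754 float32. */
static float bf16_to_f32(bf16_t h) {
    uint32_t bits = (uint32_t)h << 16;
    float f;
    memcpy(&f, &bits, sizeof f);
    return f;
}

/* Round-to-nearest-even conversion. NaN inputs are not handled specially
 * in this sketch. */
static bf16_t f32_to_bf16(float f) {
    uint32_t bits;
    memcpy(&bits, &f, sizeof bits);
    bits += 0x7FFFu + ((bits >> 16) & 1u);
    return (bf16_t)(bits >> 16);
}

/* Reference: C = alpha * A * B + beta * C.
 * A is m x k, B is k x n, C is m x n, all row-major (an assumption of
 * this sketch). Inputs are bfloat16; accumulation and output are float32. */
static void sbgemm_ref(int m, int n, int k, float alpha,
                       const bf16_t *a, int lda,
                       const bf16_t *b, int ldb,
                       float beta, float *c, int ldc) {
    assert(lda >= k && ldb >= n && ldc >= n);
    for (int i = 0; i < m; ++i) {
        for (int j = 0; j < n; ++j) {
            float acc = 0.0f;
            for (int p = 0; p < k; ++p) {
                acc += bf16_to_f32(a[i * lda + p]) * bf16_to_f32(b[p * ldb + j]);
            }
            c[i * ldc + j] = alpha * acc + beta * c[i * ldc + j];
        }
    }
}
```

A scalar loop like this is mainly useful as a correctness baseline: an optimised kernel can be checked against it on small matrices, allowing for rounding differences caused by a different accumulation order.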