New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

#

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Jump to bottom

Look into sse/avx optimization options #168

Open

TysonAndre opened this issue Feb 24, 2022 · 0 comments

Owner

TysonAndre commented Feb 24, 2022

-march=native may help more than doing this by hand in some cases - what works well on some architectures may be worse on others (e.g. skylake)

https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html performance seems to vary, e.g. _mm_loadu_si128 -based approach does worse on skylake than gcc's output for the original implementation of teds_intvector_is_sorted_int16_t

https://en.wikipedia.org/wiki/Broadwell_(microarchitecture)

The text was updated successfully, but these errors were encountered:

# for free to join this conversation on GitHub. Already have an account? # to comment