Look into sse/avx optimization options #168

Open
TysonAndre opened this issue Feb 24, 2022 · 0 comments

Comments

@TysonAndre
Owner

Compiling with -march=native may help more than writing SSE/AVX code by hand in some cases: an approach that works well on one microarchitecture may be worse on another (e.g. Skylake).

Performance of the intrinsics listed in https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html seems to vary by CPU, e.g. an approach based on _mm_loadu_si128 does worse on Skylake than GCC's output for the original implementation of teds_intvector_is_sorted_int16_t.

https://en.wikipedia.org/wiki/Broadwell_(microarchitecture)
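
For reference, here is a rough sketch of the kind of comparison described above. It is not the actual code of teds_intvector_is_sorted_int16_t: the function names `is_sorted_int16_scalar` and `is_sorted_int16_sse2` are hypothetical, and the SSE2 variant is only one possible way to use `_mm_loadu_si128` (two overlapping unaligned loads plus `_mm_cmpgt_epi16`), assuming a non-decreasing order check on a plain `int16_t` array.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>
#if defined(__SSE2__)
#include <emmintrin.h> /* SSE2 intrinsics */
#endif

/* Hypothetical scalar baseline: the compiler is free to auto-vectorize this. */
static bool is_sorted_int16_scalar(const int16_t *values, size_t len)
{
    assert(values != NULL || len == 0);
    for (size_t i = 1; i < len; i++) {
        if (values[i - 1] > values[i]) {
            return false;
        }
    }
    return true;
}

/* Hypothetical hand-written variant: checks 8 adjacent pairs per iteration. */
static bool is_sorted_int16_sse2(const int16_t *values, size_t len)
{
    assert(values != NULL || len == 0);
    size_t i = 0;
#if defined(__SSE2__)
    /* Each iteration reads values[i..i+8], so it needs i + 9 <= len. */
    for (; i + 9 <= len; i += 8) {
        __m128i cur = _mm_loadu_si128((const __m128i *)(values + i));
        __m128i next = _mm_loadu_si128((const __m128i *)(values + i + 1));
        /* Lanes where cur > next become 0xFFFF; any set bit means unsorted. */
        if (_mm_movemask_epi8(_mm_cmpgt_epi16(cur, next)) != 0) {
            return false;
        }
    }
#endif
    /* Scalar tail (or the whole array when SSE2 is unavailable). */
    for (size_t j = i + 1; j < len; j++) {
        if (values[j - 1] > values[j]) {
            return false;
        }
    }
    return true;
}
```

Building both functions with and without -march=native and benchmarking them on the target CPUs would show whether the hand-written version is worth keeping; as noted above, the result may differ between microarchitectures.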
