You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Not sure how important this is across architectures since only AVX2 supports gather (ARM only supports gather via SVE, which LLVM doesn't seem to support, and WASM seems to only support it via an experimental proposal), but I was porting a noise library to another SIMD library a while back, and using AVX2 gather speeds things up by ~20-40%. Might be worth implementing for AVX2 and falling back to a scalar implementation for everything else.
The text was updated successfully, but these errors were encountered:
Not sure how important this is across architectures since only AVX2 supports gather (ARM only supports gather via SVE, which LLVM doesn't seem to support, and WASM seems to only support it via an experimental proposal), but I was porting a noise library to another SIMD library a while back, and using AVX2 gather speeds things up by ~20-40%. Might be worth implementing for AVX2 and falling back to a scalar implementation for everything else.
The text was updated successfully, but these errors were encountered: