Description
This code:
#include <immintrin.h>

float foo(__m512 x) {
    return _mm512_reduce_add_ps(x);
}
compiles to the following LLVM IR:

define dso_local noundef float @foo(float vector[16])(<16 x float> noundef %x) local_unnamed_addr #0 {
entry:
  %0 = tail call reassoc noundef float @llvm.vector.reduce.fadd.v16f32(float -0.000000e+00, <16 x float> %x)
  ret float %0
}
According to the LangRef, the reassoc here means that the additions may be performed in any order, which is not what Intel documents -- their pseudocode specifies a particular, "tree-like" reduction order.
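For reference, here is that order as I read the pseudocode in Intel's Intrinsics Guide, written out as a scalar C sketch (tree_reduce_add is my own illustrative helper, not anything Clang emits):

/* Scalar sketch of the documented "tree-like" order: add the upper half
 * onto the lower half, then repeat on the lower half until one element
 * is left. Modifies v in place. */
static float tree_reduce_add(float v[16]) {
    for (int len = 8; len > 0; len /= 2)      /* 16 -> 8 -> 4 -> 2 -> 1 */
        for (int i = 0; i < len; i++)
            v[i] = v[i] + v[i + len];         /* fixed pairing, fixed order */
    return v[0];
}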
Even worse, we can chain two of these operations:
#include <immintrin.h>

float foo(__m512 x) {
    float xr = _mm512_reduce_add_ps(x);
    __m512 y = _mm512_set_ps(
        xr, 1.8, 9.3, 0.0, 2.5, 0.0, 6.7, 9.0,
        0.0, 1.8, 9.3, 0.0, 2.5, 0.0, 6.7, 9.0
    );
    return _mm512_reduce_add_ps(y);
}
Now the second addition may be arbitrarily re-associated with the first one. As far as I understand, there is nothing about reassoc that constrains the re-association to only happen "inside" a single operation (and indeed, as a fast-math flag it is explicitly intended to apply when multiple subsequent operations all carry reassoc).
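Because float addition is not associative, re-association across the two reductions is observable in the result. A minimal standalone illustration (my own example, not derived from the code above):

#include <stdio.h>

int main(void) {
    float a = 1.0e8f, b = -1.0e8f, c = 1.0f;
    printf("(a + b) + c = %g\n", (a + b) + c);  /* 1 */
    printf("a + (b + c) = %g\n", a + (b + c));  /* 0 on typical IEEE-754 targets:
                                                   1.0f is absorbed by -1.0e8f */
    return 0;
}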
Clang should probably either lower _mm512_reduce_add_ps to a vendor-specific intrinsic, or LLVM IR needs a version of vector.reduce.fadd that explicitly specifies the "tree-like" reduction order documented by Intel.
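For comparison, the documented pairing can be written out today with explicit intrinsics, which leaves no room for re-association under default FP semantics. This is only a sketch (reduce_add_tree is my own helper name, and it assumes AVX512DQ for _mm512_extractf32x8_ps), not a proposed implementation:

#include <immintrin.h>

/* Same 16 -> 8 -> 4 -> 2 -> 1 pairing as Intel's pseudocode, spelled out
 * with explicit extracts, shuffles and adds. */
static inline float reduce_add_tree(__m512 v) {
    __m256 lo8 = _mm512_castps512_ps256(v);
    __m256 hi8 = _mm512_extractf32x8_ps(v, 1);
    __m256 s8  = _mm256_add_ps(lo8, hi8);                       /* 16 -> 8 */

    __m128 lo4 = _mm256_castps256_ps128(s8);
    __m128 hi4 = _mm256_extractf128_ps(s8, 1);
    __m128 s4  = _mm_add_ps(lo4, hi4);                          /* 8 -> 4 */

    __m128 s2  = _mm_add_ps(s4, _mm_movehl_ps(s4, s4));         /* 4 -> 2 */
    __m128 s1  = _mm_add_ss(s2, _mm_shuffle_ps(s2, s2, 0x55));  /* 2 -> 1 */
    return _mm_cvtss_f32(s1);
}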