Commit 1a03b70

SlyEcho and ardfork authored and committed
Undo mess

---------

Co-authored-by: ardfork <134447697+ardfork@users.noreply.github.com>
1 parent 4366ff9 commit 1a03b70

1 file changed: +2 -2 lines changed

ggml-cuda.cu (+2 -2)
@@ -1661,8 +1661,8 @@ template <int vdr> static __device__ __forceinline__ float vec_dot_q8_1_q8_1_imp
 #else
     const float2 dm8f = __half22float2(dm8);
     const float2 ds8f = __half22float2(ds8);
-    const float d8d8 = __low2float(dm8) * __low2float(ds8);
-    const float m8s8 = __high2float(dm8) * __high2float(ds8);
+    const float d8d8 = dm8f.x * ds8f.x;
+    const float m8s8 = dm8f.y * ds8f.y;
 #endif // GGML_CUDA_F16
 
     // scale second part of sum by QI8_1/ vdr to compensate for multiple threads adding it
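
The two forms in this hunk should produce the same products: `__half22float2` converts both halves of a `half2` to a `float2`, so `.x` and `.y` correspond to `__low2float` and `__high2float` of the same value. The following is a minimal sketch for checking that on a device. The kernel `scale_products` and the host helper `products_match` are hypothetical names used only for illustration; neither is part of ggml-cuda.cu.

```cuda
#include <cuda_fp16.h>
#include <cuda_runtime.h>

// Hypothetical kernel for illustration only; it is not part of ggml-cuda.cu.
__global__ void scale_products(const half2 dm8, const half2 ds8, float * out) {
    // Form removed by this commit: extract and convert each half separately.
    out[0] = __low2float(dm8)  * __low2float(ds8);
    out[1] = __high2float(dm8) * __high2float(ds8);

    // Form restored by this commit: reuse the float2 conversions.
    const float2 dm8f = __half22float2(dm8);
    const float2 ds8f = __half22float2(ds8);
    out[2] = dm8f.x * ds8f.x;
    out[3] = dm8f.y * ds8f.y;
}

// Hypothetical host helper: returns true when both forms give identical products.
bool products_match(float d0, float m0, float d1, float s1) {
    float * dev = nullptr;
    if (cudaMalloc(&dev, 4 * sizeof(float)) != cudaSuccess) {
        return false;
    }
    // Pack the float inputs into half2 values (round-to-nearest) and launch one thread.
    scale_products<<<1, 1>>>(__floats2half2_rn(d0, m0), __floats2half2_rn(d1, s1), dev);

    float host[4] = {0.0f, 1.0f, 2.0f, 3.0f};
    // cudaMemcpy waits for the kernel to finish before copying the results back.
    const cudaError_t err = cudaMemcpy(host, dev, sizeof(host), cudaMemcpyDeviceToHost);
    cudaFree(dev);
    return err == cudaSuccess && host[0] == host[2] && host[1] == host[3];
}
```

Calling `products_match(0.5f, 2.0f, 0.25f, 4.0f)` from host code compares the low-half and high-half products computed both ways. The restored form also reuses `dm8f` and `ds8f`, which the surrounding code already computes.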
