core intrinsics that should take void but take *u8 #199

gnzlbg · 2017-11-20T17:51:43Z

Using c_void requires the std::os::raw module.

This should be fixed in Rust upstream, the tracking issue is: rust-lang/rust#36193

The following intrinsics should use c_void on its API, but use *u8 instead so that they can be exposed in coresimd:

void _mm_clflush (void const* p)
xrstor
xrstor64
xrstors
xrstors64
xsave
xave64
xsaveopt
xasveopt64
xsaves
xsaves64
xsavec
xsavec64

The text was updated successfully, but these errors were encountered:

parched · 2017-11-21T20:32:03Z

What about defining a 512 byte long, 16 byte aligned type to be used for all x*? It would be much safer than c_void or *u8. Or is that deviating too much from the exact c prototype for the intrinsic?

gnzlbg · 2017-11-21T21:08:41Z

@parched

If you meant to define that for fxrs-family of intrinsics (FXSAVE and FXRSTOR) that is probably a good idea although I am not 100% sure. Technically one needs a 512-byte buffer, but the docs of FXSAVE say:

Bytes 464:511 are available to software use. The processor does not write to bytes 464:511 of an FXSAVE area.

And the docs of FXRSTOR say:

FXRSTOR ignores the content of bytes 464:511 in an FXSAVE state image.

So technically, while one need a 512-byte image to call FXRSTOR (I interpret the docs as "those bytes are ignored but read), one can still store images using FXSAVE into a smaller buffer.

So... if you convince me that it always works for fxrs I think it would be a good idea to do it there.

However, this doesn't help you a lot with xsave and friends because on CPUs with AVX (or AVX-512) the size of the xsave area is "customizable", and can vary from 512 up to 2560 depending on the mask and optimizations used (xsaves/xsavec/xsaveopt...). I don't think we could even do it for xsave since whether the CPU has AVX or AVX-512 might not be known till run-time.

parched · 2017-11-21T21:22:53Z

Ah ok right, I was looking at https://software.intel.com/en-us/cpp-compiler-18.0-developer-guide-and-reference-xsave-/-xsavec-/-xsaves but that appears to be wrong, more than 512 bytes is needed. Probably just best to leave it as a byte pointer then. It would be nice to encode the alignment requirement some how though, or does that vary with size too?

gnzlbg · 2017-11-21T21:28:35Z

It would be nice to encode the alignment requirement some how though, or does that vary with size too?

That does not vary with size, its 16-byte alignment for FXSAVE/FXRSTOR and 64-byte alignment for all other ones.

Is there an easy way to encode the alignment requirement?

Ah ok right, I was looking at https://software.intel.com/en-us/cpp-compiler-18.0-developer-guide-and-reference-xsave-/-xsavec-/-xsaves but that appears to be wrong, more than 512 bytes is needed.

Yes that is not right in general, e.g., xsave cannot return the AVX2 registers in 512 bytes, it needs 800 bytes for that. It is only right if you have a CPU that doesn't have AVX2, then it should be equivalent to FXSAVE, but maybe faster since you can use the mask to decide which registers you want to save and you might not want to save all of them.

alexcrichton · 2018-01-29T05:01:01Z

I believe the current status of this issue is that we've since added automatic verification of all intrinsics and their signatures. We have an explicit mapping that allows *mut u8 to map to void* in C (or what Intel specifies).

So at least in that sense we're consistently inconsistent with Intel's specification! My guess is that we're likely to stick with this, but that's at least where we're at!

gnzlbg · 2018-01-29T08:59:24Z

I think it would be better to just map void* to c_void and be consistent with the spec.

Providing c_void in core is a problem worth solving anyways. Maybe we could use simd-verify to generate an exhaustive list of intrinsics running into this, and just not stabilize those at first?

alexcrichton · 2018-01-29T14:43:33Z

It's true yeah if we had c_void in libcore I think there'd be an obvious choice of what to do. I'm also ok skipping stabilizing this if we feel like we really want to wait for c_void, although we'd want to audit the set of intrinsics to make sure they're not too desirable.

alexcrichton · 2018-02-11T16:31:25Z

I'm going to close this in favor of rust-lang/rfcs#2325 where I think we'll decide that either "u8 is fine" or we'll vendor c_void in libcore.

gnzlbg mentioned this issue Nov 20, 2017

[coresimd] extracts the no_std components into the coresimd crate #197

Merged

alexcrichton closed this as completed Feb 11, 2018

gnzlbg mentioned this issue Mar 9, 2018

How do I pass a pointer to the _mm_cmpestrm instrinsic? #360

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

core intrinsics that should take void but take *u8 #199

core intrinsics that should take void but take *u8 #199

gnzlbg commented Nov 20, 2017 •

edited

Loading

parched commented Nov 21, 2017

Uh oh!

gnzlbg commented Nov 21, 2017 •

edited

Loading

Uh oh!

parched commented Nov 21, 2017

Uh oh!

gnzlbg commented Nov 21, 2017 •

edited

Loading

Uh oh!

alexcrichton commented Jan 29, 2018

Uh oh!

gnzlbg commented Jan 29, 2018

Uh oh!

alexcrichton commented Jan 29, 2018

Uh oh!

alexcrichton commented Feb 11, 2018

Uh oh!

core intrinsics that should take void but take *u8 #199

core intrinsics that should take void but take *u8 #199

Comments

gnzlbg commented Nov 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

parched commented Nov 21, 2017

Uh oh!

gnzlbg commented Nov 21, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

parched commented Nov 21, 2017

Uh oh!

gnzlbg commented Nov 21, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexcrichton commented Jan 29, 2018

Uh oh!

gnzlbg commented Jan 29, 2018

Uh oh!

alexcrichton commented Jan 29, 2018

Uh oh!

alexcrichton commented Feb 11, 2018

Uh oh!

gnzlbg commented Nov 20, 2017 •

edited

Loading

gnzlbg commented Nov 21, 2017 •

edited

Loading

gnzlbg commented Nov 21, 2017 •

edited

Loading