A few discrepancies in X86-64 Instruction Semantics #376

sdasgup3 · 2019-11-05T19:56:33Z

Hello Team,
I was validating McSema's semantics of various x86-64 instructions against the formal semantics using solver checks and found the following discrepancies.

Example Instruction	Potential Reason	Affected Variants
xaddq %rax, %rax	R4	(5) xaddq_r64_r64 xaddb_r8_r8 xaddb_rh_rh xaddl_r32_r32 xaddw_r16_r16
andnps %xmm2, %xmm1	R1	(20) andnps_xmm_xmm andnq_r64_r64_r64 pandn_xmm_xmm vandnpd_xmm_xmm_xmm vandnpd_ymm_ymm_ymm vandnps_xmm_xmm_xmm vandnps_ymm_ymm_ymm vpandn_xmm_xmm_xmm vpandn_ymm_ymm_ymm andnl_r32_r32_m32 andnpd_xmm_m128 andnps_xmm_m128 andnq_r64_r64_m64 pandn_xmm_m128 vandnpd_xmm_xmm_m128 vandnpd_ymm_ymm_m256 vandnps_xmm_xmm_m128 vandnps_ymm_ymm_m256 vpandn_xmm_xmm_m128 vpandn_ymm_ymm_m256
pmuludq %xmm2, %xmm1	R2	(2) pmuludq_xmm_xmm pmuludq_xmm_m128
cmpxchgl %ecx, %ebx	R3	(1) cmpxchgl_r32_r32
cmpxchgb %ah, %al	R5	(1) cmpxchgb_r8_rh

Reasons

R4

xaddq %rax, %rbx expects the operations (1) temp ← %rax + %rbx, (2) %rax ← %rbx, and (3) %rbx ← temp, in that order. McSema performs the same operation differently: (A) old_rbx ← %rbx, (B) temp ← %rax + %rbx, (C) %rbx ← temp, and (D) %rax ← old_rbx. This fails when both operands are the same register: step (D) overwrites the sum written in step (C) with the old value, so the register ends up unchanged instead of holding the sum.
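
To make the ordering issue concrete, here is a minimal Python sketch of the two write orders described above. The register file is modeled as a plain dict and the function names are hypothetical; this is not McSema's or remill's code, only an illustration of the two sequences.

```python
MASK64 = (1 << 64) - 1

def xadd_manual_order(regs, src, dst):
    """Order from the manual: temp = src + dst; src = dst; dst = temp."""
    regs = dict(regs)
    temp = (regs[src] + regs[dst]) & MASK64
    regs[src] = regs[dst]
    regs[dst] = temp
    return regs

def xadd_reported_order(regs, src, dst):
    """Order reported for the lifter: save dst, add, write dst, then write src."""
    regs = dict(regs)
    old_dst = regs[dst]
    temp = (regs[src] + regs[dst]) & MASK64
    regs[dst] = temp
    regs[src] = old_dst
    return regs

distinct = {"rax": 3, "rbx": 4}
same = {"rax": 5}

# With distinct registers the two orders produce the same state.
agree_distinct = (xadd_manual_order(distinct, "rax", "rbx")
                  == xadd_reported_order(distinct, "rax", "rbx"))

# With the same register, the last write wins and the results differ.
manual_same = xadd_manual_order(same, "rax", "rax")["rax"]      # sum survives
reported_same = xadd_reported_order(same, "rax", "rax")["rax"]  # old value survives

print(agree_distinct, manual_same, reported_same)
```

With distinct operands both functions agree; with xaddq %rax, %rax only the manual's order leaves the doubled value in the register.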

R1
The Intel Manual says the implementation should be DEST ← NOT(DEST) AND SRC, whereas McSema performs DEST ← NOT(SRC) AND DEST.
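
A small Python sketch of the two operand orders, assuming a single 32-bit lane for brevity (the operation is bitwise, so the lane width does not change the argument). The function names and test values are illustrative only.

```python
MASK32 = 0xFFFFFFFF

def andn_manual(dest, src):
    """DEST <- NOT(DEST) AND SRC, as stated in the manual."""
    return (~dest & MASK32) & src

def andn_reported(dest, src):
    """DEST <- NOT(SRC) AND DEST, the swapped form reported above."""
    return (~src & MASK32) & dest

dest, src = 0x0F0F0F0F, 0x00FF00FF
manual = andn_manual(dest, src)
reported = andn_reported(dest, src)

print(hex(manual), hex(reported))
```

The two results differ for any inputs where DEST and SRC are not equal bit for bit, which is why a solver check finds a counterexample right away.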
R2
The Intel Manual says the implementation should be

 DEST[63:0] ← DEST[31:0] ∗ SRC[31:0];
 DEST[127:64] ← DEST[95:64] ∗ SRC[95:64];

McSema, on the other hand, performs only the low-quadword multiplication, DEST[63:0] ← DEST[31:0] ∗ SRC[31:0];
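
For reference, the two assignments from the manual can be sketched in Python by treating each 128-bit register as an integer. The helper names are hypothetical and the sketch models only the manual's pseudocode, not what the lifter emits.

```python
MASK32 = 0xFFFFFFFF
MASK64 = (1 << 64) - 1

def dword(value, index):
    """Return the 32-bit lane `index` (0 = bits 31:0) of a 128-bit value."""
    return (value >> (32 * index)) & MASK32

def pmuludq_manual(dest, src):
    """Multiply lanes 0 and 2 as unsigned 32-bit values into two 64-bit results."""
    low = dword(dest, 0) * dword(src, 0)    # DEST[63:0]
    high = dword(dest, 2) * dword(src, 2)   # DEST[127:64]
    return (high << 64) | low

# Lanes 1 and 3 hold junk to show that they are ignored.
dest = (0xAAAA << 96) | (7 << 64) | (0xBBBB << 32) | 3
src = (0xCCCC << 96) | (5 << 64) | (0xDDDD << 32) | 2

result = pmuludq_manual(dest, src)
low_q = result & MASK64
high_q = result >> 64

print(low_q, high_q)
```

A lifted implementation that only computes the low product would have to match `high_q` as well to pass the equivalence check.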

R3 and R5
As per the Manual, the semantics should be

TEMP ← DEST

IF accumulator = TEMP
    THEN
        ZF ← 1;
        DEST ← SRC;
    ELSE
        ZF ← 0;
        accumulator ← TEMP;
        DEST ← TEMP;
FI;

For cmpxchgl %ecx, %ebx
However, McSema compares the entire 64-bit DEST (TEMP in the pseudocode above) against the accumulator Concat(32'0, RAX[31:0]), instead of comparing only the low 32 bits of each.

For cmpxchgb %ah, %al,
Here the accumulator (al) is also DEST, so the comparison always succeeds and control should take the THEN branch, which must lead to DEST (al) ← SRC (ah). However, McSema keeps the lower 8 bits of RAX unchanged.
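
The comparison-width problem for cmpxchgl can be illustrated with a short Python sketch. The function names and register values are made up for the example; the "reported" variant is only my reading of the behaviour described above.

```python
MASK32 = 0xFFFFFFFF

def zf_manual(rax, rbx):
    """ZF for a 32-bit operand: compare eax with ebx only."""
    return int((rax & MASK32) == (rbx & MASK32))

def zf_reported(rax, rbx):
    """ZF as reported: all 64 bits of rbx against zero-extended eax."""
    return int(rbx == (rax & MASK32))

# Low halves are equal, upper halves are arbitrary non-zero values.
rax = 0x1111111100000042
rbx = 0x2222222200000042

zf_expected = zf_manual(rax, rbx)
zf_observed = zf_reported(rax, rbx)

print(zf_expected, zf_observed)
```

Whenever the upper 32 bits of %rbx are non-zero, the 64-bit comparison can never succeed, so the wrong branch of the pseudocode is taken.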

Please note that all the bugs are double-checked by looking into the lifted IR that McSema generated for these cases. We hope that this information might be useful to you.
Let me know your opinion.


pgoodman · 2019-12-11T20:07:06Z

@sdasgup3 Can you verify if the fixes are acceptable?

sdasgup3 · 2019-12-12T03:21:40Z

Sure, I will.

sdasgup3 · 2019-12-14T07:26:35Z

Thanks, @kumarak & @pgoodman for the fixes.

Most of the semantics got fixed except for the following. Also, I have attached the artifacts against which I am comparing.

cmpxchgl %ecx, %ebx
In the event of %ebx != %eax, the accumulator (%eax) should be updated with dest. But the higher 32 bits of %rax need to be zeroed out, which seems to be missing.

The X86 semantics generates the following summary for the %rax register (please refer to artifacts/cmpxchgl_r32_r32/Output/test-z3.py)

xvar = (V_R == z3.If((z3.Extract(31, 0, VX_RAX) == z3.Extract(31, 0, VX_RBX)), VX_RAX, z3.Concat(z3.BitVecVal(0, 32), z3.Extract(31, 0, VX_RBX))))

and symbolically executing the LLVM IR that remill generates gives the following summary (with embedded comments to highlight the potential discrepancy)

lvar = z3.And(

        # Case: Accumulator != dest
	z3.Implies(
		(z3.If((z3.Concat(z3.Extract(31, 24, VL_RBX),z3.Extract(23, 16, VL_RBX),z3.Extract(15, 8, VL_RBX),z3.Extract(7, 0, VL_RBX),) == z3.Concat(z3.Extract(31, 24, VL_RAX),z3.Extract(23, 16, VL_RAX),z3.Extract(15, 8, VL_RAX),z3.Extract(7, 0, VL_RAX),)), z3.BitVecVal(1, 8), z3.BitVecVal(0, 8)) == z3.BitVecVal(0, 8)), 

		V_R == z3.Concat(z3.Extract(63, 56, VL_RAX), z3.Extract(55, 48, VL_RAX), z3.Extract(47, 40, VL_RAX), z3.Extract(39, 32, VL_RAX), # <-- Most significant 4 bytes must be zeroed.
                  z3.Extract(31, 24, z3.Concat(z3.Extract(31, 24, VL_RBX),z3.Extract(23, 16, VL_RBX),z3.Extract(15, 8, VL_RBX),z3.Extract(7, 0, VL_RBX),)),
                  z3.Extract(23, 16, z3.Concat(z3.Extract(31, 24, VL_RBX),z3.Extract(23, 16, VL_RBX),z3.Extract(15, 8, VL_RBX),z3.Extract(7, 0, VL_RBX),)),
                  z3.Extract(15, 8, z3.Concat(z3.Extract(31, 24, VL_RBX),z3.Extract(23, 16, VL_RBX),z3.Extract(15, 8, VL_RBX),z3.Extract(7, 0, VL_RBX),)),
                  z3.Extract(7, 0, z3.Concat(z3.Extract(31, 24, VL_RBX),z3.Extract(23, 16, VL_RBX),z3.Extract(15, 8, VL_RBX),z3.Extract(7, 0, VL_RBX),)))
	), 
  ............... omitted for brevity .....................
)

You may reproduce the error using

cd artifacts/cmpxchgl_r32_r32/
make provez3

cmpxchgb %ah, %al
There is a discrepancy in the update of the rax register.
xaddb %ah, %ah & xaddw %ax, %ax
The McSema-generated files do not seem to have been updated since the bugs were filed. (The respective ll files are attached.)

artifacts.tar.gz


pgoodman · 2019-12-19T19:19:32Z

@sdasgup3 does the issue_376 branch resolve these issues?

sdasgup3 · 2019-12-20T09:33:34Z

Hi @pgoodman
Thanks for the update. Here is my test outcome.

cmpxchgl %ecx, %ebx
The case %ebx != %eax is now handled correctly.
However, when %ebx == %eax, the accumulator (%eax), along with its upper 32 bits, should remain unaffected, whereas the current implementation zero-extends in both cases. For example, the current implementation says

rax = Concat(32'0, Extract(31, 0, %rax)) if %ebx == %eax // zero extension not needed
         Concat(32'0, Extract(31, 0, %rbx)), otherwise
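
As a plain-integer sketch of the same point, the following Python compares the %rax update I expect (it mirrors the xvar summary from my earlier comment) with a variant that zero-extends in both branches. The function names and the sample values are hypothetical.

```python
MASK32 = 0xFFFFFFFF

def rax_expected(rax, rbx):
    """Equal low halves: rax untouched. Otherwise: zero-extended ebx."""
    if (rax & MASK32) == (rbx & MASK32):
        return rax
    return rbx & MASK32

def rax_zero_extend_always(rax, rbx):
    """Variant that zero-extends the result in both branches."""
    if (rax & MASK32) == (rbx & MASK32):
        return rax & MASK32
    return rbx & MASK32

rbx = 0x0000000000000042
rax_equal = 0xDEADBEEF00000042      # eax == ebx
rax_not_equal = 0xDEADBEEF00000001  # eax != ebx

equal_expected = rax_expected(rax_equal, rbx)
equal_variant = rax_zero_extend_always(rax_equal, rbx)
not_equal_agree = (rax_expected(rax_not_equal, rbx)
                   == rax_zero_extend_always(rax_not_equal, rbx))

print(hex(equal_expected), hex(equal_variant), not_equal_agree)
```

The two versions agree when %ebx != %eax and differ only in the equal case, where the upper 32 bits of %rax are lost by the second variant.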

cmpxchgb %ah, %al
The following explains the difference.

// Current implementation
lvar = (V_R == z3.Concat(
      z3.Extract(63, 56, VL_RAX),
      z3.Extract(55, 48, VL_RAX),
      z3.Extract(47, 40, VL_RAX),
      z3.Extract(39, 32, VL_RAX),
      z3.Extract(31, 24, VL_RAX),
      z3.Extract(23, 16, VL_RAX),
      z3.Extract(15, 8, VL_RAX),
      z3.Extract(7, 0, z3.Extract(7, 0, VL_RAX))))

// Seemingly correct implementation
xvar = (V_R == z3.Concat(z3.Extract(63, 8, VX_RAX), z3.Extract(15, 8, VX_RAX)))
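
The same two summaries can be read as plain integer operations. This is only a sketch with hypothetical function names and an arbitrary input value, assuming rax is held as a 64-bit Python integer.

```python
MASK64 = (1 << 64) - 1

def rax_unchanged(rax):
    """Current summary: every byte of rax is copied through as-is."""
    return rax & MASK64

def rax_expected(rax):
    """Expected summary: bits 63:8 kept, low byte replaced by ah (bits 15:8)."""
    ah = (rax >> 8) & 0xFF
    return (rax & ~0xFF & MASK64) | ah

rax = 0x1122334455667788
current = rax_unchanged(rax)
expected = rax_expected(rax)

print(hex(current), hex(expected))
```

The results differ whenever ah and al hold different values, which is the counterexample the solver reports.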

Let me know if I need to provide any other information.

* New x86 instructions
* Add some isels
* Fixes Issue #376
* Fixes Issue #433. Thanks @adahsuzixin for the semantics and tests
* Fixes Issue #374
* Minor fix to the semantics for VINSERTF128, it should only look at the low bit of imm8
* Minor fixes for sparc isel naming

sdasgup3 changed the title ~~Bugs: Instruction Semantics~~ Discrepancies in Instruction Semantics Nov 5, 2019

sdasgup3 changed the title ~~Discrepancies in Instruction Semantics~~ Discrepancies in X86-64 Instruction Semantics Nov 5, 2019

sdasgup3 changed the title ~~Discrepancies in X86-64 Instruction Semantics~~ A Few discrepancies in X86-64 Instruction Semantics Nov 5, 2019

sdasgup3 changed the title ~~A Few discrepancies in X86-64 Instruction Semantics~~ A few discrepancies in X86-64 Instruction Semantics Nov 5, 2019

pgoodman assigned Aiethel Nov 13, 2019

pgoodman transferred this issue from lifting-bits/mcsema Nov 13, 2019

pgoodman assigned kumarak Nov 13, 2019

pgoodman added bug x86 Related to x86/x86-64/AMD64 lifting support labels Nov 13, 2019

pgoodman pushed a commit that referenced this issue Dec 19, 2019

Zero-extend destination register

9f172ac

XREF #376

pgoodman mentioned this issue Dec 19, 2019

Zero-extend destination register #389

Closed

sdasgup3 mentioned this issue Jan 6, 2020

Validate Mcsema-decompiled LLVM ir for single instructions sdasgup3/validating-binary-decompilation#12

Closed

3 tasks

pgoodman added a commit that referenced this issue Nov 5, 2020

Fixes Issue #376

7347f44

pgoodman closed this as completed Nov 5, 2020
