VCMPPH—Compare Packed FP16 Values

Opcode/Instruction Op/En 64/32 bit Mode Support CPUID Feature Flag Description
EVEX.128.NP.0F3A.W0 C2 /r /ib VCMPPH k1{k2}, xmm2, xmm3/m128/m16bcst, imm8 A V/V AVX512-FP16 AVX512VL Compare packed FP16 values in xmm3/m128/m16bcst and xmm2 using bits 4:0 of imm8 as a comparison predicate subject to writemask k2, and store the result in mask register k1.
EVEX.256.NP.0F3A.W0 C2 /r /ib VCMPPH k1{k2}, ymm2, ymm3/m256/m16bcst, imm8 A V/V AVX512-FP16 AVX512VL Compare packed FP16 values in ymm3/m256/m16bcst and ymm2 using bits 4:0 of imm8 as a comparison predicate subject to writemask k2, and store the result in mask register k1.
EVEX.512.NP.0F3A.W0 C2 /r /ib VCMPPH k1{k2}, zmm2, zmm3/m512/m16bcst {sae}, imm8 A V/V AVX512-FP16 Compare packed FP16 values in zmm3/m512/m16bcst and zmm2 using bits 4:0 of imm8 as a comparison predicate subject to writemask k2, and store the result in mask register k1.

Instruction Operand Encoding

Op/En Tuple Operand 1 Operand 2 Operand 3 Operand 4
A Full ModRM:reg (w) VEX.vvvv (r) ModRM:r/m (r) imm8 (r)

Description

This instruction compares packed FP16 values from source operands and stores the result in the destination mask operand. The comparison predicate operand (immediate byte bits 4:0) specifies the type of comparison performed on each of the pairs of packed values. The destination elements are updated according to the writemask.

Operation

CASE (imm8 & 0x1F) OF

0: CMP_OPERATOR := EQ_OQ;

1: CMP_OPERATOR := LT_OS;

2: CMP_OPERATOR := LE_OS;

3: CMP_OPERATOR := UNORD_Q;

4: CMP_OPERATOR := NEQ_UQ;

5: CMP_OPERATOR := NLT_US;

6: CMP_OPERATOR := NLE_US;

7: CMP_OPERATOR := ORD_Q;

8: CMP_OPERATOR := EQ_UQ;

9: CMP_OPERATOR := NGE_US;

10: CMP_OPERATOR := NGT_US;

11: CMP_OPERATOR := FALSE_OQ;

12: CMP_OPERATOR := NEQ_OQ;

13: CMP_OPERATOR := GE_OS;

14: CMP_OPERATOR := GT_OS;

15: CMP_OPERATOR := TRUE_UQ;

16: CMP_OPERATOR := EQ_OS;

17: CMP_OPERATOR := LT_OQ;

18: CMP_OPERATOR := LE_OQ;

19: CMP_OPERATOR := UNORD_S;

20: CMP_OPERATOR := NEQ_US;

21: CMP_OPERATOR := NLT_UQ;

22: CMP_OPERATOR := NLE_UQ;

23: CMP_OPERATOR := ORD_S;

24: CMP_OPERATOR := EQ_US;

25: CMP_OPERATOR := NGE_UQ;

26: CMP_OPERATOR := NGT_UQ;

27: CMP_OPERATOR := FALSE_OS;

28: CMP_OPERATOR := NEQ_OS;

29: CMP_OPERATOR := GE_OQ;

30: CMP_OPERATOR := GT_OQ;

31: CMP_OPERATOR := TRUE_US;

ESAC

VCMPPH (EVEX Encoded Versions)

VL = 128, 256 or 512

KL := VL/16

FOR j := 0 TO KL-1:

IF k2[j] OR *no writemask*:

IF EVEX.b = 1:

tsrc2 := SRC2.fp16[0]

ELSE:

tsrc2 := SRC2.fp16[j]

DEST.bit[j] := SRC1.fp16[j] CMP_OPERATOR tsrc2

ELSE

DEST.bit[j] := 0

DEST[MAXKL-1:KL] := 0

Intel C/C++ Compiler Intrinsic Equivalent

VCMPPH ___mmask8 _mm_cmp_ph_mask (__m128h a, __m128h b, const int imm8);

VCMPPH ___mmask8 _mm_mask_cmp_ph_mask (__mmask8 k1, __m128h a, __m128h b, const int imm8);

VCMPPH ___mmask16 _mm256_cmp_ph_mask (__m256h a, __m256h b, const int imm8);

VCMPPH ___mmask16 _mm256_mask_cmp_ph_mask (__mmask16 k1, __m256h a, __m256h b, const int imm8);

VCMPPH ___mmask32 _mm512_cmp_ph_mask (__m512h a, __m512h b, const int imm8);

VCMPPH ___mmask32 _mm512_mask_cmp_ph_mask (__mmask32 k1, __m512h a, __m512h b, const int imm8);

VCMPPH ___mmask32 _mm512_cmp_round_ph_mask (__m512h a, __m512h b, const int imm8, const int sae);

VCMPPH ___mmask32 _mm512_mask_cmp_round_ph_mask (__mmask32 k1, __m512h a, __m512h b, const int imm8, const int sae);

SIMD Floating-Point Exceptions

Invalid, Denormal.

Other Exceptions

EVEX-encoded instructions, see Table 2-46, “Type E2 Class Exception Conditions.”