VPBLENDD - Blend Packed Dwords

Добавил:

Upload Опубликованный материал нарушает ваши авторские права? Сообщите нам.

Вуз:

Национальный Технический Университет Харьковский Политехнический Институт

Предмет:

[НЕСОРТИРОВАННОЕ]

Файл:

Лаб2012 / 319433-011.pdf

Скачиваний:

Добавлен:

02.02.2015

Размер:

2.31 Mб

Скачать

☆

<<< < Предыдущая 36 37 38 39 40 41 42 43 44 45 46 4748 / 7348 49 50 51 52 53 54 55 56 57 58 59 60 > Следующая >>>

				INSTRUCTION SET REFERENCE
VPBLENDD - Blend Packed Dwords

Opcode/	Op/	64/32	CPUID	Description
Instruction	En	-bit	Feature
		Mode	Flag
VEX.NDS.128.66.0F3A.W0 02 /r ib	A	V/V	AVX2	Select dwords from xmm2
VPBLENDD xmm1, xmm2,				and xmm3/m128 from mask
xmm3/m128, imm8				specified in imm8 and store
				the values into xmm1.
VEX.NDS.256.66.0F3A.W0 02 /r ib	A	V/V	AVX2	Select dwords from ymm2
VPBLENDD ymm1, ymm2,				and ymm3/m256 from mask
ymm3/m256, imm8				specified in imm8 and store
				the values into ymm1.

Instruction Operand Encoding

Op/En	Operand 1	Operand 2	Operand 3	Operand 4
A	ModRM:reg (w)	VEX.vvvv	ModRM:r/m (r)	NA

Description

Dword elements from the source operand (second operand) are conditionally written to the destination operand (first operand) depending on bits in the immediate operand (third operand). The immediate bits (bits 7:0) form a mask that determines whether the corresponding word in the destination is copied from the source. If a bit in the mask, corresponding to a word, is “1", then the word is copied, else the word is unchanged.

VEX.128 encoded version: The second source operand can be an XMM register or a 128-bit memory location. The first source and destination operands are XMM registers. Bits (255:128) of the corresponding YMM register are zeroed.

VEX.256 encoded version: The first source operand is a YMM register. The second source operand is a YMM register or a 256-bit memory location. The destination operand is a YMM register.

Operation

VPBLENDD (VEX.256 encoded version)

IF (imm8[0] == 1) THEN DEST[31:0]  SRC2[31:0]

ELSE DEST[31:0]  SRC1[31:0]

IF (imm8[1] == 1) THEN DEST[63:32]  SRC2[63:32]

ELSE DEST[63:32]  SRC1[63:32]

IF (imm8[2] == 1) THEN DEST[95:64]  SRC2[95:64]

ELSE DEST[95:64]  SRC1[95:64]

IF (imm8[3] == 1) THEN DEST[127:96]  SRC2[127:96]

Ref. # 319433-011

5-227

INSTRUCTION SET REFERENCE

ELSE DEST[127:96]  SRC1[127:96]

IF (imm8[4] == 1) THEN DEST[159:128]  SRC2[159:128]

ELSE DEST[159:128]  SRC1[159:128]

IF (imm8[5] == 1) THEN DEST[191:160]  SRC2[191:160]

ELSE DEST[191:160]  SRC1[191:160]

IF (imm8[6] == 1) THEN DEST[223:192]  SRC2[223:192]

ELSE DEST[223:192]  SRC1[223:192]

IF (imm8[7] == 1) THEN DEST[255:224]  SRC2[255:224]

ELSE DEST[255:224]  SRC1[255:224]

VPBLENDD (VEX.128 encoded version)

IF (imm8[0] == 1) THEN DEST[31:0]  SRC2[31:0]

ELSE DEST[31:0]  SRC1[31:0]

IF (imm8[1] == 1) THEN DEST[63:32]  SRC2[63:32]

ELSE DEST[63:32]  SRC1[63:32]

IF (imm8[2] == 1) THEN DEST[95:64]  SRC2[95:64]

ELSE DEST[95:64]  SRC1[95:64]

IF (imm8[3] == 1) THEN DEST[127:96]  SRC2[127:96]

ELSE DEST[127:96]  SRC1[127:96]

DEST[VLMAX:128]  0

Intel C/C++ Compiler Intrinsic Equivalent

VPBLENDD __m128i _mm_blend_epi32 (__m128i v1, __m128i v2, const int mask)

VPBLENDD __m256i _mm256_blend_epi32 (__m256i v1, __m256i v2, const int mask)

SIMD Floating-Point Exceptions

None

Other Exceptions

See Exceptions Type 4; additionally

#UD	If VEX.W = 1.

5-228

Ref. # 319433-011

					INSTRUCTION SET REFERENCE
VPBROADCASTBroadcast Integer Data

Opcode/		Op/	64/32	CPUID	Description
Instruction		En	-bit	Feature
			Mode	Flag
VEX.128.66.0F38.W0 78 /r		A	V/V	AVX2	Broadcast a byte integer in the
VPBROADCASTB xmm1,					source operand to sixteen locations
xmm2/m8					in xmm1.
VEX.256.66.0F38.W0 78 /r		A	V/V	AVX2	Broadcast a byte integer in the
VPBROADCASTB ymm1,					source operand to thirty-two loca-
xmm2/m8					tions in ymm1.
VEX.128.66.0F38.W0 79 /r		A	V/V	AVX2	Broadcast a word integer in the
VPBROADCASTW xmm1,					source operand to eight locations in
xmm2/m16					xmm1.
VEX.256.66.0F38.W0 79 /r		A	V/V	AVX2	Broadcast a word integer in the
VPBROADCASTW ymm1,					source operand to sixteen locations
xmm2/m16					in ymm1.
VEX.128.66.0F38.W0 58 /r		A	V/V	AVX2	Broadcast a dword integer in the
VPBROADCASTD xmm1,					source operand to four locations in
xmm2/m32					xmm1.
VEX.256.66.0F38.W0 58 /r		A	V/V	AVX2	Broadcast a dword integer in the
VPBROADCASTD ymm1,					source operand to eight locations in
xmm2/m32					ymm1.
VEX.128.66.0F38.W0 59 /r		A	V/V	AVX2	Broadcast a qword element in mem
VPBROADCASTQ xmm1,					to two locations in xmm1.
xmm2/m64
VEX.256.66.0F38.W0 59 /r		A	V/V	AVX2	Broadcast a qword element in mem
VPBROADCASTQ ymm1,					to four locations in ymm1.
xmm2/m64

		Instruction Operand Encoding
Op/En	Operand 1		Operand 2		Operand 3	Operand 4
A	ModRM:reg (w)	ModRM:r/m (r)			NA	NA

Description

Load integer data from the source operand (second operand) and broadcast to all elements of the destination operand (first operand).

Ref. # 319433-011

5-229

INSTRUCTION SET REFERENCE

The destination operand is a YMM register. The source operand is 8-bit, 16-bit 32-bit, 64-bit memory location or the low 8-bit, 16-bit 32-bit, 64-bit data in an XMM register. VPBROADCASTB/D/W/Q also support XMM register as the source operand.

VPBROADCASTB/W/D/Q is supported in both 128-bit and 256-bit wide versions.

Note: In VEX-encoded versions, VEX.vvvv is reserved and must be 1111b otherwise instructions will #UD. Attempts to execute any VPBROADCAST* instruction with VEX.W = 1 will cause #UD.

m32 X0

DEST

Figure 5-7. VPBROADCASTD Operation (VEX.256 encoded version)

m32 X0

DEST

Figure 5-8. VPBROADCASTD Operation (128-bit version)

5-230

Ref. # 319433-011

INSTRUCTION SET REFERENCE

m64	X0

DEST

Operation

VPBROADCASTB (VEX.128 encoded version) temp  SRC[7:0]

FOR j  0 TO 15 DEST[7+j*8: j*8]  temp ENDFOR DEST[VLMAX:128]  0

VPBROADCASTB (VEX.256 encoded version) temp  SRC[7:0]

FOR j  0 TO 31 DEST[7+j*8: j*8]  temp ENDFOR

VPBROADCASTW (VEX.128 encoded version) temp  SRC[15:0]

FOR j  0 TO 7 DEST[15+j*16: j*16]  temp ENDFOR

DEST[VLMAX:128]  0

VPBROADCASTW (VEX.256 encoded version) temp  SRC[15:0]

FOR j  0 TO 15 DEST[15+j*16: j*16]  temp ENDFOR

Ref. # 319433-011

5-231

INSTRUCTION SET REFERENCE

VPBROADCASTD (128 bit version) temp  SRC[31:0]

FOR j  0 TO 3 DEST[31+j*32: j*32]  temp ENDFOR

DEST[VLMAX:128]  0

VPBROADCASTD (VEX.256 encoded version) temp  SRC[31:0]

FOR j  0 TO 7 DEST[31+j*32: j*32]  temp ENDFOR

VPBROADCASTQ (VEX.128 encoded version) temp  SRC[63:0]

DEST[63:0]  temp DEST[127:64]  temp DEST[VLMAX:128]  0

VPBROADCASTQ (VEX.256 encoded version) temp  SRC[63:0]

DEST[63:0]  temp DEST[127:64]  temp DEST[191:128]  temp DEST[255:192]  temp

Intel C/C++ Compiler Intrinsic Equivalent

VPBROADCASTB __m256i _mm256_broadcastb_epi8(__m128i ); VPBROADCASTW __m256i _mm256_broadcastw_epi16(__m128i ); VPBROADCASTD __m256i _mm256_broadcastd_epi32(__m128i ); VPBROADCASTQ __m256i _mm256_broadcastq_epi64(__m128i ); VPBROADCASTB __m128i _mm_broadcastb_epi8(__m128i ); VPBROADCASTW __m128i _mm_broadcastw_epi16(__m128i ); VPBROADCASTD __m128i _mm_broadcastd_epi32(__m128i ); VPBROADCASTQ __m128i _mm_broadcastq_epi64(__m128i );

SIMD Floating-Point Exceptions

None

5-232

Ref. # 319433-011

INSTRUCTION SET REFERENCE

Other Exceptions

See Exceptions Type 6; additionally

#UD	If VEX.W = 1.

Ref. # 319433-011

5-233

<<< < Предыдущая 36 37 38 39 40 41 42 43 44 45 46 4748 / 7348 49 50 51 52 53 54 55 56 57 58 59 60 > Следующая >>>

Соседние файлы в папке Лаб2012

#
02.02.20153.33 Mб6525366517.pdf
#
02.02.20152.52 Mб25253666.pdf
#
02.02.20152.7 Mб2725366617.pdf
#
02.02.20152.09 Mб31253667.pdf
#
02.02.20152.19 Mб2625366717.pdf
#
02.02.20152.31 Mб27319433-011.pdf
#
02.02.2015162.82 Кб20ЛР1 Виконання арифм операц.doc
#
02.02.201564 Кб21ЛР10 ММХ-розширення.doc
#
02.02.201567.07 Кб22ЛР11 SSE-розширення.doc
#
02.02.201590.62 Кб18ЛР12 Windows-застосування.doc
#
02.02.2015181.25 Кб19ЛР13 Досл_дження коду програм.doc