IA-32 Intel® Architecture Optimization
4-10
The pack instructions always assume that the source operands are
signed numbers. The result in the destination register is always defined
by the pack instruction that performs the operation. For example, the
packssdw
instruction packs each of the two signed 32-bit values of the
two sources into four saturated 16-bit signed values in the destination
register. The
packuswb
instruction, on the other hand, packs each of the
four signed 16-bit values of the two sources into eight saturated eight-bit
unsigned values in the destination. A complete specification of the
MMX instruction set can be found in the
Intel Architecture MMX
Technology Programmer’s Reference Manual
, order number 243007.
Interleaved Pack without Saturation
Example 4-5 is similar to Example 4-4 except that the resulting words
are not saturated. In addition, in order to protect against overflow, only
the low order 16 bits of each doubleword are used in this operation.
Example 4-4
Interleaved Pack with Saturation
; Input:
MM0
signed source1 value
;
MM1
signed source2 value
; Output:
MM0
the first and third words contain the
;
signed-saturated doublewords from MM0,
;
the second and fourth words contain
;
signed-saturated doublewords from MM1
;
packssdw
MM0, MM0
; pack and sign saturate
packssdw
MM1, MM1
; pack and sign saturate
punpcklwd
MM0, MM1
; interleave the low-end 16-bit
; values of the operands
Summary of Contents for ARCHITECTURE IA-32
Page 1: ...IA 32 Intel Architecture Optimization Reference Manual Order Number 248966 013US April 2006...
Page 220: ...IA 32 Intel Architecture Optimization 3 40...
Page 434: ...IA 32 Intel Architecture Optimization 9 20...
Page 514: ...IA 32 Intel Architecture Optimization B 60...
Page 536: ...IA 32 Intel Architecture Optimization C 22...