xix
Figures
Typical SIMD Operations ................................................................... 1-3
SIMD Instruction Register Usage ...................................................... 1-4
The Intel NetBurst Microarchitecture ............................................... 1-10
Execution Units and Ports in the Out-Of-Order Core....................... 1-19
The Intel Pentium M Processor Microarchitecture ........................... 1-27
Hyper-Threading Technology on an SMP........................................ 1-35
Cache Line Split in Accessing Elements in a Array ......................... 2-31
Size and Alignment Restrictions in Store Forwarding...................... 2-34
Converting to Streaming SIMD Extensions Chart ............................. 3-9
Loop Blocking Access Pattern ......................................................... 3-36
Interleaved Pack with Saturation ....................................................... 4-9
PACKSSDW mm, mm/mm64 Instruction Example ............................ 4-9
Result of Non-Interleaved Unpack High in MM1.............................. 4-12
Result of Non-Interleaved Unpack Low in MM0 .............................. 4-12
pextrw Instruction ............................................................................ 4-14
pinsrw Instruction............................................................................. 4-15
pmovmskb Instruction Example....................................................... 4-17
pshuf Instruction Example ............................................................... 4-18
PSADBW Instruction Example ........................................................ 4-31
Homogeneous Operation on Parallel Data Elements ........................ 5-5
Dot Product Operation ....................................................................... 5-8
Horizontal Add Using movhlps/movlhps .......................................... 5-19
Asymmetric Arithmetic Operation of the SSE3 Instruction .............. 5-23
Effective Latency Reduction as a Function of Access Stride........... 6-22
Summary of Contents for ARCHITECTURE IA-32
Page 1: ...IA 32 Intel Architecture Optimization Reference Manual Order Number 248966 013US April 2006...
Page 220: ...IA 32 Intel Architecture Optimization 3 40...
Page 434: ...IA 32 Intel Architecture Optimization 9 20...
Page 514: ...IA 32 Intel Architecture Optimization B 60...
Page 536: ...IA 32 Intel Architecture Optimization C 22...