xxi
Table 1-1
Pentium 4 and Intel Xeon Processor Cache Parameters .................. 1-20
Table 1-3
Cache Parameters of Pentium M, Intel
®
Core™ Solo and
Intel
®
Core™ Duo Processors .......................................................... 1-30
Table 1-2
Trigger Threshold and CPUID Signatures for IA-32
Processor Families ............................................................................ 1-30
Table 1-4
Family And Model Designations of Microarchitectures...................... 1-42
Table 1-5
Characteristics of Load and Store Operations
in Intel Core Duo Processors ............................................................ 1-43
Table 2-1
Coding Pitfalls Affecting Performance ................................................. 2-2
Table 2-2
Avoiding Partial Flag Register Stall ................................................... 2-76
Table 2-3
Avoiding Partial Register Stall When Packing Byte Values ............... 2-78
Table 2-4
Avoiding False LCP Delays with 0xF7 Group Instructions ................ 2-81
Table 2-5
Using REP STOSD with Arbitrary Count Size and
4-Byte-Aligned Destination................................................................ 2-85
Table 5-1
SoA Form of Representing Vertices Data ........................................... 5-7
Table 6-1
Software Prefetching Considerations into Strip-mining Code............ 6-39
Table 6-2
Relative Performance of Memory Copy Routines ............................. 6-52
Table 6-3
Deterministic Cache Parameters Leaf............................................... 6-54
Table 7-1
Properties of Synchronization Objects .............................................. 7-21
Table B-1
Pentium 4 Processor Performance Metrics .......................................B-18
Table B-2
Metrics That Utilize Replay Tagging Mechanism...............................B-47
Table B-3
Table 3 Metrics That Utilize the Front-end Tagging Mechanism ........B-48
Table B-4
Metrics That Utilize the Execution Tagging Mechanism ....................B-49
Table B-5
New Metrics for Pentium 4 Processor (Family 15, Model 3)..............B-50
Table B-6
Metrics That Support Qualification by Logical Processor and
Parallel Counting ...............................................................................B-51
Table B-7
Metrics That Are Independent of Logical Processors........................B-55
Table C-1
Streaming SIMD Extension 3 SIMD Floating-point Instructions ......... C-6
Table C-2
Streaming SIMD Extension 2 128-bit Integer Instructions.................. C-7
Table C-3
Streaming SIMD Extension 2 Double-precision Floating-point
Instructions ......................................................................................... C-9
Table C-4
Streaming SIMD Extension Single-precision Floating-point
Instructions ....................................................................................... C-12
Table C-6
MMX Technology 64-bit Instructions ................................................ C-14
Summary of Contents for ARCHITECTURE IA-32
Page 1: ...IA 32 Intel Architecture Optimization Reference Manual Order Number 248966 013US April 2006...
Page 220: ...IA 32 Intel Architecture Optimization 3 40...
Page 434: ...IA 32 Intel Architecture Optimization 9 20...
Page 514: ...IA 32 Intel Architecture Optimization B 60...
Page 536: ...IA 32 Intel Architecture Optimization C 22...