Optimizing Cache Usage
6
6-31
Figure 6-5
Memory Access Latency and Execution With Prefetch
2 Load streams, 1 store stream
50
100
150
200
250
300
350
54 108 144 192 240 336 390
Computa tions pe r loop
E
ff
e
c
ti
v
e
l
oop
l
a
te
nc
y
0.00%
10.00%
20.00%
30.00%
40.00%
50.00%
60.00%
70.00%
80.00%
90.00%
100.00%
%
o
f B
u
s
U
tili
ze
d
16
32
64
128
none
% Bus Utilization
One load and one store stream
0
50
100
150
200
250
300
350
48 108 144 192 240 336 408
Computations per loop
Ef
fe
c
ti
v
e
l
oop
la
te
nc
y
0.00%
10.00%
20.00%
30.00%
40.00%
50.00%
60.00%
70.00%
80.00%
90.00%
100.00%
%
o
f B
u
s
U
tiliz
a
tio
n
16_por
32_por
64_por
128_por
None_por
% Bus Utilization
Summary of Contents for ARCHITECTURE IA-32
Page 1: ...IA 32 Intel Architecture Optimization Reference Manual Order Number 248966 013US April 2006...
Page 220: ...IA 32 Intel Architecture Optimization 3 40...
Page 434: ...IA 32 Intel Architecture Optimization 9 20...
Page 514: ...IA 32 Intel Architecture Optimization B 60...
Page 536: ...IA 32 Intel Architecture Optimization C 22...