The Memory Mountain (Figure 6.42, p. 514)
[Figure: surface plot of read throughput (MB/s, 0 to 1200) against stride (words, s1 to s15) and working set size (bytes, 2k to 8m).]
• Machine: Pentium III Xeon, 550 MHz
– 16 KB on-chip L1 d-cache
– 16 KB on-chip L1 i-cache
– 512 KB off-chip unified L2 cache
• Ridges of temporal locality: L1, L2, mem
• Slopes of spatial locality
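Each point on the mountain is one read-throughput measurement for a given working set size and stride. A minimal sketch of such a measurement is shown here, assuming a POSIX system with `clock_gettime`; the names `sum_strided`, `reads_performed`, and `read_throughput_mbs` are hypothetical, and a single timed pass like this is noisy, so it illustrates the idea rather than reproducing the benchmark behind the figure.

```c
#define _POSIX_C_SOURCE 199309L
#include <stddef.h>
#include <time.h>

/* Sum every `stride`-th element of the first `elems` ints in `data`.
   Assumes stride >= 1. Returning the sum keeps the loads from being
   optimized away. */
long sum_strided(const int *data, size_t elems, size_t stride)
{
    long sum = 0;
    for (size_t i = 0; i < elems; i += stride)
        sum += data[i];
    return sum;
}

/* Number of elements the loop above actually reads. */
size_t reads_performed(size_t elems, size_t stride)
{
    return (elems + stride - 1) / stride;
}

/* Read throughput in MB/s for one (working set size, stride) point.
   Hypothetical helper: one warm-up pass, then one timed pass. */
double read_throughput_mbs(const int *data, size_t elems, size_t stride)
{
    volatile long sink = sum_strided(data, elems, stride); /* warm the caches */
    struct timespec t0, t1;

    clock_gettime(CLOCK_MONOTONIC, &t0);
    sink = sum_strided(data, elems, stride);
    clock_gettime(CLOCK_MONOTONIC, &t1);
    (void)sink;

    double secs = (double)(t1.tv_sec - t0.tv_sec)
                + (double)(t1.tv_nsec - t0.tv_nsec) / 1e9;
    if (secs <= 0.0)
        return 0.0; /* timer too coarse for this working set */

    double bytes = (double)reads_performed(elems, stride) * sizeof(int);
    return bytes / secs / 1e6;
}
```

Calling `read_throughput_mbs` over a grid of working set sizes and strides would yield the kind of surface plotted in the figure; fixing `stride` at 1 and varying only the size gives a single slice through it.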
Ridges of temporal locality
• Slice through the memory mountain with stride=1
– illuminates the read throughputs of the different caches and of main memory
• Ridge: the crest of a mountain
Ridges of temporal locality (Figure 6.43, p. 515)
[Figure: read throughput (MB/s, 0 to 1200) against working set size (bytes, 8m down to 1k), with three regions marked from left to right: main memory region, L2 cache region, L1 cache region.]
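The three regions can be read as a rough rule: a working set that fits in a cache is served at that cache's throughput. The sketch below encodes that rule using the 16 KB L1 d-cache and 512 KB L2 cache sizes given for the Pentium III Xeon. The names `region_for` and `enum region` are hypothetical, and the hard thresholds are a simplifying assumption; a measured curve may show more gradual transitions near the cache sizes.

```c
#include <stddef.h>

/* Hypothetical labels for the three regions marked in the figure. */
enum region { L1_REGION, L2_REGION, MAIN_MEMORY_REGION };

/* Cache sizes taken from the machine description on the slide. */
#define L1_DCACHE_BYTES (16u * 1024u)
#define L2_CACHE_BYTES  (512u * 1024u)

/* Assumed rule: the smallest cache that holds the whole working set
   determines the region. */
enum region region_for(size_t working_set_bytes)
{
    if (working_set_bytes <= L1_DCACHE_BYTES)
        return L1_REGION;
    if (working_set_bytes <= L2_CACHE_BYTES)
        return L2_REGION;
    return MAIN_MEMORY_REGION;
}
```

For example, under this rule a 64 KB working set falls in the L2 cache region, while an 8 MB working set falls in the main memory region.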