A Primer on Hardware Prefetching

Falsafi, Babak; Wenisch, Thomas F.

doi:10.2200/S00581ED1V01Y201405CAC028

book/monograph

A Primer on Hardware Prefetching

Falsafi, Babak

•

Wenisch, Thomas F.

Martonosi, Margaret

Since the 1970’s, microprocessor-based digital platforms have been riding Moore’s law, allowing for doubling of density for the same area roughly every two years. However, whereas microprocessor fabrication has focused on increasing instruction execution rate, memory fabrication technologies have focused primarily on an increase in capacity with negligible increase in speed. This divergent trend in performance between the processors and memory has led to a phenomenon referred to as the “Memory Wall.” To overcome the memory wall, designers have resorted to a hierarchy of cache memory levels, which rely on the principal of memory access locality to reduce the observed memory access time and the performance gap between processors and memory. Unfortunately, important workload classes exhibit adverse memory access patterns that baffle the simple policies built into modern cache hierarchies to move instructions and data across cache levels. As such, processors often spend much time idling upon a demand fetch of memory blocks that miss in higher cache levels. Prefetching—predicting future memory accesses and issuing requests for the corresponding memory blocks in advance of explicit accesses—is an effective approach to hide memory access latency. There have been a myriad of proposed prefetching techniques, and nearly every modern processor includes some hardware prefetching mechanisms targeting simple and regular memory access patterns. This primer offers an overview of the various classes of hardware prefetchers for instructions and data proposed in the research literature, and presents examples of techniques in- corporated into modern microprocessors.

Type

book/monograph

ISBN

978-1-608459-52-0

DOI

10.2200/S00581ED1V01Y201405CAC028

Authors

Falsafi, Babak

•

Wenisch, Thomas F.

Editors

Martonosi, Margaret

Publication date

2014

Publisher

Morgan & Claypool

Subjects

hardware prefetching

next-line prefetching...

branch-directed prefe...

discontinuity prefetc...

stride prefetching

address-correlated pr...

Markov prefetcher

global history buffer...

temporal memory strea...

spatial memory stream...

execution-based prefe...

EPFL units

PARSA

Available on Infoscience

June 9, 2014

Use this identifier to reference this record

https://infoscience.epfl.ch/handle/20.500.14299/104074