Application-specific protocols for user-level shared memory

Recent distributed shared memory (DSM) systems and proposed shared-memory machines have implemented some or all of their cache coherence protocols in software. One way to exploit the flexibility of this software is to tailor a coherence protocol to match an application's communication patterns and memory semantics. This paper presents evidence that this approach can lead to large performance improvements. It shows that application-specific protocols substantially improved the performance of three application programs- appbt, em3d, and barnes-over carefully tuned transparent shared memory implementations. The speed-ups were obtained on Blizzard, a fine-grained DSM system running on a 32-node Thinking Machines CM-5

Published in:
Proceedings of the 1994 ACM/IEEE conference on Supercomputing, 380-389
Presented at:
Supercomputing '94, Washington D.C., USA, November 14-18

 Record created 2009-04-06, last modified 2020-10-24

External link:
Download fulltext
Rate this document:

Rate this document:
(Not yet reviewed)