Shared-Memory Performance Profiling

This paper describes a new approach to finding performance bottlenecks in shared-memory parallel programs and its embodiment in the Paradyn Parallel Performance Tools running with the Blizzard fine-grain distributed shared memory system. This approach exploits the underlying system's cache coherence protocol to detect data sharing patterns that indicate potential performance bottlenecks and presents performance measurements in a data-centric manner. As a demonstration, Parodyn helped us improve the performance of a new shared-memory application program by a factor of four.


Publié dans:
Sixth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 240-251
Année
1997
Publisher:
ACM
Laboratoires:




 Notice créée le 2013-12-23, modifiée le 2019-09-16


Évaluer ce document:

Rate this document:
1
2
3
 
(Pas encore évalué)