000129700 001__ 129700
000129700 005__ 20180913054952.0
000129700 0247_ $$2doi$$a10.1016/j.jcss.2007.07.003
000129700 02470 $$2ISI$$a000258791600004
000129700 037__ $$aARTICLE
000129700 245__ $$aA Simulator for Adaptive Parallel Applications
000129700 269__ $$a2007
000129700 260__ $$c2007
000129700 336__ $$aJournal Articles
000129700 520__ $$aDynamically allocating computing nodes to parallel applications is a promising technique for improving the utilization of cluster resources. Detailed simulations can help identify allocation strategies and problem decomposition parameters that increase the efficiency of parallel applications. We describe a simulation framework supporting dynamic node allocation which, given a simple cluster model, predicts the running time of parallel applications taking CPU and network sharing into account. Simulations can be carried out without needing to modify the application code. Thanks to partial direct execution, simulation times and memory requirements are reduced. In partial direct execution simulations, the application's parallel behavior is retrieved via direct execution, and the duration of individual operations is obtained from a performance prediction model or from prior measurements. Simulations may then vary cluster model parameters, operation durations and problem decomposition parameters to analyze their impact on the application performance and identify the limiting factors. We implemented the proposed techniques by adding direct execution simulation capabilities to the Dynamic Parallel Schedules parallelization framework. We introduce the concept of dynamic efficiency to express the resource utilization efficiency as a function of time. We verify the accuracy of our simulator by comparing the effective running time, respectively the dynamic efficiency, of parallel program executions with the running time, respectively the dynamic efficiency, predicted by the simulator under different parallelization and dynamic node allocation strategies.
000129700 6531_ $$aAdaptive 
000129700 6531_ $$aparallel 
000129700 6531_ $$aapplication 
000129700 6531_ $$asimulation
000129700 6531_ $$aDynamic
000129700 6531_ $$aefficiency
000129700 6531_ $$aSensitivity
000129700 6531_ $$aanalysis
000129700 6531_ $$aPartial
000129700 6531_ $$adirect
000129700 6531_ $$aexecution
000129700 700__ $$0240888$$aSchaeli, B.$$g114320
000129700 700__ $$0240940$$aGerlach, S.$$g101589
000129700 700__ $$0242314$$aHersch, R.D.$$g105390
000129700 773__ $$jdoi:10.1016/j.physletb.2003.10.071$$tJ. Computer System ScienceDirect
000129700 909C0 $$0252034$$pLSP$$xU10417
000129700 909CO $$ooai:infoscience.tind.io:129700$$pIC$$particle
000129700 937__ $$aLSP-ARTICLE-2008-008
000129700 973__ $$aEPFL$$rREVIEWED$$sPUBLISHED
000129700 980__ $$aARTICLE