TY - JOUR
T1 - HeDPM: load balancing of linear pipeline applications on heterogeneous systems
AU - Moreno, Andreu
AU - Sikora, Anna
AU - César, Eduardo
AU - Sorribes, Joan
AU - Margalef, Tomàs
PY - 2017/9/1
Y1 - 2017/9/1
N2 - © 2017, The Author(s). This work presents a new algorithm, called Heterogeneous Dynamic Pipeline Mapping, that allows for dynamically improving the performance of pipeline applications running on heterogeneous systems. It is aimed at balancing the application load by determining the best replication (of slow stages) and gathering (of fast stages) combination taking into account processors computation and communication capacities. In addition, the algorithm has been designed with the requirement of keeping complexity low to allow its usage in a dynamic tuning tool. For this reason, it uses an analytical performance model of pipeline applications that addresses hardware heterogeneity and which depends on parameters that can be known in advance or measured at run-time. A wide experimentation is presented, including the comparison with the optimal brute force algorithm, a general comparison with the Binary Search Closest algorithm, and an application example with the Ferret pipeline included in the PARSEC benchmark suite. Results, matching those of the best existing algorithms, show significant performance improvements with lower complexity (O(N3), where N is the number of pipeline stages).
AB - © 2017, The Author(s). This work presents a new algorithm, called Heterogeneous Dynamic Pipeline Mapping, that allows for dynamically improving the performance of pipeline applications running on heterogeneous systems. It is aimed at balancing the application load by determining the best replication (of slow stages) and gathering (of fast stages) combination taking into account processors computation and communication capacities. In addition, the algorithm has been designed with the requirement of keeping complexity low to allow its usage in a dynamic tuning tool. For this reason, it uses an analytical performance model of pipeline applications that addresses hardware heterogeneity and which depends on parameters that can be known in advance or measured at run-time. A wide experimentation is presented, including the comparison with the optimal brute force algorithm, a general comparison with the Binary Search Closest algorithm, and an application example with the Ferret pipeline included in the PARSEC benchmark suite. Results, matching those of the best existing algorithms, show significant performance improvements with lower complexity (O(N3), where N is the number of pipeline stages).
KW - Heterogeneous systems
KW - Load balancing
KW - Performance
KW - Pipeline
U2 - 10.1007/s11227-017-1971-4
DO - 10.1007/s11227-017-1971-4
M3 - Article
SN - 0920-8542
VL - 73
SP - 3738
EP - 3760
JO - Journal of Supercomputing
JF - Journal of Supercomputing
IS - 9
ER -