TY - CHAP
T1 - Building representative and balanced datasets of OpenMP parallel regions
AU - Sorribes Gomis, Joan
AU - Cesar Galobardes, Eduardo
AU - Sikora, Anna Barbara
AU - Alcaraz, Jordi
AU - Sleder, Steven
AU - Tehrani, Ali
AU - Janesari, Ali
N1 - Publisher Copyright: © 2021 IEEE.
PY - 2021/3/1
Y1 - 2021/3/1
N2 - Incorporating machine learning into automatic performance analysis and tuning tools is a promising path to tackle the increasing heterogeneity of current HPC applications. However, this introduces the need for generating balanced and representative datasets of parallel applications' executions. This work proposes a methodology for building datasets of OpenMP parallel code region patterns. It allows for determining whether a given code region covers a unique part of the pattern input space not covered by the patterns already included in the dataset. The proposed methodology uses hardware performance counters to represent the execution of the region, which is referred to as the region signature for a given number of cores. Then, a complete representation of the region is built by joining the signatures for every thread configuration in the system. Next, correlation analysis is performed between this representation and the representations of all the patterns already in the training set. Finally, if this correlation is below a given threshold, the region is considered to cover a unique part of the pattern input space and is subsequently added to the dataset. To validate this methodology, an example dataset, obtained from well-known benchmarks, has been used to train a carefully designed neural network model to demonstrate that it is able to classify different patterns of OpenMP parallel regions.
AB - Incorporating machine learning into automatic performance analysis and tuning tools is a promising path to tackle the increasing heterogeneity of current HPC applications. However, this introduces the need for generating balanced and representative datasets of parallel applications' executions. This work proposes a methodology for building datasets of OpenMP parallel code region patterns. It allows for determining whether a given code region covers a unique part of the pattern input space not covered by the patterns already included in the dataset. The proposed methodology uses hardware performance counters to represent the execution of the region, which is referred to as the region signature for a given number of cores. Then, a complete representation of the region is built by joining the signatures for every thread configuration in the system. Next, correlation analysis is performed between this representation and the representations of all the patterns already in the training set. Finally, if this correlation is below a given threshold, the region is considered to cover a unique part of the pattern input space and is subsequently added to the dataset. To validate this methodology, an example dataset, obtained from well-known benchmarks, has been used to train a carefully designed neural network model to demonstrate that it is able to classify different patterns of OpenMP parallel regions.
KW - Hardware counters
KW - Machine learning
KW - OpenMP
UR - http://www.scopus.com/inward/record.url?scp=85105519022&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/f089ff58-4dba-3529-b9f0-401abb6df9ab/
U2 - 10.1109/pdp52278.2021.00019
DO - 10.1109/pdp52278.2021.00019
M3 - Chapter
AN - SCOPUS:85105519022
SN - 9781665414555
T3 - Proceedings - 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2021
SP - 67
EP - 74
BT - Proceedings - 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing, PDP 2021
PB - Institute of Electrical and Electronics Engineers Inc.
ER -