TY - JOUR
T1 - Efficient mapping of genomic sequences to optimize multiple pairwise alignment in hybrid cluster platforms
AU - Montañola, Alberto
AU - Roig, Concepció
AU - Hernández, Porfidio
PY - 2014/1/1
Y1 - 2014/1/1
N2 - Multiple sequence alignment (MSA), used in biocomputing to study similarities between different genomic sequences, is known to require important memory and computation resources. Nowadays, researchers are aligning thousands of these sequences, creating new challenges in order to solve the problem using the available resources efficiently. Determining the efficient amount of resources to allocate is important to avoid waste of them, thus reducing the economical costs required in running for example a specific cloud instance. The pairwise alignment is the initial key step of the MSA problem, which will compute all pair alignments needed. We present a method to determine the optimal amount of memory and computation resources to allocate by the pairwise alignment, and we will validate it through a set of experimental results for different possible inputs. These allow us to determine the best parameters to configure the applications in order to use effectively the available resources of a given system.
AB - Multiple sequence alignment (MSA), used in biocomputing to study similarities between different genomic sequences, is known to require important memory and computation resources. Nowadays, researchers are aligning thousands of these sequences, creating new challenges in order to solve the problem using the available resources efficiently. Determining the efficient amount of resources to allocate is important to avoid waste of them, thus reducing the economical costs required in running for example a specific cloud instance. The pairwise alignment is the initial key step of the MSA problem, which will compute all pair alignments needed. We present a method to determine the optimal amount of memory and computation resources to allocate by the pairwise alignment, and we will validate it through a set of experimental results for different possible inputs. These allow us to determine the best parameters to configure the applications in order to use effectively the available resources of a given system.
U2 - 10.2390/biecoll-jib-2014-251
DO - 10.2390/biecoll-jib-2014-251
M3 - Article
SN - 1613-4516
VL - 11
SP - 251
JO - Journal of integrative bioinformatics
JF - Journal of integrative bioinformatics
IS - 3
ER -