TY - JOUR
T1 - A guide for using deep learning for complex trait genomic prediction
AU - Pérez-Enciso, Miguel
AU - Zingaretti, Laura M.
PY - 2019/7/20
Y1 - 2019/7/20
N2 - © 2019 by the authors. Licensee MDPI, Basel, Switzerland. Deep learning (DL) has emerged as a powerful tool to make accurate predictions from complex data such as image, text, or video. However, its ability to predict phenotypic values from molecular data is less well studied. Here, we describe the theoretical foundations of DL and provide a generic code that can be easily modified to suit specific needs. DL comprises a wide variety of algorithms which depend on numerous hyperparameters. Careful optimization of hyperparameter values is critical to avoid overfitting. Among the DL architectures currently tested in genomic prediction, convolutional neural networks (CNNs) seem more promising than multilayer perceptrons (MLPs). A limitation of DL is in interpreting the results. This may not be relevant for genomic prediction in plant or animal breeding but can be critical when deciding the genetic risk to a disease. Although DL technologies are not ”plug-and-play”, they are easily implemented using Keras and TensorFlow public software. To illustrate the principles described here, we implemented a Keras-based code in GitHub.
AB - © 2019 by the authors. Licensee MDPI, Basel, Switzerland. Deep learning (DL) has emerged as a powerful tool to make accurate predictions from complex data such as image, text, or video. However, its ability to predict phenotypic values from molecular data is less well studied. Here, we describe the theoretical foundations of DL and provide a generic code that can be easily modified to suit specific needs. DL comprises a wide variety of algorithms which depend on numerous hyperparameters. Careful optimization of hyperparameter values is critical to avoid overfitting. Among the DL architectures currently tested in genomic prediction, convolutional neural networks (CNNs) seem more promising than multilayer perceptrons (MLPs). A limitation of DL is in interpreting the results. This may not be relevant for genomic prediction in plant or animal breeding but can be critical when deciding the genetic risk to a disease. Although DL technologies are not ”plug-and-play”, they are easily implemented using Keras and TensorFlow public software. To illustrate the principles described here, we implemented a Keras-based code in GitHub.
KW - Deep learning
KW - Genomic prediction
KW - Machine learning
KW - Genetic Code
KW - Models, Genetic
KW - Software
KW - Multifactorial Inheritance
KW - Deep Learning
UR - http://www.mendeley.com/research/guide-using-deep-learning-complex-trait-genomic-prediction
U2 - 10.3390/genes10070553
DO - 10.3390/genes10070553
M3 - Review article
C2 - 31330861
SN - 2073-4425
VL - 10
JO - Genes
JF - Genes
IS - 7
M1 - 553
ER -