TY - JOUR
T1 - Mortality prediction enhancement in end-stage renal disease
T2 - A machine learning approach
AU - Macias, Edwar
AU - Morell, Antoni
AU - Serrano, Javier
AU - Vicario, Jose Lopez
AU - Ibeas, Jose
N1 - Funding Information:
This work is supported by the Spanish Government under Project TEC2017-84321-C4-4-R co-funded with European Union ERDF funds and also by the Catalan Government under Project 2017 SGR 1670.
Funding Information:
This work is supported by the Spanish Government under Project TEC2017-84321-C4-4-R co-funded with European Union ERDF funds and also by the Catalan Government under Project 2017 SGR 1670 .
Publisher Copyright:
© 2020
PY - 2020
Y1 - 2020
N2 - In this work, we propose to combine massive variables collected during the evolution of patients in end-stage renal disease (ESRD), along with machine learning techniques to improve mortality prediction in ESRD. This work was carried out with a retrospective cohort of 261 patients, their evolution from diagnoses, laboratory tests, and variables recorded during haemodialysis sessions was combined. Random forest (RF) was used to explore the inference of the variables and define a base performance for long short-term memory (LSTM) recurrent neural networks. Then, LSTMs were trained with several groups of variables chosen by expert staff, the ones found by RF and all the available ones. The best performance was obtained using all the variables, but the ones found by RF had better predictive capacity than those chosen with expert knowledge. Integrating the three sources of information supposes an improvement in more than 4% in the area under the receiver operating characteristic curve. The approach is sufficiently robust to predict mortality at different time ranges. The massive integration of variables from patients in ESRD, together with the use of LSMTs, supposes an exceptional improvement in the predictive models of mortality. In conclusion, the machine learning approach can lead to a change in the paradigm in the analysis of predictive factors in mortality in ESRD.
AB - In this work, we propose to combine massive variables collected during the evolution of patients in end-stage renal disease (ESRD), along with machine learning techniques to improve mortality prediction in ESRD. This work was carried out with a retrospective cohort of 261 patients, their evolution from diagnoses, laboratory tests, and variables recorded during haemodialysis sessions was combined. Random forest (RF) was used to explore the inference of the variables and define a base performance for long short-term memory (LSTM) recurrent neural networks. Then, LSTMs were trained with several groups of variables chosen by expert staff, the ones found by RF and all the available ones. The best performance was obtained using all the variables, but the ones found by RF had better predictive capacity than those chosen with expert knowledge. Integrating the three sources of information supposes an improvement in more than 4% in the area under the receiver operating characteristic curve. The approach is sufficiently robust to predict mortality at different time ranges. The massive integration of variables from patients in ESRD, together with the use of LSMTs, supposes an exceptional improvement in the predictive models of mortality. In conclusion, the machine learning approach can lead to a change in the paradigm in the analysis of predictive factors in mortality in ESRD.
KW - End-stage renal disease
KW - LSTM
KW - Machine learning
KW - Mortality prediction
KW - Random forest
UR - http://www.scopus.com/inward/record.url?scp=85085750219&partnerID=8YFLogxK
U2 - 10.1016/j.imu.2020.100351
DO - 10.1016/j.imu.2020.100351
M3 - Article
AN - SCOPUS:85085750219
SN - 2352-9148
VL - 19
JO - Informatics in Medicine Unlocked
JF - Informatics in Medicine Unlocked
M1 - 100351
ER -