TY - JOUR
T1 - Data preprocessing for ANN-based industrial time-series forecasting with imbalanced data
AU - Pisa, Ivan
AU - Santín, Ignacio
AU - Vicario, Jose Lopez
AU - Morell, Antoni
AU - Vilanova, Ramon
N1 - Funding Information:
This research is supported by the Catalan Government under Projects 2017 SGR 1202 and 2017 SGR 1670, by La Secretaria d’Universitats i Recerca del Departament d’Empresa i Coneixement de la Generalitat de Catalunya i del Fons Social Europeu under FI grant and also by the Spanish Government under Projects TEC2017-84321-C4-4-R and DPI2016-77271-R co-funded with European Union ERDF funds.
Publisher Copyright:
© 2019 IEEE
Copyright:
Copyright 2019 Elsevier B.V., All rights reserved.
PY - 2019/9
Y1 - 2019/9
N2 - The evolution of Industry towards the 4.0 paradigm has motivated the adoption of Artificial Neural Networks (ANNs) to deal with applications where predictive and maintenance tasks are performed. These tasks become difficult to carry out when rare events are present due to the imbalance of data. This is because training of ANN can be biased. Conventional techniques addressing this problem are mainly based on resampling-based approaches. However, these are not always feasible when dealing with time-series forecasting tasks in industrial scenarios. For that reason, this work proposes the application of data preprocessing techniques especially designed to face this scenario, a problem which has not been covered enough in the state-of-the-art. Considered techniques are applied over time-series data coming from Wastewater Treatment Plants (WWTPs). Our proposal significantly outperforms current strategies showing a 68% of improvement in terms of RMSE when rare events are addressed.
AB - The evolution of Industry towards the 4.0 paradigm has motivated the adoption of Artificial Neural Networks (ANNs) to deal with applications where predictive and maintenance tasks are performed. These tasks become difficult to carry out when rare events are present due to the imbalance of data. This is because training of ANN can be biased. Conventional techniques addressing this problem are mainly based on resampling-based approaches. However, these are not always feasible when dealing with time-series forecasting tasks in industrial scenarios. For that reason, this work proposes the application of data preprocessing techniques especially designed to face this scenario, a problem which has not been covered enough in the state-of-the-art. Considered techniques are applied over time-series data coming from Wastewater Treatment Plants (WWTPs). Our proposal significantly outperforms current strategies showing a 68% of improvement in terms of RMSE when rare events are addressed.
KW - Artificial Neural Network
KW - Data preprocessing
KW - Imbalanced data
KW - Rare events
UR - http://www.scopus.com/inward/record.url?scp=85075613456&partnerID=8YFLogxK
U2 - 10.23919/EUSIPCO.2019.8902682
DO - 10.23919/EUSIPCO.2019.8902682
M3 - Article
AN - SCOPUS:85075613456
SN - 2219-5491
JO - European Signal Processing Conference
JF - European Signal Processing Conference
ER -