TY - JOUR
T1 - Estimated Covid-19 burden in Spain
T2 - ARCH underreported non-stationary time series
AU - Fernández-Fontelo, Amanda
AU - Cabaña Nigro, Alejandra
AU - Puig, Pedro
AU - Arratia, Argimiro
N1 - Publisher Copyright:
© 2023, The Author(s).
PY - 2023/3/28
Y1 - 2023/3/28
N2 - The problem of dealing with misreported data is very common in a wide range of contexts for different reasons. The current situation caused by the Covid-19 worldwide pandemic is a clear example, where the data provided by official sources were not always reliable due to data collection issues and to the high proportion of asymptomatic cases. In this work, a flexible framework is proposed, with the objective of quantifying the severity of misreporting in a time series and reconstructing the most likely evolution of the process. The performance of Bayesian Synthetic Likelihood to estimate the parameters of a model based on AutoRegressive Conditional Heteroskedastic time series capable of dealing with misreported information and to reconstruct the most likely evolution of the phenomenon is assessed through a comprehensive simulation study and illustrated by reconstructing the weekly Covid-19 incidence in each Spanish Autonomous Community. Only around 51% of the Covid-19 cases in the period 2020/02/23-2022/02/27 were reported in Spain, showing relevant differences in the severity of underreporting across the regions. The proposed methodology provides public health decision-makers with a valuable tool in order to improve the assessment of a disease evolution under different scenarios. The online version contains supplementary material available at 10.1186/s12874-023-01894-9.
AB - The problem of dealing with misreported data is very common in a wide range of contexts for different reasons. The current situation caused by the Covid-19 worldwide pandemic is a clear example, where the data provided by official sources were not always reliable due to data collection issues and to the high proportion of asymptomatic cases. In this work, a flexible framework is proposed, with the objective of quantifying the severity of misreporting in a time series and reconstructing the most likely evolution of the process. The performance of Bayesian Synthetic Likelihood to estimate the parameters of a model based on AutoRegressive Conditional Heteroskedastic time series capable of dealing with misreported information and to reconstruct the most likely evolution of the phenomenon is assessed through a comprehensive simulation study and illustrated by reconstructing the weekly Covid-19 incidence in each Spanish Autonomous Community. Only around 51% of the Covid-19 cases in the period 2020/02/23-2022/02/27 were reported in Spain, showing relevant differences in the severity of underreporting across the regions. The proposed methodology provides public health decision-makers with a valuable tool in order to improve the assessment of a disease evolution under different scenarios. The online version contains supplementary material available at 10.1186/s12874-023-01894-9.
KW - Continuous time series
KW - Mixture distributions
KW - Under-reported data
KW - ARCH models
KW - Infectious diseases
KW - Covid-19
KW - Bayesian synthetic likelihood
UR - http://www.scopus.com/inward/record.url?scp=85151114872&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/4d737fb3-fe22-3e18-b8fc-27c6f1f5f026/
U2 - 10.1186/s12874-023-01894-9
DO - 10.1186/s12874-023-01894-9
M3 - Article
C2 - 36977977
SN - 1471-2288
VL - 23
JO - BMC Medical Research Methodology
JF - BMC Medical Research Methodology
IS - 1
M1 - 75
ER -