IIUM Repository

Impact of varying datasets for prediction of COVID - 19 cases

A Mohamed Zaki, Zakarya and Hassan Abdalla Hashim, Aisha (2024) Impact of varying datasets for prediction of COVID - 19 cases. International Journal of Science and Research (IJSR), 13 (4). pp. 430-435. E-ISSN 2319-7064

[img]
Preview
PDF (Journal) - Published Version
Download (416kB) | Preview

Abstract

COVID - 19 has been identified as a global pandemic, and many experiments are applying various numerical models to anticipate the virus's likely growth under development. It is responsible for the emergence of the highly contagious illness. It is impacting millions of people throughout the globe. It has created a change in the research community's orientations for identification, analysis, and control via the application of different statistical and predictive modelling methodologies. These numerical models are examples of decision - making techniques that depend significantly on data mining and machine learning to create predictions based on historical data. In order to make smart judgments and create strong strategies, policymakers and medical authorities need reliable forecasting techniques. These studies are carried out on a variety of small scale datasets including a few hundreds to thousands of records. This study uses different sets of datasets consisting of COVID - 19 instances recorded on a daily basis in Iraq, together with socio - demographic and health related attributes for the region. The primary goal of this research is to see what is the impact of varying datasets for daily forecasting of COVID - 19 instances using deep learning forecasting tools. The predictive modeling for daily COVID - 19 infection cases involved using neural network architectures like enhanced hybrid model built using a Convolutional Neural Network and a Long Short - Term Memory network (EH - CNN - LSTM. Prior to the modeling, appropriate procedures were used to prepare the data and identify any seasonality, residuals, and trends. The model is trained and tested on various splits of the dataset. It is discovered that the higher the amount of training data, the better the predicted performance. Mean Absolute Percentage Error (MAPE), Mean Squared Logarithmic Error (MSLE), and Root Mean Squared Logarithmic Error (RMSLE) are used to evaluate the predictive performance.

Item Type: Article (Journal)
Uncontrolled Keywords: COVID - 19, Forecasting, Prediction, and Deep Learning
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7800 Electronics. Computer engineering. Computer hardware. Photoelectronic devices > TK7885 Computer engineering
Kulliyyahs/Centres/Divisions/Institutes (Can select more than one option. Press CONTROL button): Kulliyyah of Engineering
Kulliyyah of Engineering > Department of Electrical and Computer Engineering
Depositing User: Prof. Dr. Aisha Hassan Abdalla Hashim
Date Deposited: 29 May 2024 10:37
Last Modified: 29 May 2024 10:55
URI: http://irep.iium.edu.my/id/eprint/112335

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year