IIUM Repository

Revolutionizing digit image recognition: pushing the limits with simple CNN and challenging image augmentation techniques on MNIST

Hulliyah, Khodijah and Awang Abu Bakar, Normi Sham and Aripiyanto, Saepul and Khairani, Dewi (2023) Revolutionizing digit image recognition: pushing the limits with simple CNN and challenging image augmentation techniques on MNIST. Journal of Applied Data Sciences, 4 (3). pp. 119-129. E-ISSN 2723-6471

[img] PDF (Journal) - Published Version
Restricted to Repository staff only

Download (750kB) | Request a copy
[img]
Preview
PDF (Scopus) - Supplemental Material
Download (162kB) | Preview

Abstract

This study aims to apply Convolutional Neural Networks (CNN) and image augmentation techniques in digit recognition using the MNIST dataset. We built a CNN model and experimented with various image augmentation techniques to improve digit recognition accuracy. The results showed that the use of CNN with image augmentation techniques was effective in improving digit recognition performance. In the data collection stage, we used the MNIST dataset consisting of images of handwritten digits as training and testing data. After building the CNN model, we apply image augmentation techniques such as rotation, shift, and flipping to the training data to enrich the data variety and prevent overfitting. The evaluation results show that the CNN model that has been trained with image augmentation techniques produces significant accuracy, with a maximum accuracy of 99.81%. We also performed an ensemble of several CNN models and found that this approach increased the digit recognition accuracy to 99.79%. This research has the potential for further development. Recommendations for further research include exploring more specific and complex image augmentation techniques, as well as using more challenging datasets. In addition, future research may consider improvements to the CNN architecture used or combining it with other methods such as recurrent neural networks (RNN).

Item Type: Article (Journal)
Uncontrolled Keywords: Digit Image Recognition, Convolutional Neural Networks (CNN), Image Augmentation
Subjects: T Technology > T Technology (General) > T10.5 Communication of technical information
Kulliyyahs/Centres/Divisions/Institutes (Can select more than one option. Press CONTROL button): Kulliyyah of Information and Communication Technology > Department of Computer Science
Kulliyyah of Information and Communication Technology > Department of Computer Science

Kulliyyah of Information and Communication Technology
Kulliyyah of Information and Communication Technology
Depositing User: Dr. Normi Sham Awang Abu Bakar
Date Deposited: 03 Oct 2023 17:06
Last Modified: 03 Oct 2023 17:06
URI: http://irep.iium.edu.my/id/eprint/107246

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year