IIUM Repository

Streptococcus gallolyticus infection and its Interrelation with colorectal cancer: diagnostic accuracy of statistical and machine learning models for early detection algorithm

Mohammad Aidid, Edre and Hamzah, Hairul Aini and Shalihin, Mohd Shaiful Ehsan and Md Nor, Azmi and Ismail, Che Muhammad Khairul Hisyam (2025) Streptococcus gallolyticus infection and its Interrelation with colorectal cancer: diagnostic accuracy of statistical and machine learning models for early detection algorithm. IIUM Medical Journal Malaysia, 24 (4). pp. 42-50. ISSN 1823-4631 E-ISSN 2735-2285

[img] PDF - Published Version
Download (2MB)

Abstract

INTRODUCTION: Epidemiological studies have emphasized the role of Streptococcus gallolyticus subspecies gallolyticus (Sgg) infection in the development of colorectal cancer (CRC), yet it remains underappreciated. While statistical and machine learning (ML) models can enhance CRC prediction, direct comparisons between them are rare. This study aims to assess the diagnostic accuracy of stool polymerase chain reaction (PCR) for Sgg and immunochemical fecal occult blood test (iFOBT) for CRC detection and to compare multivariable statistical and ML models in predicting CRC. MATERIALS AND METHODS: A hospital-based case-control study with a reversed flow design was conducted, involving 33 CRC cases and 80 controls. The analysis incorporated Asia Pacific Colorectal Screening (APCS) risk factors into three predictive models: logistic regression (LR), decision tree (DT), and ensemble Bayesian boosted decision tree (BDT). RESULTS: Combined testing achieved a net sensitivity of 54%, outperforming individual tests (iFOBT=12.1%, Stool PCR=48.5%). Among the models, the ensemble BDT approach demonstrated the highest classification accuracy for CRC (BDT= 78.1%; DT=72.4%; LR=69.9%). The DT model identified iFOBT as the sole predictor, while the BDT ensemble model prioritized positive stool PCR for Sgg as the primary predictor, followed by normal to overweight body mass index and individuals aged over 53 years. CONCLUSION: The ensemble ML model incorporating Sgg infection demonstrated superior predictive performance. Screening for Sgg in stool samples has the potential as an early CRC detection strategy, particularly for individuals with a normal to overweight BMI and those above 53 years old

Item Type: Article (Journal)
Uncontrolled Keywords: Streptococcus gallolyticus, colorectal cancer, diagnostic accuracy, machine learning, bayesian
Subjects: R Medicine > RA Public aspects of medicine > RA0421 Public health. Hygiene. Preventive Medicine
R Medicine > RA Public aspects of medicine > RA643 Communicable Diseases and Public Health
R Medicine > RA Public aspects of medicine > RA644.3 Chronic and Noninfectious Diseases and Public Health
Kulliyyahs/Centres/Divisions/Institutes (Can select more than one option. Press CONTROL button): Kulliyyah of Science
Kulliyyah of Medicine
Kulliyyah of Medicine > Department of Basic Medical
Kulliyyah of Medicine > Department of Community Medicine (Effective: 1st January 2011)
Kulliyyah of Medicine > Department of Family Medicine (Effective: 1st January 2011)
Kulliyyah of Medicine > Department of Surgery
Depositing User: Dr Edre Mohammad Aidid
Date Deposited: 02 Oct 2025 11:50
Last Modified: 02 Oct 2025 11:50
URI: http://irep.iium.edu.my/id/eprint/123529

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year