Prognostication of differentiated thyroid cancer recurrence: An explainable machine learning approach

Authors

  • Ghazi M. Idroes Department of Occupational Health and Safety, Faculty of Health Sciences, Universitas Abulyatama, Aceh Besar, Indonesia
  • Teuku R. Noviandy Department of Information Systems, Universitas Abulyatama, Aceh Besar, Indonesia https://orcid.org/0000-0002-5779-2235
  • Ghalieb M. Idroes Interdisciplinary Innovation Research Unit, Graha Primera Saintifika, Aceh Besar, Indonesia
  • Irsan Hardi Interdisciplinary Innovation Research Unit, Graha Primera Saintifika, Aceh Besar, Indonesia https://orcid.org/0000-0002-8657-0068
  • Teuku F. Duta Medical Research Unit, School of Medicine, Universitas Syiah Kuala, Banda Aceh, Indonesia
  • Lama MA. Hamoud Department of Pharmacy Practice, College of Pharmacy, Taibah University, Madinah, Saudi Arabia
  • Hala T. Al-Gunaid Faculty of Medicine Kasr Al-Ainy, Cairo University, Cairo, Egypt

DOI:

https://doi.org/10.52225/narrax.v2i3.183

Keywords:

LightGBM, SHAP, supervised learning, medical informatics, recurrence prediction

Abstract

Differentiated thyroid cancer (DTC) generally has a favorable prognosis, but recurrence remains a concern for a subset of patients, highlighting the need for accurate predictive tools. While traditional methods, such as the American Thyroid Association (ATA) guidelines, are widely used, they may not fully capture the complex patterns in clinical data. To address this, we developed a machine learning model using LightGBM and enhanced its interpretability with SHAP (SHapley Additive exPlanations). Our model, trained on data from 383 DTC patients, identified response to initial therapy as the most significant predictor of recurrence, alongside age and risk level. The model achieved an accuracy of 93.51%, with precision and sensitivity of 94.23% and 96.08%, respectively, using only five key features selected through Recursive Feature Elimination (RFE). SHAP analysis provided clear insights into how these features influenced predictions, offering a transparent and interpretable approach to risk stratification. These results highlight the potential of explainable machine learning to improve recurrence prediction, support personalized care, and build clinician trust, while laying the groundwork for further validation in diverse populations.

Downloads

Published

2024-12-31

Issue

Section

Original Article