Development and validation of machine learning-based risk prediction models of oral squamous cell carcinoma using salivary autoantibody biomarkers.
Details
Serval ID
serval:BIB_E21C36E7BF06
Type
Article: article from journal or magazin.
Collection
Publications
Institution
Title
Development and validation of machine learning-based risk prediction models of oral squamous cell carcinoma using salivary autoantibody biomarkers.
Journal
BMC oral health
ISSN
1472-6831 (Electronic)
ISSN-L
1472-6831
Publication state
Published
Issued date
24/11/2022
Peer-reviewed
Oui
Volume
22
Number
1
Pages
534
Language
english
Notes
Publication types: Journal Article ; Research Support, Non-U.S. Gov't
Publication Status: epublish
Publication Status: epublish
Abstract
The incidence of oral cavity squamous cell carcinoma (OSCC) continues to rise. OSCC is associated with a low average survival rate, and most patients have a poor disease prognosis because of delayed diagnosis. We used machine learning techniques to predict high-risk cases of OSCC by using salivary autoantibody levels and demographic and behavioral data.
We collected the salivary samples of patients recruited from a teaching hospital between September 2008 and December 2012. Ten salivary autoantibodies, sex, age, smoking, alcohol consumption, and betel nut chewing were used to build prediction models for identifying patients with a high risk of OSCC. The machine learning algorithms applied in the study were logistic regression, random forest, support vector machine with the radial basis function kernel, eXtreme Gradient Boosting (XGBoost), and a stacking model. We evaluated the performance of the models by using the area under the receiver operating characteristic curve (AUC), with simulations conducted 100 times.
A total of 337 participants were enrolled in this study. The best predictive model was constructed using a stacking algorithm with original forms of age and logarithmic levels of autoantibodies (AUC = 0.795 ± 0.055). Adding autoantibody levels as a data source significantly improved the prediction capability (from 0.698 ± 0.06 to 0.795 ± 0.055, p < 0.001).
We successfully established a prediction model for high-risk cases of OSCC. This model can be applied clinically through an online calculator to provide additional personalized information for OSCC diagnosis, thereby reducing the disease morbidity and mortality rates.
We collected the salivary samples of patients recruited from a teaching hospital between September 2008 and December 2012. Ten salivary autoantibodies, sex, age, smoking, alcohol consumption, and betel nut chewing were used to build prediction models for identifying patients with a high risk of OSCC. The machine learning algorithms applied in the study were logistic regression, random forest, support vector machine with the radial basis function kernel, eXtreme Gradient Boosting (XGBoost), and a stacking model. We evaluated the performance of the models by using the area under the receiver operating characteristic curve (AUC), with simulations conducted 100 times.
A total of 337 participants were enrolled in this study. The best predictive model was constructed using a stacking algorithm with original forms of age and logarithmic levels of autoantibodies (AUC = 0.795 ± 0.055). Adding autoantibody levels as a data source significantly improved the prediction capability (from 0.698 ± 0.06 to 0.795 ± 0.055, p < 0.001).
We successfully established a prediction model for high-risk cases of OSCC. This model can be applied clinically through an online calculator to provide additional personalized information for OSCC diagnosis, thereby reducing the disease morbidity and mortality rates.
Keywords
Humans, Mouth Neoplasms/diagnosis, Carcinoma, Squamous Cell/diagnosis, Squamous Cell Carcinoma of Head and Neck, Machine Learning, Head and Neck Neoplasms, Biomarkers, Autoantibodies, Biomarker, Machine learning, Oral cavity squamous cell carcinoma
Pubmed
Web of science
Open Access
Yes
Create date
06/12/2022 14:05
Last modification date
23/01/2024 7:36