TY - JOUR
T1 - Identifying factors associated with vaping cessation in young adults
T2 - A machine learning and XAI approach
AU - Satheeshkumar, Poolakkad S.
AU - Lango, Ian
AU - Zafo, Swarnali
AU - Ebanks, Mikaiel
AU - Das, Rahul Kumar
AU - Cheung, Kit Wai
AU - Pili, Roberto
AU - Mahajan, Supriya
N1 - Publisher Copyright:
© 2026 Satheeshkumar et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2026/5
Y1 - 2026/5
N2 - The public health impact of vaping in the United States reflects a complex balance of potential benefits and emerging risks, as e‑cigarettes may reduce exposure to toxic combustion byproducts and support adult smoking cessation, yet growing evidence links vaping to respiratory and cardiovascular harm and youth uptake remains con‑ cerning, with 38.4% of adolescent users in 2024 reporting habitual use. To inform the optimal use of predictive technologies in cessation efforts, this study sought to char‑ acterize cessation‑related behaviors and attitudes among young adult vapers and evaluate machine learning and explainable AI methods for predicting quit attempts and cessation success. A social media–based survey captured behavioral, contex‑ tual, and demographic factors, and cessation was defined as self‑reported absti‑ nence from all vaping products for at least 30 days. Predictors were identified using forward selection and backward elimination, and data were split into training and test‑ ing sets. Linear models (LASSO, ridge regression, elastic net) and nonlinear models (random forest, support vector machine) were trained and evaluated using AUC and Brier scores. Linear models demonstrated the strongest overall performance: LASSO achieved AUCs of 0.89 (training) and 0.91 (testing), ridge regression 0.88 and 0.93, and elastic net 0.91 for both sets. Nonlinear models showed signs of overfitting, with random forest achieving 0.99 in training but only 0.70 in testing, and SVM achieving 0.89 and 0.72. Key predictors included age, environmental triggers, vaping fre‑ quency, sex, and long‑term behavioral outlook. Individuals under 25 showed greater vulnerability to continued use, environmental cues, especially social exposure, were strongly associated with relapse, and erratic vaping patterns predicted lower ces‑ sation success. While these models highlight behavioral and contextual factors that may influence cessation, findings should be interpreted as exploratory given the cross‑sectional design and sample characteristics. Larger, longitudinal studies are needed to validate these insights and clarify the potential of predictive modeling to inform targeted public health interventions.
AB - The public health impact of vaping in the United States reflects a complex balance of potential benefits and emerging risks, as e‑cigarettes may reduce exposure to toxic combustion byproducts and support adult smoking cessation, yet growing evidence links vaping to respiratory and cardiovascular harm and youth uptake remains con‑ cerning, with 38.4% of adolescent users in 2024 reporting habitual use. To inform the optimal use of predictive technologies in cessation efforts, this study sought to char‑ acterize cessation‑related behaviors and attitudes among young adult vapers and evaluate machine learning and explainable AI methods for predicting quit attempts and cessation success. A social media–based survey captured behavioral, contex‑ tual, and demographic factors, and cessation was defined as self‑reported absti‑ nence from all vaping products for at least 30 days. Predictors were identified using forward selection and backward elimination, and data were split into training and test‑ ing sets. Linear models (LASSO, ridge regression, elastic net) and nonlinear models (random forest, support vector machine) were trained and evaluated using AUC and Brier scores. Linear models demonstrated the strongest overall performance: LASSO achieved AUCs of 0.89 (training) and 0.91 (testing), ridge regression 0.88 and 0.93, and elastic net 0.91 for both sets. Nonlinear models showed signs of overfitting, with random forest achieving 0.99 in training but only 0.70 in testing, and SVM achieving 0.89 and 0.72. Key predictors included age, environmental triggers, vaping fre‑ quency, sex, and long‑term behavioral outlook. Individuals under 25 showed greater vulnerability to continued use, environmental cues, especially social exposure, were strongly associated with relapse, and erratic vaping patterns predicted lower ces‑ sation success. While these models highlight behavioral and contextual factors that may influence cessation, findings should be interpreted as exploratory given the cross‑sectional design and sample characteristics. Larger, longitudinal studies are needed to validate these insights and clarify the potential of predictive modeling to inform targeted public health interventions.
UR - https://www.scopus.com/pages/publications/105038226075
U2 - 10.1371/journal.pdig.0001031
DO - 10.1371/journal.pdig.0001031
M3 - Article
AN - SCOPUS:105038226075
SN - 2767-3170
VL - 5
JO - PLOS Digital Health
JF - PLOS Digital Health
IS - 5 May
M1 - e0001031
ER -