98%
921
2 minutes
20
Cancer remains one of the leading causes of mortality worldwide, where early detection significantly improves patient outcomes and reduces treatment burden. This study investigates the application of Machine Learning (ML) techniques to predict cancer risk based on a combination of genetic and lifestyle factors. A structured dataset of 1,200 patient records was used, comprising features such as age, gender, Body Mass Index (BMI), smoking status, alcohol intake, physical activity, genetic risk level, and personal history of cancer. A full end-to-end ML pipeline was implemented, encompassing data exploration, preprocessing, feature scaling, model training, and evaluation using stratified cross-validation and a separate test set. Nine supervised learning algorithms were evaluated and compared, including Logistic Regression (LR), Decision Tree (DT), Random Forest (RF), Support Vector Machines (SVMs), and several ensemble methods. Among these, Categorical Boosting (CatBoost) achieved the highest predictive performance, with a test accuracy of 98.75% and an F1-score of 0.9820, outperforming both traditional and other advanced models. Feature importance analysis confirmed the strong influence of cancer history, genetic risk, and smoking status on prediction outcomes. The findings highlight the effectiveness of boosting-based ensemble models in capturing complex interactions within health data and support their potential use in personalized cancer risk assessment. This research underscores the value of integrating genetic and modifiable lifestyle variables into predictive modeling to enhance early detection and preventive healthcare strategies.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12365227 | PMC |
http://dx.doi.org/10.1038/s41598-025-15656-8 | DOI Listing |
JAMA Dermatol
September 2025
Department of Population Health, QIMR Berghofer Medical Research Institute, Herston, Queensland, Australia.
Importance: Increasingly, strategies to systematically detect melanomas invoke targeted approaches, whereby those at highest risk are prioritized for skin screening. Many tools exist to predict future melanoma risk, but most have limited accuracy and are potentially biased.
Objectives: To develop an improved melanoma risk prediction tool for invasive melanoma.
JAMA Netw Open
September 2025
Division of Gastroenterology, Department of Medicine, University of California San Diego, La Jolla.
Importance: Janus kinase (JAK) inhibitors are highly effective medications for several immune-mediated inflammatory diseases (IMIDs). However, safety concerns have led to regulatory restrictions.
Objective: To compare the risk of adverse events with JAK inhibitors vs tumor necrosis factor (TNF) antagonists in patients with IMIDs in head-to-head comparative effectiveness studies.
JAMA Dermatol
September 2025
Department of Dermatology, University of Washington, Seattle.
Importance: Merkel cell carcinoma (MCC) is typically caused by the Merkel cell polyomavirus (MCPyV) and recurs in 40% of patients. Half of patients with MCC produce antibodies to MCPyV oncoproteins, the titers of which rise with disease recurrence and fall after successful treatment.
Objective: To assess the utility of MCPyV oncoprotein antibodies for early detection of first recurrence of MCC in a real-world clinical setting.
Int J Surg
September 2025
Department of Neurosurgery, Mianyang Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Mianyang, Sichuan, People's Republic of China.
Int J Surg
September 2025
Department of Oral and Maxillofacial Surgery, The Affiliated Tai'an City Central Hospital of Qingdao University, Taian, China.