Short-Term Prediction Model for Breast Cancer Risk Based on One Million Medical Records.

Clin Breast Cancer

School of Medicine, Tel Aviv University, Tel Aviv, Israel; Department of Surgery, Chaim Sheba Medical Center, Ramat Gan, Israel. Electronic address:

Published: July 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Despite progress in breast cancer screening many women are diagnosed with advanced stage. We sought to develop a short-term (one year) prediction model for breast cancer risk, based on readily available data from electronic medical records (EMRs), to support decision-making.

Methods: A retrospective cohort study using data of 1,039,212 members of a large healthcare organization between the years 1985 and 2021. During the study years, 18,959 people were diagnosed with breast cancer. Longitudinal personal medical information such as demographics, cancer-related family history, smoking habits, medical history, fertility treatments, surgeries, biopsies, medications, BMI, blood pressure and lab tests was used to predict the outcome: breast cancer diagnosis one year from the recorded data. Prediction models were trained using the CatBoost decision tree methodology. SHapley Additive exPlanations (SHAP) values were used to estimate the marginal impact of a feature on the model performance, considering the other features.

Results: The model includes numerous features not utilized in existing breast cancer risk models (e.g., medications, systolic blood pressure, TSH levels and more), available from the EMR. The informative features, ranked by SHAP values, include age, the number of surgical consultations and the number of breast biopsies. The model achieved high performance with an area under the ROC curve (AUC-ROC) of 0.85.

Conclusions: Use of data readily available from the EMR, can assist clinicians when assessing the short-term breast cancer risk.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.clbc.2025.07.025DOI Listing

Publication Analysis

Top Keywords

breast cancer
28
cancer risk
16
prediction model
8
breast
8
model breast
8
risk based
8
medical records
8
blood pressure
8
shap values
8
cancer
7

Similar Publications

Introduction: Cutaneous scalp metastases from breast carcinoma (CMBC) represent an uncommon manifestation of metastatic disease, with heterogeneous clinical presentations, including nodular or infiltrative lesions and scarring alopecia (alopecia neoplastica). The absence of standardized diagnostic criteria, particularly for alopecic phenotypes, poses challenges to early recognition of CMBC, which may represent either the first indication of neoplastic progression or a late recurrence.

Materials And Methods: We retrospectively analyzed a multicenter cohort of 15 patients with histologically confirmed CMBC.

View Article and Find Full Text PDF

PRMT1-Mediated PARP1 Methylation Drives Lung Metastasis and Chemoresistance via P65 Activation in Triple-Negative Breast Cancer.

Research (Wash D C)

September 2025

State Key Laboratory of Oncology in South China, Guangdong Provincial Clinical Research Center for Cancer, Sun Yat-sen University Cancer Center, Guangzhou 510060, China.

Triple-negative breast cancer (TNBC) is the most aggressive breast cancer subtype, characterized by a high propensity for metastasis, poor prognosis, and limited treatment options. Research has demonstrated a substantial correlation between the expression of protein arginine N-methyltransferase 1 (PRMT1) and enhanced proliferation, metastasis, and poor outcomes in TNBC. However, the specific role of PRMT1 in lung metastasis and chemoresistance remains unclear.

View Article and Find Full Text PDF

Purpose: This study aimed to conduct functional proteomics across breast cancer subtypes with bioinformatics analyses.

Methods: Candidate proteins were identified using nanoscale liquid chromatography with tandem mass spectrometry (NanoLC-MS/MS) from core needle biopsy samples of early stage (0-III) breast cancers, followed by external validation with public domain gene-expression datasets (TCGA TARGET GTEx and TCGA BRCA).

Results: Seventeen proteins demonstrated significantly differential expression and protein-protein interaction (PPI) found the strong networks including COL2A1, COL11A1, COL6A1, COL6A2, THBS1 and LUM.

View Article and Find Full Text PDF

A strategy for targeting tumor-associated hypoxia utilizes reductase enzyme-mediated cleavage to convert biologically inert prodrugs to their corresponding biologically active parent therapeutic agents selectively in areas of pronounced hypoxia. Small-molecule inhibitors of tubulin polymerization represent unique therapeutic agents for this approach, with the most promising functioning as both antiproliferative agents (cytotoxins) and as vascular disrupting agents (VDAs). VDAs selectively and effectively disrupt tumor-associated microvessels, which are typically fragile and chaotic in nature.

View Article and Find Full Text PDF

Breast cancer continues to present a major clinical hurdle, largely attributable to its aggressive metastatic behavior and the suboptimal efficacy of standard chemotherapeutic regimens. Cisplatin (CDDP) is a representative platinum drug in the treatment of breast cancer, however, its therapeutic application is often constrained by systemic toxicity and the frequent onset of chemoresistance. Here, we introduce a novel charge-adaptive nanoprodrug system, referred to as PP@, engineered to respond to tumor-specific conditions.

View Article and Find Full Text PDF