98%
921
2 minutes
20
Motivation: Gene expression and regulation, a key molecular mechanism driving human disease development, remains elusive, especially at early stages. Integrating the increasing amount of population-level genomic data and understanding gene regulatory mechanisms in disease development are still challenging. Machine learning has emerged to solve this, but many machine learning methods were typically limited to building an accurate prediction model as a 'black box', barely providing biological and clinical interpretability from the box.
Results: To address these challenges, we developed an interpretable and scalable machine learning model, ECMarker, to predict gene expression biomarkers for disease phenotypes and simultaneously reveal underlying regulatory mechanisms. Particularly, ECMarker is built on the integration of semi- and discriminative-restricted Boltzmann machines, a neural network model for classification allowing lateral connections at the input gene layer. This interpretable model is scalable without needing any prior feature selection and enables directly modeling and prioritizing genes and revealing potential gene networks (from lateral connections) for the phenotypes. With application to the gene expression data of non-small-cell lung cancer patients, we found that ECMarker not only achieved a relatively high accuracy for predicting cancer stages but also identified the biomarker genes and gene networks implying the regulatory mechanisms in the lung cancer development. In addition, ECMarker demonstrates clinical interpretability as its prioritized biomarker genes can predict survival rates of early lung cancer patients (P-value < 0.005). Finally, we identified a number of drugs currently in clinical use for late stages or other cancers with effects on these early lung cancer biomarkers, suggesting potential novel candidates on early cancer medicine.
Availabilityand Implementation: ECMarker is open source as a general-purpose tool at https://github.com/daifengwanglab/ECMarker.
Supplementary Information: Supplementary data are available at Bioinformatics online.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8150141 | PMC |
http://dx.doi.org/10.1093/bioinformatics/btaa935 | DOI Listing |
J Eval Clin Pract
September 2025
Department of Orthopedics and Traumatology, Medical Faculty, University of Health Sciences, Antalya, Turkey.
Aims And Objective: The field of medical statistics has experienced significant advancements driven by integrating innovative statistical methodologies. This study aims to conduct a comprehensive analysis to explore current trends, influential research areas, and future directions in medical statistics.
Methods: This paper maps the evolution of statistical methods used in medical research based on 4,919 relevant publications retrieved from the Web of Science.
J Magn Reson Imaging
September 2025
Department of Neurology, Dell Medical School, University of Texas at Austin, Austin, Texas, USA.
Background: Cerebrovascular reactivity reflects changes in cerebral blood flow in response to an acute stimulus and is reflective of the brain's ability to match blood flow to demand. Functional MRI with a breath-hold task can be used to elicit this vasoactive response, but data validity hinges on subject compliance. Determining breath-hold compliance often requires external monitoring equipment.
View Article and Find Full Text PDFZhong Nan Da Xue Xue Bao Yi Xue Ban
May 2025
Department of Geriatric Pulmonary and Critical Care Medicine, Xiangya Hospital, Central South University; National Clinical Research Center for Geriatric Disorders (Xiangya Hospital), Changsha 410008.
Objectives: Non-small cell lung cancer (NSCLC) is associated with poor prognosis, with 30% of patients diagnosed at an advanced stage. Mutations in the and genes are important prognostic factors for NSCLC, and targeted therapies can significantly improve survival in these patients. Although tissue biopsy remains the gold standard for detecting gene mutations, it has limitations, including invasiveness, sampling errors due to tumor heterogeneity, and poor reproducibility.
View Article and Find Full Text PDFDermatitis
September 2025
From the Department of Dermatology, Venereology and Leprology, All India Institute of Medical Sciences (AIIMS), Bhopal, India.
Contact dermatitis (CD), which includes both allergic CD and irritant CD, is a common inflammatory condition that can pose significant diagnostic challenges. Although patch testing is the gold standard for identifying causative allergens for allergic contact dermatitis (ACD), it is time-consuming, subjective, and requires expert interpretation. Recent advancements in artificial intelligence (AI), particularly in machine learning (ML) and deep learning, have shown promise in improving the accuracy, efficiency, and accessibility of CD diagnosis and management.
View Article and Find Full Text PDF