The rapid advancement of high-throughput sequencing and other assay technologies has resulted in the generation of large and complex multi-omics datasets, offering unprecedented opportunities for advancing precision medicine. However, multi-omics data integration remains challenging due to the high-dimensionality, heterogeneity, and frequency of missing values across data types. Computational methods leveraging statistical and machine learning approaches have been developed to address these issues and uncover complex biological patterns, improving our understanding of disease mechanisms.
View Article and Find Full Text PDFUnlabelled: Artificial intelligence applications in biomedicine face major challenges from data privacy requirements. To address this issue for clinically annotated tissue proteomic data, we developed a federated deep learning approach (ProCanFDL), training local models on simulated sites containing data from a pan-cancer cohort (n = 1,260) and 29 cohorts held behind private firewalls (n = 6,265), representing 19,930 replicate data-independent acquisition mass spectrometry runs. Local parameter updates were aggregated to build the global model, achieving a 43% performance gain on the hold-out test set (n = 625) in 14 cancer subtyping tasks compared with local models and matching centralized model performance.
View Article and Find Full Text PDFIntegrating diverse types of biological data is essential for a holistic understanding of cancer biology, yet it remains challenging due to data heterogeneity, complexity, and sparsity. Addressing this, our study introduces an unsupervised deep learning model, MOSA (Multi-Omic Synthetic Augmentation), specifically designed to integrate and augment the Cancer Dependency Map (DepMap). Harnessing orthogonal multi-omic information, this model successfully generates molecular and phenotypic profiles, resulting in an increase of 32.
View Article and Find Full Text PDFCancer Res Commun
December 2024
Abstract: Multiomic data analysis incorporating machine learning has the potential to significantly improve cancer diagnosis and prognosis. Traditional machine learning methods are usually limited to omic measurements, omitting existing domain knowledge, such as the biological networks that link molecular entities in various omic data types. Here, we develop a transformer-based explainable deep learning model, DeePathNet, which integrates cancer-specific pathway information into multiomic data analysis.
View Article and Find Full Text PDFProteomic data are a uniquely valuable resource for drug response prediction and biomarker discovery because most drugs interact directly with proteins in target cells rather than with DNA or RNA. Recent advances in mass spectrometry and associated processing methods have enabled the generation of large-scale proteomic datasets. Here we review the significant opportunities that currently exist to combine large-scale proteomic data with drug-related research, a field termed pharmacoproteomics.
View Article and Find Full Text PDFCancer Cell
August 2022
Multi-omics data analysis is an important aspect of cancer molecular biology studies and has led to ground-breaking discoveries. Many efforts have been made to develop machine learning methods that automatically integrate omics data. Here, we review machine learning tools categorized as either general-purpose or task-specific, covering both supervised and unsupervised learning for integrative analysis of multi-omics data.
View Article and Find Full Text PDFAnal Chem
November 2014
This study introduced a barcode-like design into a paper-based blood typing device by integrating with smartphone-based technology. The concept of presenting a paper-based blood typing assay in a barcode-like pattern significantly enhanced the adaptability of the assay to the smartphone technology. The fabrication of this device involved the use of a printing technique to define hydrophilic bar channels which were, respectively, treated with Anti-A, -B, and -D antibodies.
View Article and Find Full Text PDF