Publications by Zhaoxiang Cai

A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches.

Ana R Baião, Zhaoxiang Cai, Rebecca C Poulos, Phillip J Robinson, Roger R Reddel

Brief Bioinform

July 2025

The rapid advancement of high-throughput sequencing and other assay technologies has resulted in the generation of large and complex multi-omics datasets, offering unprecedented opportunities for advancing precision medicine. However, multi-omics data integration remains challenging due to the high dimensionality, heterogeneity, and frequency of missing values across data types. Computational methods leveraging statistical and machine learning approaches have been developed to address these issues and uncover complex biological patterns, improving our understanding of disease mechanisms.

Federated Deep Learning Enables Cancer Subtyping by Proteomics.

Zhaoxiang Cai, Emma L Boys, Zainab Noor, Adel T Aref, Dylan Xavier

Cancer Discov

September 2025

Artificial intelligence applications in biomedicine face major challenges from data privacy requirements. To address this issue for clinically annotated tissue proteomic data, we developed a federated deep learning approach (ProCanFDL), training local models on simulated sites containing data from a pan-cancer cohort (n = 1,260) and 29 cohorts held behind private firewalls (n = 6,265), representing 19,930 replicate data-independent acquisition mass spectrometry runs. Local parameter updates were aggregated to build the global model, achieving a 43% performance gain on the hold-out test set (n = 625) in 14 cancer subtyping tasks compared with local models and matching centralized model performance.
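
As a rough illustration of what aggregating local parameter updates into a global model can look like, the sketch below uses sample-size-weighted averaging in the style of federated averaging (FedAvg). This is an assumption made for illustration only: the abstract does not state which aggregation rule ProCanFDL uses, and the function name `federated_average`, the parameter names, and the toy site data are all hypothetical.

```python
from typing import Dict, List

# Hypothetical representation: each site's model is a dict of flat parameter vectors.
Params = Dict[str, List[float]]


def federated_average(local_params: List[Params], n_samples: List[int]) -> Params:
    """Combine per-site parameters into one global set by weighted averaging.

    Each site's contribution is weighted by its share of the total sample count
    (FedAvg-style). This is an illustrative assumption, not the ProCanFDL method.
    """
    if not local_params or len(local_params) != len(n_samples):
        raise ValueError("need one sample count per site")
    total = sum(n_samples)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    global_params: Params = {}
    for name in local_params[0]:
        aggregated = [0.0] * len(local_params[0][name])
        for params, n in zip(local_params, n_samples):
            weight = n / total  # this site's share of all samples
            for i, value in enumerate(params[name]):
                aggregated[i] += weight * value
        global_params[name] = aggregated
    return global_params


# Toy example: two sites holding 100 and 300 samples respectively.
site_a = {"w": [1.0, 2.0], "b": [0.0]}
site_b = {"w": [3.0, 4.0], "b": [1.0]}
global_model = federated_average([site_a, site_b], n_samples=[100, 300])
print(global_model)
```

Only parameters and sample counts cross site boundaries in this sketch; the raw data never leave the site, which is the property that motivates federated training under data privacy constraints.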

Author Correction: Synthetic augmentation of cancer cell line multi-omic datasets using unsupervised deep learning.

Zhaoxiang Cai, Sofia Apolinário, Ana R Baião, Clare Pacini, Miguel D Sousa

Nat Commun

February 2025

Synthetic augmentation of cancer cell line multi-omic datasets using unsupervised deep learning.

Zhaoxiang Cai, Sofia Apolinário, Ana R Baião, Clare Pacini, Miguel D Sousa

Nat Commun

November 2024

Integrating diverse types of biological data is essential for a holistic understanding of cancer biology, yet it remains challenging due to data heterogeneity, complexity, and sparsity. Addressing this, our study introduces an unsupervised deep learning model, MOSA (Multi-Omic Synthetic Augmentation), specifically designed to integrate and augment the Cancer Dependency Map (DepMap). Harnessing orthogonal multi-omic information, this model successfully generates molecular and phenotypic profiles, resulting in an increase of 32.

DeePathNet: A Transformer-Based Deep Learning Model Integrating Multiomic Data with Cancer Pathways.

Zhaoxiang Cai, Rebecca C Poulos, Adel Aref, Phillip J Robinson, Roger R Reddel

Cancer Res Commun

December 2024

Multiomic data analysis incorporating machine learning has the potential to significantly improve cancer diagnosis and prognosis. Traditional machine learning methods are usually limited to omic measurements, omitting existing domain knowledge, such as the biological networks that link molecular entities in various omic data types. Here, we develop a transformer-based explainable deep learning model, DeePathNet, which integrates cancer-specific pathway information into multiomic data analysis.

Opportunities for pharmacoproteomics in biomarker discovery.

Rebecca C Poulos, Zhaoxiang Cai, Phillip J Robinson, Roger R Reddel, Qing Zhong

Proteomics

April 2023

Proteomic data are a uniquely valuable resource for drug response prediction and biomarker discovery because most drugs interact directly with proteins in target cells rather than with DNA or RNA. Recent advances in mass spectrometry and associated processing methods have enabled the generation of large-scale proteomic datasets. Here we review the significant opportunities that currently exist to combine large-scale proteomic data with drug-related research, a field termed pharmacoproteomics.

Pan-cancer proteomic map of 949 human cell lines.

Emanuel Gonçalves, Rebecca C Poulos, Zhaoxiang Cai, Syd Barthorpe, Srikanth S Manda, Caitlin Hall

Cancer Cell

August 2022

Article Synopsis

* By integrating various datasets, including drug response and gene essentiality screens, researchers identified thousands of protein biomarkers linked to cancer vulnerabilities, many of which were undetectable at the transcript level.
* The study demonstrates that the proteome predicts drug response about as well as the transcriptome, and that reducing the number of analyzed proteins to 1,500 does not significantly affect this predictive capability.

Machine learning for multi-omics data integration in cancer.

Zhaoxiang Cai, Rebecca C Poulos, Jia Liu, Qing Zhong

iScience

February 2022

Multi-omics data analysis is an important aspect of cancer molecular biology studies and has led to ground-breaking discoveries. Many efforts have been made to develop machine learning methods that automatically integrate omics data. Here, we review machine learning tools categorized as either general-purpose or task-specific, covering both supervised and unsupervised learning for integrative analysis of multi-omics data.

Barcode-like paper sensor for smartphone diagnostics: an application of blood typing.

Liyun Guan, Junfei Tian, Rong Cao, Miaosi Li, Zhaoxiang Cai

Anal Chem

November 2014

This study introduced a barcode-like design into a paper-based blood typing device by integrating it with smartphone-based technology. Presenting a paper-based blood typing assay in a barcode-like pattern significantly enhanced the adaptability of the assay to smartphone technology. The fabrication of this device involved the use of a printing technique to define hydrophilic bar channels, which were treated with Anti-A, Anti-B, and Anti-D antibodies, respectively.
