Artificial intelligence-based recognition for variant pathogenicity of BRCA1 using AlphaFold2-predicted structures.

Chang Li , Lili Zhang , Zhongling Zhuo , Fei Su , Hexin Li , Siyuan Xu , Ye Liu , Zaifeng Zhang , Yibo Xie , Xue Yu , Liheng Bian , Fei Xiao

Theranostics

Peking University Fifth School of Clinical Medicine, Beijing, China.

Published: January 2023

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

With the surge of the high-throughput sequencing technologies, many genetic variants have been identified in the past decade. The vast majority of these variants are defined as variants of uncertain significance (VUS), as their significance to the function or health of an organism is not known. It is urgently needed to develop intelligent models for the clinical interpretation of VUS. State-of-the-art artificial intelligence (AI)-based variant effect predictors only learn features from primary amino acid sequences, leaving out information about the most important three-dimensional structure that is more related to its function. We proposed a deep convolutional neural network model named variant effect recognition network for BRCA1 (vERnet-B) to recognize the clinical pathogenicity of missense single-nucleotide variants in the BRCT domain of BRCA1. vERnet-B learned features associated with the pathogenicity from the tertiary protein structures of variants predicted by AlphaFold2. After performing a series of validation and analyses on vERnet-B, we discovered that it exhibited significant advances over previous works. Recognizing the phenotypic consequences of VUS is one of the most daunting challenges in genetic informatics; however, we achieved 85% accuracy in recognizing disease BRCA1 variants with an ideal balance of false-positive and true-positive detection rates. vERnet-B correctly recognized the pathogenicity of variant A1708E, which was poorly predicted by AlphaFold2 as previously described. The vERnet-B web server is freely available from URL: http://ai-lab.bjrz.org.cn/vERnet. We applied protein tertiary structures to successfully recognize the pathogenic missense SNVs, which were difficult to be addressed by classical approaches based on sequences. Our work demonstrated that AlphaFold2-predicted structures were expected to be used for rich feature learning and revealed unique insights into the clinical interpretation of VUS in disease-related genes, using vERnet-B as a discovery tool.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9800725	PMC
http://dx.doi.org/10.7150/thno.79362	DOI Listing

Publication Analysis

Top Keywords

alphafold2-predicted structures

clinical interpretation

interpretation vus

brca1 vernet-b

predicted alphafold2

variants

vernet-b

artificial intelligence-based

intelligence-based recognition

variant

Similar Publications

From sequence to scaffold: computational design of protein nanoparticle vaccines from AlphaFold2-predicted building blocks.

bioRxiv

August 2025

Institute for Protein Design, University of Washington, Seattle, WA 98195, USA.

Cyrus M Haas , Naveen Jasti , Annie Dosey , Joel D Allen , Rebecca Gillespie

Self-assembling protein nanoparticles are being increasingly utilized in the design of next-generation vaccines due to their ability to induce antibody responses of superior magnitude, breadth, and durability. Computational protein design offers a route to novel nanoparticle scaffolds with structural and biochemical features tailored to specific vaccine applications. Although strategies for designing new self-assembling proteins have been established, the recent development of powerful machine learning-based tools for protein structure prediction and design provides an opportunity to overcome several of their limitations.

View Article and Find Full Text PDF

Similar Publications

Integrating ESM‑2 and Graph Neural Networks with AlphaFold‑2 Structures for Enhanced Protein Function Prediction.

ACS Omega

August 2025

Department of Biomedical Informatics, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 119228, Singapore.

Thi-Tuyen Nguyen , Zhuocheng Jiang , Van-Nui Nguyen , Nguyen Quoc Khanh Le , Matthew Chin Heng Chua

Protein function prediction is essential for elucidating biological processes and accelerating drug discovery. However, the vast number of unannotated protein sequences and the limited availability of experimentally validated functional data remain major challenges. Although deep learning models based on protein sequences or protein-protein interaction networks have shown promise, their performance is still restricted, particularly for proteins without interaction data.

View Article and Find Full Text PDF

Similar Publications

The Conformational Landscape of AlphaFold2-Predicted Amyloidogenic Light Chains and Their Correlation With VL Domain Mutations and Aggregation Propensity.

J Mol Recognit

September 2025

Protein Interactome Lab for Structural and Functional Biology, Advanced Centre for Treatment Research and Education in Cancer, Navi Mumbai, Maharashtra, India.

Sarita Puri , Ishaan Chaudhary , Arnav Khatri , Basudha Patel , Amit Kumawat

Systemic light-chain amyloidosis (AL) is caused by the misfolding and aggregation of immunoglobulin light chains (LCs), which natively form homodimers comprising variable (VL) and constant (CL) domains in each monomer. High sequence variability, particularly within the VL domain, results in varied structural changes and aggregation propensities, making it challenging to develop broadly effective native protein stabilizers/aggregation inhibitors, as each AL patient carries a unique light chain. Using artificial intelligence (AI)-based AlphaFold2, known for its accuracy in modeling folded proteins, we generated an extensive repertoire of structural models of full-length LCs from four amyloidogenic germlines: IGLV1(λ1), IGLV3(λ3), IGLV6(λ6), and IGKV1(κ1), over-represented in AL patients to identify germline-specific structural features.

View Article and Find Full Text PDF

Similar Publications

Zymoseptoria tritici Effectors Structurally Related to Killer Proteins UmV-KP4 and UmV-KP6 Inhibit Fungal Growth, and Define Extended Protein Families in Fungi.

Mol Plant Pathol

August 2025

Université Paris-Saclay, INRAE, UR BIOGER, Palaiseau, France.

Karine de Guillen , Léa Mammri , Jérôme Gracy , André Padilla , Philippe Barthe

Fungal effectors play crucial roles in plant infection. Despite low sequence identity, they were recently discovered to belong to families with similar three-dimensional structures. In this study, we elucidated the structures of Zt-NIP1 and Mycgr3-91409-2 effectors of the wheat fungal pathogen Zymoseptoria tritici using X-ray crystallography and NMR.

View Article and Find Full Text PDF

Similar Publications

Leveraging AlphaFold2 structural space exploration for generating drug target structures in structure-based virtual screening.

Biochem Biophys Rep

September 2025

Department of Computer Science, School of Computing, Institute of Science Tokyo, 4259 G3-56 Nagatsuta-cho, Midori-ku, Yokohama, 226-8501, Kanagawa, Japan.

Keisuke Uchikawa , Kairi Furui , Masahito Ohue

Computational virtual screening (VS) plays a vital role in early-stage drug discovery by enabling the efficient selection of candidate compounds and reducing associated costs. However, the absence of experimentally determined three-dimensional protein structures often limits the applicability of structure-based VS. Advances in protein structure prediction, notably AlphaFold2, have begun to address this gap.

View Article and Find Full Text PDF

Similar Publications