Identification of SH2 domain-containing proteins and motifs prediction by a deep learning method.

Comput Biol Med

School of Basic Medical Sciences, Fujian Medical University, Fuzhou, 350122, China; Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Fujian Medical University, Fuzhou, 350122, China.

Published: August 2023


Article Abstract

The Src Homology 2 (SH2) domain plays an important role in signal transmission mechanisms in organisms. It mediates protein-protein interactions through the binding of phosphotyrosine to motifs in the SH2 domain. In this study, we designed a deep learning method to distinguish SH2 domain-containing proteins from non-SH2 domain-containing proteins. First, we collected SH2 and non-SH2 domain-containing protein sequences from multiple species. After data preprocessing, we built six deep learning models with DeepBIO and compared their performance. Second, we selected the model with the strongest overall performance, trained and tested it again separately, and analyzed the results visually. A 288-dimensional (288D) feature was found to distinguish the two types of proteins effectively. Finally, motif analysis identified the specific motif YKIR and revealed its function in signal transduction. In summary, we successfully distinguished SH2 domain-containing proteins from non-SH2 domain-containing proteins using a deep learning method and obtained the best-performing 288D features. In addition, we found a new motif, YKIR, in the SH2 domain and analyzed its function, which helps to further understand signaling mechanisms within the organism.
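As a rough illustration of the motif analysis step, the sketch below scans protein sequences for the YKIR motif and reports how often it occurs in each group. It is not the authors' pipeline or a DeepBIO API: the function names `find_motif` and `motif_frequency` are hypothetical, and the short sequences are made-up placeholders, not real SH2 or non-SH2 proteins.

```python
def find_motif(sequence: str, motif: str = "YKIR") -> list[int]:
    """Return the 0-based start positions of every motif hit, overlaps included."""
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start)
        # Resume one residue after the last hit so overlapping hits are found.
        start = sequence.find(motif, start + 1)
    return positions


def motif_frequency(sequences: list[str], motif: str = "YKIR") -> float:
    """Return the fraction of sequences that contain the motif at least once."""
    if not sequences:
        return 0.0
    hits = sum(1 for seq in sequences if motif in seq)
    return hits / len(sequences)


# Hypothetical placeholder sequences, invented for this example only.
toy_sh2 = ["MAWYKIRGLS", "GSYKIRTTYKIRA"]
toy_non_sh2 = ["MAAGLSTTPQ", "KKLLYKIAAR"]

if __name__ == "__main__":
    for name, group in (("SH2", toy_sh2), ("non-SH2", toy_non_sh2)):
        print(name, motif_frequency(group), [find_motif(seq) for seq in group])
```

A plain substring scan like this only shows where a known motif occurs. Discovering a new motif, as the study reports for YKIR, would require a dedicated motif analysis method that this sketch does not attempt.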

Source
http://dx.doi.org/10.1016/j.compbiomed.2023.107065

Similar Publications

Postoperative aphasia (POA) is a common complication in patients undergoing surgery for language-eloquent lesions. This study aimed to enhance the prediction of POA by leveraging preoperative navigated transcranial magnetic stimulation (nTMS) language mapping and diffusion tensor imaging (DTI)-based tractography, incorporating deep learning (DL) algorithms. One hundred patients with left-hemispheric lesions were retrospectively enrolled (43 developed postoperative aphasia, as the POA group; 57 did not, as the non-aphasia (NA) group).

Machine learning (ML) and deep learning (DL) methodologies have significantly advanced drug discovery and design in several respects. In addition, integrating structure-based data has been shown to support and improve the models' predictions. Indeed, we previously demonstrated that combining molecular dynamics (MD)-derived descriptors with ML models makes it possible to classify kinase ligands effectively as allosteric or orthosteric.

In recent AI-driven disease diagnosis, the success of models has depended mainly on extensive data sets and advanced algorithms. However, creating traditional data sets for rare or emerging diseases presents significant challenges. To address this issue, this study introduces a direct-self-attention Wasserstein generative adversarial network (DSAWGAN) designed to improve diagnostic capabilities in infectious diseases with limited data availability.

Few-shot learning for highly accelerated 3D time-of-flight MRA reconstruction.

Magn Reson Med

September 2025

Centre for Integrative Neuroimaging, FMRIB Division, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK.

Purpose: To develop a deep learning-based reconstruction method for highly accelerated 3D time-of-flight MRA (TOF-MRA) that achieves high-quality reconstruction with robust generalization using extremely limited acquired raw data, addressing the challenge of time-consuming acquisition of high-resolution, whole-head angiograms.

Methods: A novel few-shot learning-based reconstruction framework is proposed, featuring a 3D variational network specifically designed for 3D TOF-MRA that is pre-trained on simulated complex-valued, multi-coil raw k-space datasets synthesized from diverse open-source magnitude images and fine-tuned using only two single-slab experimentally acquired datasets. The proposed approach was evaluated against existing methods on acquired retrospectively undersampled in vivo k-space data from five healthy volunteers and on prospectively undersampled data from two additional subjects.

Automatic markerless estimation of infant posture and motion from ordinary videos carries great potential for movement studies "in the wild", facilitating understanding of motor development and massively increasing the chances of early diagnosis of disorders. There has been a rapid development of human pose estimation methods in computer vision, thanks to advances in deep learning and machine learning. However, these methods are trained on datasets that feature adults in different contexts.
