DeepG4: A deep learning approach to predict cell-type specific active G-quadruplex regions.

PLoS Comput Biol

Molecular, Cellular and Developmental biology department (MCD), Centre de Biologie Intégrative (CBI), University of Toulouse, CNRS, UPS, Toulouse, France.

Published: August 2021


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

DNA is a complex molecule carrying the instructions an organism needs to develop, live and reproduce. In 1953, Watson and Crick discovered that DNA is composed of two chains forming a double-helix. Later on, other structures of DNA were discovered and shown to play important roles in the cell, in particular G-quadruplex (G4). Following genome sequencing, several bioinformatic algorithms were developed to map G4s in vitro based on a canonical sequence motif, G-richness and G-skewness or alternatively sequence features including k-mers, and more recently machine/deep learning. Recently, new sequencing techniques were developed to map G4s in vitro (G4-seq) and G4s in vivo (G4 ChIP-seq) at few hundred base resolution. Here, we propose a novel convolutional neural network (DeepG4) to map cell-type specific active G4 regions (e.g. regions within which G4s form both in vitro and in vivo). DeepG4 is very accurate to predict active G4 regions in different cell types. Moreover, DeepG4 identifies key DNA motifs that are predictive of G4 region activity. We found that such motifs do not follow a very flexible sequence pattern as current algorithms seek for. Instead, active G4 regions are determined by numerous specific motifs. Moreover, among those motifs, we identified known transcription factors (TFs) which could play important roles in G4 activity by contributing either directly to G4 structures themselves or indirectly by participating in G4 formation in the vicinity. In addition, we used DeepG4 to predict active G4 regions in a large number of tissues and cancers, thereby providing a comprehensive resource for researchers. Availability: https://github.com/morphos30/DeepG4.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8384162PMC
http://dx.doi.org/10.1371/journal.pcbi.1009308DOI Listing

Publication Analysis

Top Keywords

active regions
16
cell-type specific
8
specific active
8
play roles
8
developed map
8
map g4s
8
g4s vitro
8
predict active
8
regions
6
deepg4
5

Similar Publications

Background: Intensive language-action therapy treats language deficits and depressive symptoms in chronic poststroke aphasia, yet the underlying neural mechanisms remain underexplored. Long-range temporal correlations (LRTCs) in blood oxygenation level-dependent signals indicate persistence in brain activity patterns and may relate to learning and levels of depression. This observational study investigates blood oxygenation level-dependent LRTC changes alongside therapy-induced language and mood improvements in perisylvian and domain-general brain areas.

View Article and Find Full Text PDF

Introduction: Reverse total shoulder arthroplasty is a well-established treatment for patients with rotator cuff tear arthropathy. The outcome after reverse total shoulder arthroplasty has been investigated in several studies and national registries. However, the treatment has not been compared to non-surgical treatment.

View Article and Find Full Text PDF

Intraoperative electrocorticography (ECoG) represents a crucial tool for improving seizure outcomes during epilepsy surgeries by assisting in localization of the epileptogenic zones. There is a shortage of information in the literature regarding single-center experiences and long-term outcomes after ECoG-guided surgeries. Data are particularly scarce from the Eastern Mediterranean Region.

View Article and Find Full Text PDF

Goal-directed behavior requires adjusting cognitive control, both in preparation for and in reaction to conflict. Theta oscillations and population activity in dorsomedial prefrontal cortex (dmPFC) and dorsolateral PFC (dlPFC) are known to support reactive control. Here, we investigated their role in proactive control using human intracranial electroencephalogram (EEG) recordings during a Stroop task that manipulated conflict expectations.

View Article and Find Full Text PDF

Background: Candidiasis, predominantly caused by , poses a significant global health challenge, especially in tropical regions. Nystatin is a potent antifungal agent that is hindered by its low solubility and permeability, limiting its clinical efficacy.

Methods: This study aimed to investigate the potential of a layer-by-layer (LBL) coating system, employing chitosan and alginate, to improve the stability, entrapment efficiency (%EE), and antifungal efficacy of nystatin-loaded liposomes against Candida albicans.

View Article and Find Full Text PDF