A New Method of RNA Secondary Structure Prediction Based on Convolutional Neural Network and Dynamic Programming.

Front Genet

College of Computer Science and Technology and Symbol Computation and Knowledge Engineering, Ministry of Education, Jilin University, Changchun, China.

Published: May 2019


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In recent years, obtaining RNA secondary structure information has played an important role in RNA and gene function research. Although some RNA secondary structures can be gained experimentally, in most cases, efficient, and accurate computational methods are still needed to predict RNA secondary structure. Current RNA secondary structure prediction methods are mainly based on the minimum free energy algorithm, which finds the optimal folding state of RNA using an iterative method to meet the minimum energy or other constraints. However, due to the complexity of biotic environment, a true RNA structure always keeps the balance of biological potential energy status, rather than the optimal folding status that meets the minimum energy. For short sequence RNA its equilibrium energy status for the RNA folding organism is close to the minimum free energy status; therefore, the minimum free energy algorithm for predicting RNA secondary structure has higher accuracy. Nevertheless, in a longer sequence RNA, constant folding causes its biopotential energy balance to deviate far from the minimum free energy status. This deviation is because of its complex structure and results in a serious decline in the prediction accuracy of its secondary structure. In this paper, we propose a novel RNA secondary structure prediction algorithm using a convolutional neural network model combined with a dynamic programming method to improve the accuracy with large-scale RNA sequence and structure data. We analyze current experimental RNA sequences and structure data to construct a deep convolutional network model, and then we extract implicit features of an effective classification from large-scale data to predict the pairing probability of each base in an RNA sequence. For the obtained probabilities of RNA sequence base pairing, an enhanced dynamic programming method is applied to obtain the optimal RNA secondary structure. Results indicate that our proposed method is superior to the common RNA secondary structure prediction algorithms in predicting three benchmark RNA families. Based on the characteristics of deep learning algorithm, it can be inferred that the method proposed in this paper has a 30% higher prediction success rate when compared with other algorithms, which will be needed as the amount of real RNA structure data increases in the future.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6540740PMC
http://dx.doi.org/10.3389/fgene.2019.00467DOI Listing

Publication Analysis

Top Keywords

rna secondary
36
secondary structure
36
rna
20
structure prediction
16
minimum free
16
free energy
16
energy status
16
structure
14
dynamic programming
12
rna sequence
12

Similar Publications

Clustering DNA and RNA molecular dynamics ensembles via secondary structure.

Biophys J

September 2025

Biophysical and Biomedical Measurement Group, Microsystems and Nanotechnology Division, Physical Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, MD, USA. Electronic address:

Macromolecular structure is central to biology. Yet, not all biomolecules have a well-defined fold. Intrinsically disordered regions are ubiquitous, conveying a versatility to function even in otherwise folded structures.

View Article and Find Full Text PDF

Manipulating Zika virus RNA tertiary structure for developing tissue-specific attenuated vaccines.

EMBO Mol Med

September 2025

State Key Laboratory of Pathogen and Biosecurity, Academy of Military Medical Sciences, 100071, Beijing, China.

Traditional live attenuated vaccines (LAVs) are typically developed through serial passaging or genetic engineering to introduce specific mutations or deletions. While viral RNA secondary or tertiary structures have been well-documented for their multiple functions, including binding with specific host proteins, their potential for LAV design remains largely unexplored. Herein, using Zika virus (ZIKV) as a model, we demonstrate that targeted disruption of the primary sequence or tertiary structure of a specific viral RNA element responsible for Musashi-1 (MSI1) binding leads to a tissue-specific attenuation phenotype in multiple animal models.

View Article and Find Full Text PDF

Dengue virus infection reprograms baseline innate immune gene expression.

Med

September 2025

Viral Research and Experimental Medicine Centre, SingHealth Duke-NUS Academic Medical Centre, Singapore 169856, Singapore; Program in Emerging Infectious Diseases, Duke-NUS Medical School, Singapore 169857, Singapore; Department of Clinical Translational Research, Singapore General Hospital, Singapo

Background: All three dengue vaccines that have completed phase 3 clinical trials have shown greater efficacy in dengue-seropositive compared to dengue-seronegative individuals. This includes the live-attenuated tetravalent dengue vaccine TAK-003, where immunogenicity in baseline seronegative individuals remains lower after two doses, despite seroconversion after the first dose, compared to baseline seropositive individuals after one dose.

Methods: A whole-genome microarray was used to analyze the host response to TAK-003.

View Article and Find Full Text PDF

Clusters of deep intronic RbFox motifs embedded in large assembly of splicing regulators sequences regulate alternative splicing.

PLoS Genet

September 2025

Neural Development Section, Mouse Cancer Genetics Program, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Frederick, Maryland, United States of America.

The RbFox RNA binding proteins regulate alternative splicing of genes governing mammalian development and organ function. They bind to the RNA sequence (U)GCAUG with high affinity but also non-canonical secondary motifs in a concentration dependent manner. However, the hierarchical requirement of RbFox motifs, which are widespread in the genome, is still unclear.

View Article and Find Full Text PDF

Soft tissue sarcomas are a heterogeneous group of malignancies arising from mesenchymal cells. Recent advancements in genomic profiling have identified novel gene fusions in these tumors, offering new insights into their pathogenesis and potential therapeutic targets. Here, we describe a spindle cell sarcoma harboring a novel gene fusion.

View Article and Find Full Text PDF