Pipelines for cross-species and genome-wide prediction of long noncoding RNA binding.

Nat Protoc

Bioinformatics Section, School of Basic Medical Sciences, Southern Medical University, Guangzhou, China.

Published: March 2019


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Abundant long, noncoding RNAs (lncRNAs) in mammals can bind to DNA sequences and recruit histone- and DNA-modifying enzymes to binding sites to epigenetically regulate target genes. However, most lncRNAs' binding motifs and target sites are unknown. The large numbers of lncRNAs and target sites in the whole genome make it infeasible to examine lncRNA binding to DNA purely experimentally. Here, we report a protocol for lncRNA/DNA-binding analysis that is built upon a database containing the GENCODE-annotated human and mouse lncRNAs, the orthologs of these lncRNAs in 17 mammals, and the genome sequences of the 17 mammals. Cross-species and genome-wide lncRNA/DNA-binding analysis begins with and is driven by database search. The predicted DNA-binding motifs and binding sites answer the general question of which lncRNAs may epigenetically regulate which genes, and can be used to identify potential sites for genome and epigenome editing. To use the protocol, preliminary knowledge of the base-pairing rules that guide the binding of noncoding RNAs to DNA to form triplexes, as well as the skills required to use the UCSC Genome Browser, are needed. A genome-wide prediction takes from 2 to 10 d, and the results are sent to users automatically by e-mail. The platform is updated continuously, making it possible to study more lncRNAs and larger genomic regions in less computational time.

Download full-text PDF

Source
http://dx.doi.org/10.1038/s41596-018-0115-5DOI Listing

Publication Analysis

Top Keywords

cross-species genome-wide
8
genome-wide prediction
8
long noncoding
8
noncoding rnas
8
lncrnas mammals
8
binding sites
8
epigenetically regulate
8
target sites
8
sites genome
8
lncrna/dna-binding analysis
8

Similar Publications

Metabolic dysfunction-associated fatty liver disease (MAFLD), a global epidemic affecting 25% of adults, is driven by immune-metabolic dysregulation, yet the causal mechanisms linking immune cell-specific gene perturbations to disease progression remain unresolved. Current studies lack systematic integration of single-cell transcriptomics, causal inference, and functional validation to dissect actionable potential intervention targets. We combined peripheral blood mononuclear cells (PBMCs) single-cell RNA sequencing (scRNA-seq; GSE179886: 2 MAFLD vs.

View Article and Find Full Text PDF

Saltations of cis-regulatory modules in Canidae and Hominidae.

Sci Rep

August 2025

State Key Laboratory of Mathematical Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, 100190, China.

Dogs, which were segregated from wolves about thirty thousand years ago, show unique human-similar social-cognitive abilities. However, the genomic basis accounting for the phenotypic saltation between dog and wolf remains unclear. We performed a comparative analysis of genome-wide cis-regulatory element frequencies (CREF) for five canids: dog, dingo, red fox, dhole, and wolf, along with four hominids.

View Article and Find Full Text PDF

Identification of L. Casein Kinase I-like Gene Family and Analysis of Abiotic Stress Response.

Genes (Basel)

June 2025

Shanghai Key Laboratory of Protected Horticulture Technology, The Protected Horticulture Institute, Shanghai Academy of Agricultural Sciences, Shanghai 201403, China.

Background: Casein kinase I-like () protein is a member of the serine/threonine kinase family and plays a pivotal regulatory role in various eukaryotic cellular processes, including stress responses.

Objectives: This study aims to systematically identify the gene family in the tomato genome and investigate its responsiveness to abiotic stress.

Methods: Members of were identified through genome-wide bioinformatics analysis, and their physicochemical properties, chromosomal localization, gene structure, conserved domains, phylogenetic relationships, cis-acting elements, cross-species collinearity, and tissue expression profiles were comprehensively analyzed.

View Article and Find Full Text PDF

Navigating the Strengths and Constraints of Mouse Models in Obesity Research.

Endocrinology

July 2025

Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam 3015 GD, The Netherlands.

Obesity is a major health problem, being a risk factor for many metabolic diseases. Obesity results from an imbalance in energy intake and energy expenditure. Animal models, particularly naturally occurring mouse models of obesity, have provided a framework of the basic mechanisms regulating energy homeostasis.

View Article and Find Full Text PDF

Major Histocompatibility Complex (MHC) class I molecules facilitate antiviral immunity through their polymorphic peptide-binding diversity. Tibetan wild boars, adapted to extreme plateau conditions, exhibit enhanced disease resistance. Here, we present the first three-dimensional structure of the Tibetan wild boar SLA-1*Z0301 in complex with the GP3-ALL9 peptide derived from porcine reproductive and respiratory syndrome virus (PRRSV).

View Article and Find Full Text PDF