: an R package to infer gene transcription rates with a novel least sum of squares method.

NAR Genom Bioinform

Department of Internal Medicine, Nephrology Division, University of Michigan, Ann Arbor 48109 MI, United States.

Published: September 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The dynamics of transcriptional elongation influence many biological activities, such as RNA splicing, polyadenylation, and nuclear export. To quantify the elongation rate, a typical method is to treat cells with drugs that inhibit RNA polymerase II (Pol II) from entering the gene body and then track Pol II using Pro-seq or Gro-seq. However, the downstream data analysis is challenged by the problem of identifying the transition point between the gene regions inhibited by the drug and not, which is necessary to calculate the transcription rate. Although the traditional hidden Markov model (HMM) can be used to solve it, this method is complicated with its hidden variable and many parameters to be estimated. Hence, we developed the R package , which identifies the transition point with a novel least sum of squares (LSS) method and calculates the elongation rate accordingly. In addition, also covers other functions frequently used in transcription dynamic study, including metagene plotting, pause index calculation, gene structure analysis, etc. The effectiveness of this package is proved by its performance on three Pro-seq or Gro-seq datasets, showing higher accuracy than HMM. is freely available at https://github.com/yuabrahamliu/proRate or https://github.com/FADHLyemen/proRate.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12412782PMC
http://dx.doi.org/10.1093/nargab/lqaf123DOI Listing

Publication Analysis

Top Keywords

novel sum
8
sum squares
8
elongation rate
8
pro-seq gro-seq
8
transition point
8
package infer
4
gene
4
infer gene
4
gene transcription
4
transcription rates
4

Similar Publications

: an R package to infer gene transcription rates with a novel least sum of squares method.

NAR Genom Bioinform

September 2025

Department of Internal Medicine, Nephrology Division, University of Michigan, Ann Arbor 48109 MI, United States.

The dynamics of transcriptional elongation influence many biological activities, such as RNA splicing, polyadenylation, and nuclear export. To quantify the elongation rate, a typical method is to treat cells with drugs that inhibit RNA polymerase II (Pol II) from entering the gene body and then track Pol II using Pro-seq or Gro-seq. However, the downstream data analysis is challenged by the problem of identifying the transition point between the gene regions inhibited by the drug and not, which is necessary to calculate the transcription rate.

View Article and Find Full Text PDF

The Three Nightmare Traits (TNT; disagreeableness, carelessness, and dishonesty; aka the Three Nonnormative Traits) represent low levels of socially desirable self-control, i.e., unsuccessful socialization.

View Article and Find Full Text PDF

Mollugin reacts with phenol thiol but not produce modification on cysteine discovered by a phenol thiol probe.

Front Cell Dev Biol

August 2025

Department of Pathogen Biology and Biosecurity, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, China.

Electrophilic compounds from natural products (NPs) and metabolites can covalently modify the cysteines of target proteins to induce biological activities. To facilitate the discovery of novel NPs and metabolites, chemical probes with various thiol groups-mimicking the reactivity of cysteine-have been developed. These probes are designed to react with electrophilic groups of NPs and metabolites in an electrophilic addition mechanism, with the resulting adducts having molecular masses which equal to the sum of the probe and the target compound.

View Article and Find Full Text PDF

Transcription factors (TFs) regulate gene expression by interacting with DNA in a sequence-specific manner. High-throughput in vitro technologies, such as protein-binding microarrays and HT-SELEX (high-throughput systematic evolution of ligands by exponential enrichment), have revealed the DNA-binding specificities of hundreds of TFs. However, they have limited ability to reliably identify lower-affinity DNA binding sites, which are increasingly recognized as important for precise spatiotemporal control of gene expression.

View Article and Find Full Text PDF

Local neighbor Normalization: Reconciling accurate normalization and heterogeneity recovery in large-scale metabolomics.

Anal Chim Acta

October 2025

Department of Electronic Science, National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361005, China. Electronic address:

Background: Metabolomics studies often grapple with the dilution effect, where sample concentrations vary due to inconsistent handling or biological diversity, particularly in samples like urine, saliva, or cell extracts. This variation can mask true metabolic differences, complicating data interpretation. Traditional normalization methods, such as Constant Sum Normalization (CSN), Probabilistic Quotient Normalization (PQN), and Maximal Density Fold Change (MDFC), assume that all samples share a certain invariant statistic and overlook data heterogeneity, potentially erasing the dataset's heterogeneity essential for distinguishing biological subgroups.

View Article and Find Full Text PDF