Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Human microbiome contains various microbial macromolecules with important biological functions. The Hidden Markov Models (HMMs) can overcome the problem of low similarity sequences with distant relationships and are widely implemented within various sequence alignment softwares. However, the HMM-based sequence alignments can generate a large number of results, how to quickly screen and batch extract target homologs from microbiomes is the major sticking points. It is necessary to develop an integrated gene filter and extraction pipeline to quickly and accurately screen homologs. Here, we introduced the HMMER-Extractor for amino acids or nucleotide sequences extraction, which was a supporting toolkit through provided filtering scores and an iterative keyword matching (IKM) logic. To make it more user-friendly and accessible, we further presented a visualized web server platform. An interactive HTML output provided a user-friendly way to browse homologous annotations and sequence extraction. The web server provided the community with a streamlined and user-friendly interface to analyze microbiomes. Through the HMMER-Extractor, we constructed a cardiovascular disease related gene dataset of the macromolecular metabolite trimethylamine (TMA) and lipopolysaccharide (LPS) based on 46,699 bacterial genomes from human gut. Approximately 21,014 and 1961 bacterial strains were identified to contain the cnt or cut operon of TMA, and the waa gene cluster of LPS, respectively. The Escherichia coli occupied the largest proportion among all the bacterial species, which belonged to the phyla Firmicutes. The HMMER-Extractor toolkit is an integrated pipeline and has been proven to be accurate and fast in extracting target macromolecular encoding genes from microbial genomes.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.ijbiomac.2024.137666DOI Listing

Publication Analysis

Top Keywords

hidden markov
8
markov models
8
web server
8
hmmer-extractor
4
hmmer-extractor auxiliary
4
auxiliary toolkit
4
toolkit identifying
4
identifying genomic
4
genomic macromolecular
4
macromolecular metabolites
4

Similar Publications

In this paper, we study the impact of momentum, volume and investor sentiment on U.S. tech sector stock returns using Principal Component Analysis-Hidden Markov Model (PCA-HMM) methodology.

View Article and Find Full Text PDF

Investigation of the fundamental microscopic processes occurring in organic reactions is essential for optimising both organocatalysts and synthetic strategies. In this study, single-molecule fluorescence microscopy was employed to study the Diels-Alder reaction catalysed by a first-generation MacMillan catalyst, providing direct insights into its kinetic dynamics. This reaction proceeds via a series of reversible processes under equilibrium conditions (S ⇄ IM ⇄ IM → P, IM and IM: N,O-acetal and iminium ion intermediates, respectively).

View Article and Find Full Text PDF

Analyzing the spontaneous activity of the human brain using dynamic approaches can reveal functional organizations. The co-activation pattern (CAP) analysis of signals from different brain regions is used to characterize brain neural networks that may serve specialized functions. However, CAP is based on spatial information but ignores temporal reproducible transition patterns, and lacks robustness to low signal-to-noise rate (SNR) data.

View Article and Find Full Text PDF

: an R package to infer gene transcription rates with a novel least sum of squares method.

NAR Genom Bioinform

September 2025

Department of Internal Medicine, Nephrology Division, University of Michigan, Ann Arbor 48109 MI, United States.

The dynamics of transcriptional elongation influence many biological activities, such as RNA splicing, polyadenylation, and nuclear export. To quantify the elongation rate, a typical method is to treat cells with drugs that inhibit RNA polymerase II (Pol II) from entering the gene body and then track Pol II using Pro-seq or Gro-seq. However, the downstream data analysis is challenged by the problem of identifying the transition point between the gene regions inhibited by the drug and not, which is necessary to calculate the transcription rate.

View Article and Find Full Text PDF

The sleep-wake cycle plays an important and far-reaching role in health. By utilizing personal physical activity monitors (PAMs), inferences about the sleep-wake cycle can be made. Hidden Markov models (HMMs) have been applied in this area as an accurate unsupervised approach.

View Article and Find Full Text PDF