98%
921
2 minutes
20
Despite the availability of various sequence analysis models, comparative genomic analysis remains a challenge in genomics, genetics, and phylogenetics. Commutative algebra, a fundamental tool in algebraic geometry and number theory, has rarely been used in data and biological sciences. In this study, we introduce commutative algebra k-mer learning (CAKL) as the first-ever nonlinear algebraic framework for analyzing genomic sequences. CAKL bridges between commutative algebra, algebraic topology, combinatorics, and machine learning to establish a new mathematical paradigm for comparative genomic analysis. We evaluate its effectiveness on three tasks-genetic variant identification, phylogenetic tree analysis, and viral genome classification-typically requiring alignment-based, alignment-free, and machine-learning approaches, respectively. Across eleven datasets, CAKL outperforms five state-of-the-art sequence analysis methods, particularly in viral classification, and maintains stable predictive accuracy as dataset size increases, underscoring its scalability and robustness. This work ushers in a new era in commutative algebraic data analysis and learning.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12364056 | PMC |
ArXiv
August 2025
Department of Mathematics, Michigan State University, MI 48824, USA.
Despite the availability of various sequence analysis models, comparative genomic analysis remains a challenge in genomics, genetics, and phylogenetics. Commutative algebra, a fundamental tool in algebraic geometry and number theory, has rarely been used in data and biological sciences. In this study, we introduce commutative algebra k-mer learning (CAKL) as the first-ever nonlinear algebraic framework for analyzing genomic sequences.
View Article and Find Full Text PDFMath Med Biol
August 2025
Bioinformatic Department, Insilico Medicine Hong Kong Ltd., Unit 310, 3/F, Building 8W, Phase 2, Science Park, Pak Shek Kok, New Territories, Hong-Kong, Hong-Kong.
We consider a reaction network of the Wnt pathway endowed with mass-action kinetics. Using concepts in the theory of robustness and stability within Chemical Reaction Network Theory and advances in decomposing reaction networks, we perform a systematic analysis of the structural, structo-kinetic and kinetic properties of this pathway. We show that the network can be systematically decomposed into a set of subnetworks and we use elements matrix theory to study their stability properties.
View Article and Find Full Text PDFJ Phys Chem A
July 2025
School of Chemistry and Biological Engineering, University of Science and Technology Beijing, Beijing 100083, China.
The unitary coupled-cluster (UCC)-based polarization propagator theory (PPT) is a novel Hermitian quantum chemical method for calculating excited states. This study benchmarks vertical excitation energies (VEEs) for medium-sized molecules using two practical schemes for UCC-PPT: UCC3 and quadratic UCCSD (qUCCSD). Their performance is evaluated and compared with the equation-of-motion coupled-cluster singles and doubles (EOM-CCSD) method and the algebraic construction (ADC) family of methods (ADC(2) and ADC(3)).
View Article and Find Full Text PDFPLoS One
June 2025
College of Mathematics and Statistics, Kashi University, Kashi, Xinjiang, China.
This paper explores the introduction of group structures within type theory, drawing from the algebraic theory proposed by Roy L. Crole. We define types with group structures and demonstrate that models of these types in categories with finite products can be interpreted as group objects.
View Article and Find Full Text PDFJ Chem Inf Model
July 2025
Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States.
Recently, Suwayyid and Wei introduced commutative algebra as an emerging paradigm for machine learning and data science. In this work, we propose commutative algebra machine learning (CAML) for the prediction of protein-ligand binding affinities. Specifically, we apply persistent Stanley-Reisner theory, a key concept in combinatorial commutative algebra, to the affinity predictions of protein-ligand binding and metalloprotein-ligand binding.
View Article and Find Full Text PDF