Background: Simple sequence repeats (SSRs) are widely used as molecular markers; however, traditional SSR marker development relies heavily on experimental methods. Advances in modern sequencing technology make it possible to extract SSR characteristics directly from sequencing data and use them for variety identification.
Results: We developed a computational framework for variety identification that treats the presence or absence of each SSR in sequencing data as a numerical characteristic, ignoring specific loci, flanking sequences, and occurrence counts. Subsequent variety identification therefore does not rely on experimental validation but is performed directly on the numerical characteristic matrix. Using a formula, we measure the variance of these numerical characteristics both within and among varieties and select SSRs that exhibit intra-variety specificity and inter-variety polymorphism, forming a binary (0/1) matrix. We use t-SNE (t-distributed Stochastic Neighbor Embedding) to project the matrix onto a two-dimensional plane and then cluster the individuals with K-means. Comparing the cluster labels with the true labels gives a preliminary assessment of the matrix's classification performance and of its effectiveness in variety detection. Finally, we construct a recognition model based on the SSR matrix and apply it to variety identification. The process is encapsulated in the package SSR_VibraProfiler, which can serve as a tool for constructing an SSR variety DNA fingerprint database. We tested the package on a Rhododendron dataset of 40 individuals from 8 varieties. The accuracy achieved through t-SNE dimensionality reduction and K-means clustering was 100%. Furthermore, leave-one-out validation confirmed the reliability of the method in predicting varieties. The package is freely available at https://github.com/Olcat35412/SSR_VibraProfiler.
Conclusion: We introduced SSR_VibraProfiler, a Python package for distinguishing and predicting individual varieties without a reference genome by extracting SSR numerical characteristics from next-generation sequencing data. This tool will contribute to the development, identification, and protection of new varieties.
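The workflow described in the Results can be illustrated with a small, self-contained sketch. It is not the SSR_VibraProfiler API: all function names are hypothetical, the selection rule (an SSR must be uniformly present or absent within every variety and must differ between at least two varieties) is an assumed stand-in for the paper's unstated formula, and a nearest-neighbour rule on Hamming distance stands in for the recognition model. The t-SNE and K-means steps are omitted to keep the example dependency-free.

```python
from collections import defaultdict


def presence_matrix(ssr_sets, all_ssrs):
    """Build one 0/1 row per individual: 1 if the SSR was observed in its reads."""
    return [[1 if s in found else 0 for s in all_ssrs] for found in ssr_sets]


def select_ssrs(matrix, labels, min_consistency=1.0):
    """Return column indices that are stable within varieties and differ among them.

    Assumed criterion (not the paper's formula): in every variety the SSR is
    present or absent in at least `min_consistency` of the individuals, and the
    majority state is not the same across all varieties.
    """
    groups = defaultdict(list)
    for row, lab in zip(matrix, labels):
        groups[lab].append(row)
    keep = []
    for j in range(len(matrix[0])):
        # Fraction of individuals carrying SSR j, per variety.
        freqs = [sum(r[j] for r in rows) / len(rows) for rows in groups.values()]
        consistent = all(max(f, 1 - f) >= min_consistency for f in freqs)
        polymorphic = len({f >= 0.5 for f in freqs}) == 2
        if consistent and polymorphic:
            keep.append(j)
    return keep


def hamming(a, b):
    """Number of positions at which two 0/1 rows differ."""
    return sum(x != y for x, y in zip(a, b))


def leave_one_out_accuracy(matrix, labels):
    """Predict each individual's variety from its nearest remaining neighbour."""
    correct = 0
    for i, row in enumerate(matrix):
        others = [(hamming(row, r), lab)
                  for k, (r, lab) in enumerate(zip(matrix, labels)) if k != i]
        predicted = min(others, key=lambda t: t[0])[1]
        correct += predicted == labels[i]
    return correct / len(matrix)


# Toy data (invented for illustration): four individuals from two varieties.
all_ssrs = ["AT", "AG", "AAT", "ACG"]
ssr_sets = [{"AT", "AG"}, {"AT", "AAT"}, {"AG", "ACG"}, {"ACG"}]
labels = ["A", "A", "B", "B"]

matrix = presence_matrix(ssr_sets, all_ssrs)
keep = select_ssrs(matrix, labels)
reduced = [[row[j] for j in keep] for row in matrix]
accuracy = leave_one_out_accuracy(reduced, labels)
```

In this toy case only "AT" and "ACG" pass the filter, because "AG" and "AAT" vary inside a variety. For a strict estimate, the SSR selection would be repeated inside each leave-one-out fold so that the held-out individual does not influence which columns are kept.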
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12082954 | PMC |
http://dx.doi.org/10.1186/s13007-025-01380-x | DOI Listing |
Front Immunol
September 2025
Department of Medicine, Division of Hematology, Bioclinicum and Center for Molecular Medicine, Karolinska Institute and Karolinska University Hospital Solna, Stockholm, Sweden.
Background: Metabolic reprogramming is an important hallmark of cervical cancer (CC), and extensive studies have provided important information for translational and clinical oncology. Here we sought to determine metabolic association with molecular aberrations, telomere maintenance and outcomes in CC.
Methods: RNA sequencing data from the TCGA cohort of CC were analyzed for metabolic gene expression profiles, and consensus clustering was then performed to classify tumors into different groups/subtypes.
iScience
September 2025
School of Biology and Biological Engineering, South China University of Technology, Guangzhou, Guangdong 510006, China.
Deep learning has rapidly emerged as a promising toolkit for protein optimization, yet its success remains limited, particularly in the realm of activity. Moreover, most algorithms lack rigorous iterative evaluation, a crucial aspect of protein engineering exemplified by classical directed evolution. This study introduces DeepDE, a robust iterative deep learning-guided algorithm leveraging triple mutants as building blocks and a compact library of ∼1,000 mutants for training.
RSC Med Chem
August 2025
School of Cellular and Molecular Medicine, University of Bristol Bristol BS8 1TD UK
Carbapenemases, β-lactamases hydrolysing carbapenem antibiotics, challenge the treatment of multi-drug resistant bacteria. The OXA-48 carbapenemase is widely disseminated in , necessitating new treatments for producer strains. Diazabicyclooctane (DBO) inhibitors, including avibactam and nacubactam, act on a wide range of enzymes to overcome β-lactamase-mediated resistance.
J Healthc Sci Humanit
January 2024
Assistant Professor & Clinical Coordinator, Health Informatics Program, School of Health Professions, State University of New York Downstate Health Sciences University, 450 Clarkson Avenue, MSC 94, Brooklyn, NY 11203, (718) 270-7738, Fax: (718) 270-7739
COVID-19 variants continue to infect thousands of people even though the end of the pandemic was announced on May 11, 2023. Nextstrain CoVariants (CoVariants) genomic databases provide detailed information about more than 31 variants of COVID-19 viruses that have been identified through genomic sequencing, showing the mutations they carry. Mutated viruses may yield a negative result for a gene target using a PCR test that has a positive COVID-19 test result.
Front Neurol
August 2025
Otolaryngology-Head and Neck Surgery, The Ohio State University Wexner Medical Center, Columbus, OH, United States.
Introduction: External continuous perturbations using a motion platform have been developed by employing either sum-of-sines (SoS) or a pseudorandom ternary sequence (PRTS) of numbers to quantify body sway evoked in the medial-lateral (ML) or anterior-posterior (AP) directions, which ultimately helps understand the human postural control system. These stimuli have been provided via pitch tilts of the motion platform for evaluations of AP balance responses or roll tilts for ML balance responses. However, little is known about whether a healthy postural control system responds to 2-dimensional (2D) perturbations similarly when the perturbation stimuli are provided in semicircular canal coordinates (i.