CleanBSequences: an efficient curator of biological sequences in R.

Mol Genet Genomics

Instituto de Investigaciones en Ciencias Agrarias de Rosario (IICAR) (CONICET-UNR), Zavalla, Argentina.

Published: July 2020


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

This work presents a new method and tool to solve a common problem of molecular biologists and geneticists who use molecular markers in their scientific research and developments: curation of sequences. Omic studies conducted by molecular biologists and geneticists usually involve the use of molecular markers. AFLP, cDNA-AFLP, and MSAP are examples of markers that render information at the genomics, transcriptomics, and epigenomics levels, respectively. These three types of molecular markers use adaptors that are the template for PCR amplification. The sequences of the adaptors have to be eliminated for the analysis of the results. Since a large number of sequences are usually obtained in these studies, this clean-up of the data could demand long time and work. To automate this work, an R package, named CleanBSequences, was created that allows the sequences to be curated massively, quickly, without errors and can be used offline. The curating is performed by aligning the forward and/or reverse primers or ends of cloning vectors with the sequences to be removed. After the alignment, new subsequences are generated without biological fragments not desired by the user, i.e., sequences needed by the techniques. In conclusion, the CleanBSequences tool facilitates the work of researchers, reducing time, effort, and working errors. Therefore, the present tool would respond to the problems related to the curation of sequences obtained from the use of some types of molecular markers. In addition to the above, being an open source, CleanBSequences is a flexible tool that has the potential to be used in future improvements to respond to new problems.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s00438-020-01671-zDOI Listing

Publication Analysis

Top Keywords

molecular markers
16
sequences
8
molecular biologists
8
biologists geneticists
8
curation sequences
8
types molecular
8
respond problems
8
molecular
6
markers
5
cleanbsequences
4

Similar Publications

Cognitive impairment and dementia, including Alzheimer's disease (AD), pose a global health crisis, necessitating non-invasive biomarkers for early detection. This review highlights the retina, an accessible extension of the central nervous system (CNS), as a window to cerebral pathology through structural, functional, and molecular alterations. By synthesizing interdisciplinary evidence, we identify retinal biomarkers as promising tools for early diagnosis and risk stratification.

View Article and Find Full Text PDF

Neuroimaging Data Informed Mood and Psychosis Diagnosis Using an Ensemble Deep Multimodal Framework.

Hum Brain Mapp

September 2025

Tri-Institutional Center for Translational Research in Neuroimaging and Data Science (TReNDS), Georgia State University, Georgia Institute of Technology, and Emory University, Atlanta, Georgia, USA.

Investigating neuroimaging data to identify brain-based markers of mental illnesses has gained significant attention. Nevertheless, these endeavors encounter challenges arising from a reliance on symptoms and self-report assessments in making an initial diagnosis. The absence of biological data to delineate nosological categories hinders the provision of additional neurobiological insights into these disorders.

View Article and Find Full Text PDF

The effect of non-functionalized polystyrene nanoparticles (PS-NPs) with diameters of 29, 44, and 72 nm on plasmid DNA integrity and the expression of genes involved in the architecture of chromatin was investigated in human peripheral blood mononuclear cells (PBMCs). The cells were incubated with PS-NPs at concentrations ranging from 0.001 to 100 µg/mL for 24 hours.

View Article and Find Full Text PDF

Background: High-density lipoprotein (HDL) function, rather than its concentration, plays a crucial role in the development of coronary artery disease (CAD). Diminished HDL antioxidant properties, indicated by elevated oxidized HDL (nHDL) and diminished paraoxonase-1 (PON-1) activity, may contribute to vascular dysfunction and inflammation. Data on these associations in CAD patients, including acute coronary syndrome (ACS), remain limited.

View Article and Find Full Text PDF

De novo assembled nuclear, chloroplast and mitochondrial genomes show high intraspecific variation in the tropical rainforest species Symphonia globulifera.

G3 (Bethesda)

September 2025

INRAE, UR629 URFM, Ecologie des Forêts Méditerranéennes, Site Agroparc, Domaine Saint Paul, F-84914 Avignon Cedex 9, France.

Symphonia globulifera (Clusiaceae) has emerged as a model organism in tropical forest ecology and evolution due to its significant ecological role and complex biogeographical history. Originating from Africa, this species has independently colonized Caribbean, Central and South America three times, becoming a key component of tropical ecosystems across these regions. Despite the ecological importance of S.

View Article and Find Full Text PDF