Statistical modeling of STR capillary electrophoresis signal.

BMC Bioinformatics

Center for Computational and Integrative Biology, Rutgers University, Camden, 08102, NJ, USA.

Published: December 2019


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: In order to isolate an individual's genotype from a sample of biological material, most laboratories use PCR and Capillary Electrophoresis (CE) to construct a genetic profile based on polymorphic loci known as Short Tandem Repeats (STRs). The resulting profile consists of CE signal which contains information about the length and number of STR units amplified. For samples collected from the environment, interpretation of the signal can be challenging given that information regarding the quality and quantity of the DNA is often limited. The signal can be further compounded by the presence of noise and PCR artifacts such as stutter which can mask or mimic biological alleles. Because manual interpretation methods cannot comprehensively account for such nuances, it would be valuable to develop a signal model that can effectively characterize the various components of STR signal independent of a priori knowledge of the quantity or quality of DNA.

Results: First, we seek to mathematically characterize the quality of the profile by measuring changes in the signal with respect to amplicon size. Next, we examine the noise, allele, and stutter components of the signal and develop distinct models for each. Using cross-validation and model selection, we identify a model that can be effectively utilized for downstream interpretation. Finally, we show an implementation of the model in NOCIt, a software system that calculates the a posteriori probability distribution on the number of contributors.

Conclusion: The model was selected using a large, diverse set of DNA samples obtained from 144 different laboratory conditions; with DNA amounts ranging from a single copy of DNA to hundreds of copies, and the quality of the profiles ranging from pristine to highly degraded. Implemented in NOCIt, the model enables a probabilisitc approach to estimating the number of contributors to complex, environmental samples.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6886162PMC
http://dx.doi.org/10.1186/s12859-019-3074-0DOI Listing

Publication Analysis

Top Keywords

capillary electrophoresis
8
signal
8
model effectively
8
model
6
statistical modeling
4
modeling str
4
str capillary
4
electrophoresis signal
4
signal background
4
background order
4

Similar Publications

Background: Light chain multiple myeloma (LCMM) is a malignant hematological disease characterized by bone marrow infiltration by tumor plasma cells and the secretion of monoclonal free light chains (κ or λ). It is often di-agnosed through hypogammaglobulinemia detected by serum protein electrophoresis, followed by immunotyping showing a monoclonal band in free light chains. However, the structure of monoclonal light chains can sometimes complicate laboratory findings.

View Article and Find Full Text PDF

The objective of this study was to evaluate the concentration and integrity index of circulating cell-free DNA (ccf-DNA) as biomarkers for the detection and monitoring of minimal residual disease (MRD) in pediatric patients with B-cell acute lymphoblastic leukemia (B-ALL). Comparison with a validated methodology for the quantification of monoclonal rearrangements of the IGH gene was made. Peripheral blood and bone marrow samples were collected from 10 pediatric patients with B-ALL at diagnosis, remission, and maintenance phases.

View Article and Find Full Text PDF

Massively parallel sequencing (MPS) has caused a paradigm shift in forensic DNA analysis by enabling simultaneous examination of multiple genetic markers with higher resolution. Despite its growing importance, adoption in the 11 Southeast Asian countries remains limited. This paper reviews MPS implementation in forensic DNA laboratories across the region and discusses key adoption challenges.

View Article and Find Full Text PDF

Study of Bacteriostasis of Kaempferide on Foodborne Pathogenic Bacteria by Indirect Determination of Capillary Electrophoresis.

Electrophoresis

September 2025

Ministry of Education Key Laboratory for Analytical Science of Food Safety and Biology, and Fujian Provincial Key Laboratory of Analysis and Detection Technology for Food Safety, College of Chemistry, Fuzhou University, Fuzhou, People's Republic of China.

Foodborne pathogenic bacteria always threaten human health. Flavonoids are commonly used in antibacterial applications. Studying the antibacterial effect of flavonoids on bacteria is significant.

View Article and Find Full Text PDF

The Hidden Influence: Impacts of Residual Dimethylformamide in NDSB-211 on icIEF Separation for Monoclonal Antibodies.

Electrophoresis

September 2025

Therapeutics Development and Supply-Analytical Development, Janssen Research & Development, LLC, Malvern, Pennsylvania, USA.

Monoclonal antibodies (mAbs) present analytical challenges due to their inherent heterogeneity and susceptibility to post-translational modifications (PTMs) during production and storage. Monitoring of charge heterogeneity profiles by imaged capillary isoelectric focusing (icIEF) has been aided by the use of non-detergent sulfobetaines (NDSBs), particularly NDSB-211, to enhance protein solubility and stability. When used in a quality control laboratory setting, NDSB-211 has shown performance variability over time due to residual manufacturing impurities that impact the capillary isoelectric focusing separation.

View Article and Find Full Text PDF