Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Characterizing genotype-phenotype relationships of biomolecules (e.g. ribozymes) requires accurate ways to measure activity for a large set of molecules. Kinetic measurement using high-throughput sequencing (e.g. k-Seq) is an emerging assay applicable in various domains that potentially scales up measurement throughput to over 106 unique nucleic acid sequences. However, maximizing the return of such assays requires understanding the technical challenges introduced by sequence heterogeneity and DNA sequencing. We characterized the k-Seq method in terms of model identifiability, effects of sequencing error, accuracy and precision using simulated datasets and experimental data from a variant pool constructed from previously identified ribozymes. Relative abundance, kinetic coefficients, and measurement noise were found to affect the measurement of each sequence. We introduced bootstrapping to robustly quantify the uncertainty in estimating model parameters and proposed interpretable metrics to quantify model identifiability. These efforts enabled the rigorous reporting of data quality for individual sequences in k-Seq experiments. Here we present detailed protocols, define critical experimental factors, and identify general guidelines to maximize the number of sequences and their measurement accuracy from k-Seq data. Analogous practices could be applied to improve the rigor of other sequencing-based assays.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8559535PMC
http://dx.doi.org/10.1093/nar/gkab199DOI Listing

Publication Analysis

Top Keywords

model identifiability
8
measurement
5
kinetic sequencing
4
k-seq
4
sequencing k-seq
4
k-seq massively
4
massively parallel
4
parallel assay
4
assay ribozyme
4
ribozyme kinetics
4

Similar Publications

Summary: Dynamic models represent a powerful tool for studying complex biological processes, ranging from cell signalling to cell differentiation. Building such models often requires computationally demanding modelling workflows, such as model exploration and parameter estimation. We developed two Julia-based tools: SBMLImporter.

View Article and Find Full Text PDF

Globally rising cases of malaria have prompted concentrated efforts to control malaria transmission, utilising various mathematical models to support the Roll Back Malaria agenda. Many existing models with their specific modifications exhibit rigidity, limiting their application to inform malaria control interventions. This study addresses this limitation by employing a reduction technique on a comprehensive malaria control model to derive a simplified system that preserves the essential dynamics of the original system.

View Article and Find Full Text PDF

Factor analysis provides a canonical framework for imposing lower-dimensional structure such as sparse covariance in high-dimensional data. High-dimensional data on the same set of variables are often collected under different conditions, for instance in reproducing studies across research groups. In such cases, it is natural to seek to learn the shared versus condition-specific structure.

View Article and Find Full Text PDF

Understanding stroke mortality trends in Spain: A comprehensive age-period-cohort and joinpoint analysis.

Neurologia (Engl Ed)

September 2025

Unit of Public Health, Prevention and Health Promotion, South Seville Health Management Area, Seville, Spain. Electronic address:

Background: This study aims to update and evaluate the age-period-cohort (A-P-C) effects on stroke mortality in Spain over the period 1982-2021.

Methods: Data on stroke mortality and population by age and sex were obtained from the database of the National Institute of Statistics for the years 1982-2021. Joinpoint trend analysis software from the US National Cancer Institute was used to estimate the rates and their time trends.

View Article and Find Full Text PDF

Digital twins - personalized, data-driven computational models - are emerging as a powerful paradigm for representing and predicting disease trajectories at the individual level. These models have the potential to support diagnosis, monitor disease evolution, and evaluate therapeutic interventions in virtual settings in the context of clinical trials and patient care. Rigorous model assessment is thus critical for its implementation, but medical data are often sparse, noisy, and vary significantly across individuals, making it challenging to determine whether a digital twin optimized on such data is valid.

View Article and Find Full Text PDF