Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The CRISPR/Cas9 nuclease from Streptococcus pyogenes (SpCas9) can be used with single guide RNAs (sgRNAs) as a sequence-specific antimicrobial agent and as a genome-engineering tool. However, current bacterial sgRNA activity models struggle with accurate predictions and do not generalize well, possibly because the underlying datasets used to train the models do not accurately measure SpCas9/sgRNA activity and cannot distinguish on-target cleavage from toxicity. Here, we solve this problem by using a two-plasmid positive selection system to generate high-quality data that more accurately reports on SpCas9/sgRNA cleavage and that separates activity from toxicity. We develop a machine learning architecture (crisprHAL) that can be trained on existing datasets, that shows marked improvements in sgRNA activity prediction accuracy when transfer learning is used with small amounts of high-quality data, and that can generalize predictions to different bacteria. The crisprHAL model recapitulates known SpCas9/sgRNA-target DNA interactions and provides a pathway to a generalizable sgRNA bacterial activity prediction tool that will enable accurate antimicrobial and genome engineering applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10485023PMC
http://dx.doi.org/10.1038/s41467-023-41143-7DOI Listing

Publication Analysis

Top Keywords

transfer learning
8
learning small
8
sgrna activity
8
high-quality data
8
activity prediction
8
activity
5
generalizable cas9/sgrna
4
cas9/sgrna prediction
4
prediction model
4
model machine
4

Similar Publications

Aim: The purpose of this study was to assess the accuracy of a customized deep learning model based on CNN and U-Net for detecting and segmenting the second mesiobuccal canal (MB2) of maxillary first molar teeth on cone beam computed tomography (CBCT) scans.

Methodology: CBCT scans of 37 patients were imported into 3D slicer software to crop and segment the canals of the mesiobuccal (MB) root of the maxillary first molar. The annotated data were divided into two groups: 80% for training and validation and 20% for testing.

View Article and Find Full Text PDF

Obsessive-compulsive disorder (OCD) is a chronic and disabling condition affecting approximately 3.5% of the global population, with diagnosis on average delayed by 7.1 years or often confounded with other psychiatric disorders.

View Article and Find Full Text PDF

Early prediction of orthodontic gingival enlargement using S100A4: a biomarker-based risk stratification model.

Odontology

September 2025

Department of Periodontics, Saveetha Dental College and Hospital, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamil Nadu, India.

Orthodontic-induced gingival enlargement (OIGE) affects approximately 15-30% of patients undergoing orthodontic treatment and remains largely unpredictable, often relying on subjective clinical assessments made after irreversible tissue changes have occurred. S100A4 is a well-characterized marker of activated fibroblasts involved in pathological tissue remodeling. This was a cross-sectional precision biomarker study that analyzed gingival tissue samples from three groups: healthy controls (n = 60), orthodontic patients without gingival enlargement (n = 31), and patients with clinically diagnosed OIGE (n = 61).

View Article and Find Full Text PDF

Purpose: The study aims to compare the treatment recommendations generated by four leading large language models (LLMs) with those from 21 sarcoma centers' multidisciplinary tumor boards (MTBs) of the sarcoma ring trial in managing complex soft tissue sarcoma (STS) cases.

Methods: We simulated STS-MTBs using four LLMs-Llama 3.2-vison: 90b, Claude 3.

View Article and Find Full Text PDF