Accurate Prediction of CRISPR/Cas13a Guide Activity Using Feature Selection and Deep Learning.

J Chem Inf Model

Research Center for Analytical Sciences, College of Chemistry, Nankai University, Tianjin 300071, China.

Published: April 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

CRISPR/Cas13a serves as a key tool for nucleic acid tests; therefore, accurate prediction of its activity is essential for creating robust and sensitive diagnosis. In this study, we create a dual-branch neural network model that achieves high prediction accuracy and classification performance across two independent CRISPR/Cas13a data sets, outperforming previously published models relying solely on sequence features. The model integrates direct sequence encoding with descriptive features and yields 99 key descriptive features out of 1553, extracted through statistical analysis, which critically influence guide-target interactions and Cas13a guide activity. By employing Shapley Additive Explanations and Integrated Gradients for feature importance analysis, we show that sequence composition, mismatch type and frequency, and the protospacer flanking site region are primary features. These findings underscore the importance of using descriptive features as complementary inputs to deep learning-based encoding and provide valuable insights into the mechanisms underlying guide-target interaction. All in all, this study not only introduces a reliable and efficient model for Cas13a guide activity prediction but also offers a foundation for future rational design efforts.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jcim.4c02438DOI Listing

Publication Analysis

Top Keywords

guide activity
12
descriptive features
12
accurate prediction
8
cas13a guide
8
features
5
prediction crispr/cas13a
4
crispr/cas13a guide
4
activity
4
activity feature
4
feature selection
4

Similar Publications

Background: Stored-product insects (Sitophilus spp., Plodia interpunctella, Sitotroga cerealella) drive substantial postharvest losses and increasingly resist synthetic fumigants. Valeriana wallichii roots yield volatile oils rich in short-chain acids and sesquiterpenes.

View Article and Find Full Text PDF

Seamless integration of active devices into photonic integrated circuits remains a challenge due to the limited accessibility of the optical field in conventional waveguides, which tightly confine light within their cores. In this study, we propose a two-dimensional (2D) ultrathin waveguide as a photonic platform that enables efficient interaction between guided light and surface-mounted devices by supporting optical modes dominated by evanescent fields. We show that the guided light in a monolayer MoS film propagates over millimeter-scale distances with more than 99.

View Article and Find Full Text PDF

Chlorinated hydrocarbons are widely used as solvents and synthetic intermediates, but their chemical persistence can cause hazardous environmental accumulation. Haloalkane dehalogenase from (DhlA) is a bacterial enzyme that naturally converts toxic chloroalkanes into less harmful alcohols. Using a multiscale approach based on the empirical valence bond method, we investigate the catalytic mechanism of 1,2-dichloroethane dehalogenation within DhlA and its mutants.

View Article and Find Full Text PDF

Intraoperative electrocorticography (ECoG) represents a crucial tool for improving seizure outcomes during epilepsy surgeries by assisting in localization of the epileptogenic zones. There is a shortage of information in the literature regarding single-center experiences and long-term outcomes after ECoG-guided surgeries. Data are particularly scarce from the Eastern Mediterranean Region.

View Article and Find Full Text PDF

Deep learning has rapidly emerged as a promising toolkit for protein optimization, yet its success remains limited, particularly in the realm of activity. Moreover, most algorithms lack rigorous iterative evaluation, a crucial aspect of protein engineering exemplified by classical directed evolution. This study introduces DeepDE, a robust iterative deep learning-guided algorithm leveraging triple mutants as building blocks and a compact library of ∼1,000 mutants for training.

View Article and Find Full Text PDF