CasCollect: targeted assembly of CRISPR-associated operons from high-throughput sequencing data.

NAR Genom Bioinform

Systems Biology, Sandia National Laboratories, Livermore, CA 94550, USA.

Published: September 2020


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

CRISPR arrays and CRISPR-associated (Cas) proteins comprise a widespread adaptive immune system in bacteria and archaea. These systems function as a defense against exogenous parasitic mobile genetic elements that include bacteriophages, plasmids and foreign nucleic acids. With the continuous spread of antibiotic resistance, knowledge of pathogen susceptibility to bacteriophage therapy is becoming more critical. Additionally, gene-editing applications would benefit from the discovery of new genes with favorable properties. While next-generation sequencing has produced staggering quantities of data, transitioning from raw sequencing reads to the identification of CRISPR/Cas systems has remained challenging. This is especially true for metagenomic data, which has the highest potential for identifying novel genes. We report a comprehensive computational pipeline, CasCollect, for the targeted assembly and annotation of genes and CRISPR arrays-even isolated arrays-from raw sequencing reads. Benchmarking our targeted assembly pipeline demonstrates significantly improved timing by almost two orders of magnitude compared with conventional assembly and annotation, while retaining the ability to detect CRISPR arrays and genes. CasCollect is a highly versatile pipeline and can be used for targeted assembly of any specialty gene set, reconfigurable for user provided Hidden Markov Models and/or reference nucleotide sequences.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7671303PMC
http://dx.doi.org/10.1093/nargab/lqaa063DOI Listing

Publication Analysis

Top Keywords

targeted assembly
16
cascollect targeted
8
crispr arrays
8
raw sequencing
8
sequencing reads
8
assembly annotation
8
assembly
5
assembly crispr-associated
4
crispr-associated operons
4
operons high-throughput
4

Similar Publications

EASY-edit: a toolbox for high-throughput single-step custom genetic editing in bacteria.

Nucleic Acids Res

September 2025

Expression génétique microbienne, UMR8261 CNRS, Université Paris Cité, Institut de Biologie Physico-Chimique, Paris 75005, France.

Targeted gene editing can be achieved using CRISPR-Cas9-assisted recombineering. However, high-efficiency editing requires careful optimization for each locus to be modified, which can be tedious and time-consuming. In this work, we developed a simple, fast and cheap method: Engineered Assembly of SYnthetic operons for targeted editing (EASY-edit) in Escherichia coli.

View Article and Find Full Text PDF

Crystal structures of distinct parallel and antiparallel DNA G-quadruplexes reveal structural polymorphism in C9orf72 G4C2 repeats.

Nucleic Acids Res

September 2025

State Key Laboratory of Vaccines for Infectious Diseases, School of Public Health, Xiamen University, Xiamen 361102, Fujian, China.

The abnormal expansion of GGGGCC (G4C2) repeats in the noncoding region of the C9orf72 gene is a major genetic cause of two devastating neurodegenerative disorders, amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). These G4C2 repeats are known to form G-quadruplex (G4) structures, which are hypothesized to contribute to disease pathogenesis. Here, we demonstrated that four DNA G4C2 repeats can fold into two structurally distinct G4 conformations: a parallel and an antiparallel topology.

View Article and Find Full Text PDF

Targeting the IRS1 macromolecular signaling node by Trienomycin a triggers cytoprotective autophagy in pancreatic adenocarcinoma.

Int J Biol Macromol

September 2025

Shaanxi Key Laboratory of Natural Products and Chemical Biology, College of Chemistry and Pharmacy, Northwest A&F University, Xianyang, China. Electronic address:

Pancreatic adenocarcinoma (PAAD) lacks effective therapies due to complex macromolecular signaling networks. Here, we identified the natural compound Trienomycin A (TA) as a potent binder and degrader of the key signaling adaptor protein Insulin Receptor Substrate 1 (IRS1), disrupting its macromolecular assembly in insulin-like growth pathways. Through integrated biochemical, cellular, and in vivo analyses, we demonstrated that TA directly bound the phosphotyrosine-binding (PTB) domain of IRS1, inducing proteasomal degradation of this critical macromolecular hub mediated by the E3 ubiquitin ligase FBXW8.

View Article and Find Full Text PDF

Oral nanoformulation of a host-directed antiviral niclosamide effectively treats severe fever with thrombocytopenia syndrome.

Biomed Pharmacother

September 2025

Department of Biomedical Sciences and Institute for Medical Science, Jeonbuk National University Medical School, Jeonju, Jeonbuk 54907, South Korea. Electronic address:

Severe fever with thrombocytopenia syndrome (SFTS), caused by the tick-borne Dabie bandavirus (DBV), is a serious public health concern due to its high morbidity and mortality rates. However, no antiviral treatment has been developed for SFTS. Through target-focused screening, we identified five anti-SFTS candidates: niclosamide (NIC), cepharanthine, nifedipine, zanamivir, and ivacaftor.

View Article and Find Full Text PDF

Targeting NLRP3 inflammasome with curcumin: mechanisms and therapeutic promise in chronic inflammation.

Inflammopharmacology

September 2025

Centre for Research Impact & Outcome, Chitkara College of Pharmacy, Chitkara University, Rajpura, Punjab, 140401, India.

The NOD‑like receptor family pyrin domain containing 3 (NLRP3) inflammasome is a key molecular complex that amplifies inflammatory cascades by maturing interleukin‑1 beta (IL-1β) and interleukin‑18 (IL-18) and inducing pyroptosis. It serves as a major driver and co-driver of numerous diseases associated with chronic inflammation. Dysregulated NLRP3 activation contributes to the progression of disorders such as rheumatoid arthritis, inflammatory bowel disease, neurodegenerative diseases and atherosclerosis.

View Article and Find Full Text PDF