Serial crystallography with multi-stage merging of thousands of images.

Alexei S Soares , Yusuke Yamada , Jean Jakoncic , Sean McSweeney , Robert M Sweet , John Skinner , James Foadi , Martin R Fuchs , Dieter K Schneider , Wuxian Shi , Babak Andi , Lawrence C Andrews , Herbert J Bernstein

Acta Crystallogr F Struct Biol Commun

Ronin Institute for Independent Scholarship, c/o National Synchrotron Light Source II, Building 745, Brookhaven National Laboratory, Upton, New York, USA.

Published: July 2022

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

KAMO and BLEND provide particularly effective tools to automatically manage the merging of large numbers of data sets from serial crystallography. The requirement for manual intervention in the process can be reduced by extending BLEND to support additional clustering options such as the use of more accurate cell distance metrics and the use of reflection-intensity correlation coefficients to infer `distances' among sets of reflections. This increases the sensitivity to differences in unit-cell parameters and allows clustering to assemble nearly complete data sets on the basis of intensity or amplitude differences. If the data sets are already sufficiently complete to permit it, one applies KAMO once and clusters the data using intensities only. When starting from incomplete data sets, one applies KAMO twice, first using unit-cell parameters. In this step, either the simple cell vector distance of the original BLEND or the more sensitive NCDist is used. This step tends to find clusters of sufficient size such that, when merged, each cluster is sufficiently complete to allow reflection intensities or amplitudes to be compared. One then uses KAMO again using the correlation between reflections with a common hkl to merge clusters in a way that is sensitive to structural differences that may not have perturbed the unit-cell parameters sufficiently to make meaningful clusters. Many groups have developed effective clustering algorithms that use a measurable physical parameter from each diffraction still or wedge to cluster the data into categories which then can be merged, one hopes, to yield the electron density from a single protein form. Since these physical parameters are often largely independent of one another, it should be possible to greatly improve the efficacy of data-clustering software by using a multi-stage partitioning strategy. Here, one possible approach to multi-stage data clustering is demonstrated. The strategy is to use unit-cell clustering until the merged data are sufficiently complete and then to use intensity-based clustering. Using this strategy, it is demonstrated that it is possible to accurately cluster data sets from crystals that have subtle differences.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9254899	PMC
http://dx.doi.org/10.1107/S2053230X22006422	DOI Listing

Publication Analysis

Top Keywords

data sets

unit-cell parameters

data

serial crystallography

applies kamo

cluster data

sets

clustering

crystallography multi-stage

multi-stage merging

Similar Publications

Physicochemical Property Models for Poly- and Perfluorinated Alkyl Substances and Other Chemical Classes.

J Chem Inf Model

September 2025

United States Environmental Protection Agency, Center for Computational Toxicology and Exposure, 109 TW Alexander Dr., Research Triangle Park, North Carolina 27711, United States.

Todd M Martin , Landon R Batts , Nathaniel Charest , Charles N Lowe , Gabriel Sinclair

To assess environmental fate, transport, and exposure for PFAS (per- and polyfluoroalkyl substances), predictive models are needed to fill experimental data gaps for physicochemical properties. In this work, quantitative structure-property relationship (QSPR) models for octanol-water partition coefficient, water solubility, vapor pressure, boiling point, melting point, and Henry's law constant are presented. Over 200,000 experimental property value records were extracted from publicly available data sources.

View Article and Find Full Text PDF

Similar Publications

Spatial variation of infectious virus load in aggregated day 3 post-inoculation respiratory tract tissues from influenza A virus-infected ferrets.

mSphere

September 2025

Influenza Division, Centers for Disease Control and Prevention, Atlanta, Georgia, USA.

Troy J Kieran , Xiangjie Sun , Terrence M Tumpey , Taronna R Maines , Jessica A Belser

The ferret model is widely used to study influenza A viruses (IAVs) isolated from multiple avian and mammalian species, as IAVs typically replicate in the respiratory tract of ferrets without the need for prior host adaptation. During standard IAV risk assessments, tissues are routinely collected from ferrets at a fixed time point post-inoculation to assess the capacity for systemic spread. Here, we describe a data set of virus titers in tissues collected from both respiratory tract and extrapulmonary sites 3 days post-inoculation from over 300 ferrets inoculated with more than 100 unique IAVs (inclusive of H1, H2, H3, H5, H7, and H9 IAV subtypes, both mammalian and zoonotic origin).

View Article and Find Full Text PDF

Similar Publications

A cross-disciplinary hands-on genomics curriculum adaptable for high school to undergraduate education.

J Microbiol Biol Educ

September 2025

University of California Riverside, Riverside, California, USA.

Sophie Zaaijer , Simon C Groen

DNA literacy is becoming increasingly essential for navigating healthcare, understanding pandemics, and engaging with biotechnology-yet genomics education remains limited at the secondary level of education. We present a modular, hands-on curriculum designed for high school and early undergraduate students (ages 14-21) that introduces key genomics concepts through an experiment on fermentation, a process that is key to food preservation and medicine. Students follow a complete scientific process: exploring what DNA is and how microbial succession works, analyzing real DNA sequencing data, and writing a formal scientific report.

View Article and Find Full Text PDF

Similar Publications

Transcriptomic data sets for serovar Typhimurium 14028S that survived ingestion by , hydrogen peroxide treatment, or starvation.

Microbiol Resour Announc

September 2025

Research Department for Limnology, Universität Innsbruck, Mondsee, Austria.

Alexander Balkin , Andrey Plotnikov , Tatiana Konnova , Elena Shagimardanova , Yuri Gogolev

is a pathogenic bacterium that can survive in hostile environments and inside heterotrophic protozoan cells. Here, we present transcriptomic data for grown in a rich medium, cultured under starvation conditions, treated with hydrogen peroxide, and extracted from cells after 8 and 15 h of infection.

View Article and Find Full Text PDF

Similar Publications

The Impact of Mini-Screws and Micro-Implants on Orthodontic Clinical Outcomes: An Umbrella Meta-Analysis.

Clin Exp Dent Res

October 2025

Drug Applied Research Center, Tabriz University of Medical Sciences, Tabriz, Iran.

Abdolreza Jamilian , Helen Jamloo , Kurosh Majidi , Meysam Zarezadeh

Objectives: This umbrella meta-analysis aimed to answer the clinical question: Do mini-screws and micro-implants improve specific orthodontic outcomes such as intermolar width, interpremolar width, suture expansion, molar movement, and skeletal width compared to conventional anchorage methods?

Materials And Methods: A systematic search was performed in PubMed, Scopus, ISI Web of Science, and Google Scholar up to October 2024. Systematic reviews and meta-analyses on mini-screws and micro-implants in orthodontic treatment were included. Methodological quality was assessed using AMSTAR 2, and a random-effects model was used to calculate effect sizes (ESs) and 95% confidence intervals (CIs).

View Article and Find Full Text PDF

Similar Publications