Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: Due to the increasing availability of high-quality genome sequences, pan-genomes are gradually replacing single consensus reference genomes in many bioinformatics pipelines to better capture genetic diversity. Traditional bioinformatics tools using the FM-index face memory limitations with such large genome collections. Recent advancements in run-length compressed indices like Gagie et al.'s r-index and Nishimoto and Tabei's move structure, alleviate memory constraints but focus primarily on backward search for MEM-finding. Arakawa et al.'s br-index initiates complete approximate pattern matching using bidirectional search in run-length compressed space, but with significant computational overhead due to complex memory access patterns.

Results: We introduce b-move, a novel bidirectional extension of the move structure, enabling fast, cache-efficient, lossless approximate pattern matching in run-length compressed space. It achieves bidirectional character extensions up to 7 times faster than the br-index, closing the performance gap with FM-index-based alternatives. For locating occurrences, b-move performs and operations up to 7 times faster than the br-index. At the same time, it maintains the favorable memory characteristics of the br-index, for example, all available complete E. coli genomes on NCBI's RefSeq collection can be compiled into a b-move index that fits into the RAM of a typical laptop.

Conclusions: b-move proves practical and scalable for pan-genome indexing and querying. We provide a C++ implementation of b-move, supporting efficient lossless approximate pattern matching including locate functionality, available at https://github.com/biointec/b-move under the AGPL-3.0 license.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12345024PMC
http://dx.doi.org/10.1186/s13015-025-00281-xDOI Listing

Publication Analysis

Top Keywords

approximate pattern
16
pattern matching
16
run-length compressed
16
lossless approximate
12
matching run-length
8
move structure
8
compressed space
8
times faster
8
faster br-index
8
b-move
6

Similar Publications

Background And Objectives: Pollen-food allergy syndrome (PFAS) is a frequent comorbidity in individuals with hay fever. Identifying risk factors and allergen clusters can aid targeted interventions and management strategies. Objective: This study characterizes PFAS in patients with hay fever and identifies associated risk factors using the mobile health platform, AllerSearch.

View Article and Find Full Text PDF

Toward universal immunofluorescence normalization for multiplex tissue imaging with UniFORM.

Cell Rep Methods

August 2025

Department of Biomedical Engineering and Computational Biology Program, OHSU, Portland, OR, USA; Knight Cancer Institute, OHSU, Portland, OR, USA. Electronic address:

We present UniFORM, a non-parametric, Python-based pipeline for normalizing multiplex tissue imaging (MTI) data at both the feature and pixel levels. UniFORM employs an automated rigid landmark registration method tailored to the distributional characteristics of MTI, with UniFORM operating without prior distributional assumptions and handling both unimodal and bimodal patterns. By aligning the biologically invariant negative populations, UniFORM removes technical variation while preserving tissue-specific expression patterns in positive populations.

View Article and Find Full Text PDF

Discontinuing reinforcement for an operant behavior sometimes produces a transient increase in responding (i.e., an extinction burst).

View Article and Find Full Text PDF

Measurement appropriateness concerns the question of whether the test or survey scale under consideration can provide a valid measure for a specific individual. An aberrant item response pattern would provide internal counterevidence against using the test/scale for this person, whereas a more typical item response pattern would imply a fit of the measure to the person. Traditional approaches, including the popular Lz person fit statistic, are hampered by their two-stage estimation procedure and the fact that the fit for the person is determined based on the model calibrated on data that include the misfitting persons.

View Article and Find Full Text PDF

Objective: Researchers have differentiated forms (overt, relational) and functions (proactive, reactive) of aggressive behavior; however, the assessment options for measuring these constructs in youth remain limited. This study examined the parent-report Peer Conflict Scale (PCS) for measuring forms and functions of youth aggressive behavior in English and Spanish, including short- and long-form versions.

Method: Participants were caregivers of 653 youths (ages 6-17; 57% male; 48% Hispanic) throughout North America.

View Article and Find Full Text PDF