98%
921
2 minutes
20
Background: We developed a novel approach to minimize batch effects when assigning samples to batches. Our algorithm selects a batch allocation, among all possible ways of assigning samples to batches, that minimizes differences in average propensity score between batches. This strategy was compared to randomization and stratified randomization in a case-control study (30 per group) with a covariate (case vs control, represented as β1, set to be null) and two biologically relevant confounding variables (age, represented as β2, and hemoglobin A1c (HbA1c), represented as β3). Gene expression values were obtained from a publicly available dataset of expression data obtained from pancreas islet cells. Batch effects were simulated as twice the median biological variation across the gene expression dataset and were added to the publicly available dataset to simulate a batch effect condition. Bias was calculated as the absolute difference between observed betas under the batch allocation strategies and the true beta (no batch effects). Bias was also evaluated after adjustment for batch effects using ComBat as well as a linear regression model. In order to understand performance of our optimal allocation strategy under the alternative hypothesis, we also evaluated bias at a single gene associated with both age and HbA1c levels in the 'true' dataset (CAPN13 gene).
Results: Pre-batch correction, under the null hypothesis (β1), maximum absolute bias and root mean square (RMS) of maximum absolute bias, were minimized using the optimal allocation strategy. Under the alternative hypothesis (β2 and β3 for the CAPN13 gene), maximum absolute bias and RMS of maximum absolute bias were also consistently lower using the optimal allocation strategy. ComBat and the regression batch adjustment methods performed well as the bias estimates moved towards the true values in all conditions under both the null and alternative hypotheses. Although the differences between methods were less pronounced following batch correction, estimates of bias (average and RMS) were consistently lower using the optimal allocation strategy under both the null and alternative hypotheses.
Conclusions: Our algorithm provides an extremely flexible and effective method for assigning samples to batches by exploiting knowledge of covariates prior to sample allocation.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9990331 | PMC |
http://dx.doi.org/10.1186/s12859-023-05202-6 | DOI Listing |
IEEE Internet Things J
August 2025
Geometric Media Lab, School of Arts, Media and Engineering and School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe, AZ 85281 USA.
Human gait analysis with wearable sensors has been widely used in various applications, such as daily life healthcare, rehabilitation, physical therapy, and clinical diagnostics and monitoring. In particular, ground reaction force (GRF) provides critical information about how the body interacts with the ground during locomotion. Although instrumented treadmills have been widely used as the gold standard for measuring GRF during walking, their lack of portability and high cost make them impractical for many applications.
View Article and Find Full Text PDFMediators Inflamm
September 2025
Shengli Clinical Medical College of Fujian Medical University, Fuzhou 350001, Fujian, China.
Osteoporosis is a prevalent metabolic bone disorder with complex molecular underpinnings. Emerging evidence implicates endoplasmic reticulum stress (ERS) in its pathogenesis; however, systematic exploration of ERS-related genes (ERSRGs) remains limited. This study aimed to identify ERS-related differentially expressed genes (ERSRDEGs) in osteoporosis, construct a diagnostic model, and elucidate associated molecular mechanisms.
View Article and Find Full Text PDFACS Omega
September 2025
Department of Chemistry, Faculty of Science, Dokuz Eylul University, Izmir 35160, Turkey.
A novel silica-based sorbent, silica-carbazole-formazan (Si-Carb-Formazan), was synthesized through in situ functionalization with a newly prepared carbazole formazan derivative to remove Cu-(II) ions from aqueous solutions efficiently. The sorbent was characterized using techniques such as FTIR, SEM, TGA, and XPS, which revealed a porous structure with a high surface area and excellent thermal stability. Batch adsorption experiments analyzed the influence of various factors on the sorbent's performance, demonstrating its high efficiency.
View Article and Find Full Text PDFiScience
September 2025
Guangdong Provincial Key Laboratory of Mathematical and Neural Dynamical Systems, School of Computing and Information Technology, Great Bay University, Dongguan, China.
Distinguishing similar cancer subtypes and predicting responses to immune checkpoint blockade (ICB) are critical for improving clinical outcomes. However, existing gene expression signatures often suffer from batch effects and poor generalizability across cohorts. To address these limitations, we propose adaptive individualized gene pair signatures (AIGPS), a robust method that adaptively quantifies gene pair reversals and selects informative features using machine learning.
View Article and Find Full Text PDFAdv Sci (Weinh)
September 2025
School of Artificial Intelligence, Jilin University, Changchun, 130012, China.
Single-cell multi-omics technologies are pivotal for deciphering the complexities of biological systems, with Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-seq) emerging as a particularly valuable approach. The dual-modality capability makes CITE-seq particularly advantageous for dissecting cellular heterogeneity and understanding the dynamic interplay between transcriptomic and proteomic landscapes. However, existing computational models for integrating these two modalities often struggle to capture the complex, non-linear interactions between RNA and antibody-derived tags (ADTs), and are computationally intensive.
View Article and Find Full Text PDF