98%
921
2 minutes
20
Generalized linear mixed models (GLMMs) are widely used in research for their ability to model correlated outcomes with non-Gaussian conditional distributions. The proper selection of fixed and random effects is a critical part of the modeling process, where model misspecification may lead to significant bias. However, the joint selection of fixed and random effects has historically been limited to lower dimensional GLMMs, largely due to the use of criterion-based model selection strategies. Here we present the R package glmmPen, one of the first to select fixed and random effects in higher dimension using a penalized GLMM modeling framework. Model parameters are estimated using a Monte Carlo expectation conditional minimization (MCECM) algorithm, which leverages Stan and RcppArmadillo for increased computational efficiency. Our package supports the Binomial, Gaussian, and Poisson families and multiple penalty functions. In this manuscript we discuss the modeling procedure, estimation scheme, and software implementation through application to a pancreatic cancer subtyping study. Simulation results show our method has good performance in selecting both the fixed and random effects in high dimensional GLMMs.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11138212 | PMC |
http://dx.doi.org/10.32614/rj-2023-086 | DOI Listing |
J Appl Stat
February 2025
Department of Mathematics and Statistics, Wright State University, Dayton, Ohio, USA.
Overdispersion is a common phenomenon in genetic data, such as gene expression count data. In genetic association studies, it is important to investigate the association between a gene expression and a set of genetic variants from a pathway. However, existing approaches for pathway analysis are primarily designed for continuous and binary outcomes and are not applicable to overdispersed count data.
View Article and Find Full Text PDFProc Mach Learn Res
November 2024
Pretraining plays a pivotal role in acquiring generalized knowledge from large-scale data, achieving remarkable successes as evidenced by large models in CV and NLP. However, progress in the graph domain remains limited due to fundamental challenges represented by feature heterogeneity and structural heterogeneity. Recent efforts have been made to address feature heterogeneity via Large Language Models (LLMs) on text-attributed graphs (TAGs) by generating fixed-length text representations as node features.
View Article and Find Full Text PDFBioinform Adv
August 2025
Department of CSE, BUET, Dhaka 1000, Bangladesh.
Motivation: Heavy usage of synthetic nitrogen fertilizers to satisfy the increasing demands for food has led to severe environmental impacts like decreasing crop yields and eutrophication. One promising alternative is using nitrogen-fixing microorganisms as biofertilizers, which use the nitrogenase enzyme. This could also be achieved by expressing a functional nitrogenase enzyme in the cells of the cereal crops.
View Article and Find Full Text PDFPsychogeriatrics
September 2025
Department of Psychiatry, The 4th People's Hospital of Ziyang, Ziyang Psychosis Hospital, Ziyang, China.
Background: Olfactory training (OT) has been proposed as a non-pharmacological intervention to improve cognitive functions and depressive symptomatology, but evidence remains fragmented.
Methods: In this study, we conducted a systematic review and meta-analysis of randomised controlled trials (RCTs) comparing OT versus control in middle-aged and elderly adults. Four databases (PubMed, Cochrane Library, Web of Science, Embase) were systematically searched from database inception through June 2025.
Int J Sports Physiol Perform
September 2025
Department of Physical Education and Sport Sciences, National Taiwan Normal University, Taipei, Taiwan.
Purpose: This study explored the acute physiological effects of different eccentric tempos, explosive speed (EXP), volitional speed, and 4-second tempo during 5 sets of velocity-based squat training.
Methods: Twelve healthy males performed parallel squats under 3 eccentric conditions using a randomized crossover design. Each session included 5 sets at a relative load, initiated with a concentric mean velocity of 0.