98%
921
2 minutes
20
Recent reform efforts have pushed toward a better understanding of the distinction between exploratory and confirmatory research, and appropriate use of each. As some utilize more exploratory tools, it may be tempting to employ multiple linear regression models. In this paper, we advocate for the use of random forest (RF) models. RF is able to obtain better predictive performance than traditional regression, while also inherently protecting against overfitting as well as detecting nonlinear effects and interactions among predictors. Given the advantages of RF compared to other statistical procedures, it is a tool commonly used within a plethora of industries, including stock trading, banking, pharmaceuticals, and patient healthcare planning. However, we find RF is used within the field of psychology comparatively less frequently. In the current paper, we advocate for RF as an important statistical tool within the context of behavioral and psychological research. In hopes of increasing the use of RF in the field of psychology, we provide information pertaining to the limitations one might confront in using RF and how to overcome such limitations. Moreover, we discuss various methods for how to optimally utilize RF with psychological data, such as nonparametric modeling, interaction and nonlinearity detection, variable selection, prediction and classification modeling, and assessing parameters of Monte Carlo simulations. Throughout, we illustrate the use of RF with visualization strategies, aimed to make RF models more comprehensible and intuitive.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.3758/s13428-022-01901-9 | DOI Listing |
Mol Divers
September 2025
Department of Biotechnology, National Institute of Technology Raipur, Raipur, Chhattisgarh, 492001, India.
Traditional drug discovery methods like high-throughput screening and molecular docking are slow and costly. This study introduces a machine learning framework to predict bioactivity (pIC₅₀) and identify key molecular properties and structural features for targeting Trypanothione reductase (TR), Protein kinase C theta (PKC-θ), and Cannabinoid receptor 1 (CB1) using data from the ChEMBL database. Molecular fingerprints, generated via PaDEL-Descriptor and RDKit, encoded structural features as binary vectors.
View Article and Find Full Text PDFJ Neurooncol
September 2025
Department of Neurology, Xiangya Hospital, Central South University, No.87 Xiangya Road, Kaifu District, Changsha, 410008, Hunan Province, China.
Background And Objective: Differentiating central nervous system infections (CNSIs) from brain tumors (BTs) is difficult due to overlapping features and the limited individual indicators, and cerebrospinal fluid (CSF) cytology remains underutilized. To improve differential diagnosis, we developed a model based on 9 early, cost-effective cerebrospinal fluid parameters, including CSF cytology.
Methods: Patients diagnosed with CNSIs or BTs at Xiangya Hospital of Central South University between October 1st, 2017 and March 31st, 2024 were enrolled and divided into the training set and the test set.
Clin Nurs Res
September 2025
Chonnam National University College of Nursing, Donggu, Gwangju, South Korea.
The increasing prevalence of diabetes mellitus (DM) and patients' lack of self-management awareness have led to a decline in health-related quality of life (HRQoL). Studies identifying potential risk factors for HRQoL in DM patients and presenting generalized models are relatively scarce. The study aimed to develop and evaluate a machine learning (ML)-based model to predict the HRQoL in adult diabetic patients and to examine the important factors affecting HRQoL.
View Article and Find Full Text PDFClin Nurs Res
September 2025
Xuzhou Medical University, Jiangsu Province, China.
This study aimed to develop and validate a machine learning-based predictive model for assessing the risk of fear of childbirth in pregnant women during late pregnancy. A cross-sectional observational study was conducted from November 2022 to July 2023, involving 406 pregnant women. Six machine learning algorithms, including Lasso-assisted logistic regression (LR), random forest (RF), eXtreme Gradient Boosting (XGB), support vector machine (SVM), Bayesian network (BN), and k-nearest neighbors (KNN), were used to construct the models with 10-fold cross-validation.
View Article and Find Full Text PDFCancer Med
September 2025
Department of Computer Engineering, Social and Biological Network Analysis Laboratory, University of Kurdistan, Sanandaj, Iran.
Background: Ovarian cancer (OC) remains the most lethal gynecological malignancy, largely due to its late-stage diagnosis and nonspecific early symptoms. Advances in biomarker identification and machine learning offer promising avenues for improving early detection and prognosis. This review evaluates the role of biomarker-driven ML models in enhancing the early detection, risk stratification, and treatment planning of OC.
View Article and Find Full Text PDF