Article Abstract

Recent reform efforts have pushed toward a better understanding of the distinction between exploratory and confirmatory research, and appropriate use of each. As some utilize more exploratory tools, it may be tempting to employ multiple linear regression models. In this paper, we advocate for the use of random forest (RF) models. RF is able to obtain better predictive performance than traditional regression, while also inherently protecting against overfitting as well as detecting nonlinear effects and interactions among predictors. Given the advantages of RF compared to other statistical procedures, it is a tool commonly used within a plethora of industries, including stock trading, banking, pharmaceuticals, and patient healthcare planning. However, we find RF is used within the field of psychology comparatively less frequently. In the current paper, we advocate for RF as an important statistical tool within the context of behavioral and psychological research. In hopes of increasing the use of RF in the field of psychology, we provide information pertaining to the limitations one might confront in using RF and how to overcome such limitations. Moreover, we discuss various methods for how to optimally utilize RF with psychological data, such as nonparametric modeling, interaction and nonlinearity detection, variable selection, prediction and classification modeling, and assessing parameters of Monte Carlo simulations. Throughout, we illustrate the use of RF with visualization strategies, aimed to make RF models more comprehensible and intuitive.
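The abstract's claim that RF can pick up nonlinear effects that a linear regression misses can be illustrated with a small sketch. This is not the authors' code or data. It is a minimal, standard-library-only illustration on simulated data, where the outcome depends on a single predictor through a hypothetical quadratic relationship. Because there is only one predictor, the sketch implements only the bootstrap-aggregation (bagging) core of a random forest; the random subsampling of predictors at each split, which a full RF also uses, is omitted. All function names, the sample sizes, the tree depth, and the number of trees are assumptions chosen for illustration.

```python
import random


def fit_tree(xs, ys, depth, min_leaf=5):
    """Fit a one-predictor regression tree.

    A leaf is a float (the mean outcome); a split is (threshold, left, right).
    """
    n = len(xs)
    mean = sum(ys) / n
    if depth == 0 or n < 2 * min_leaf:
        return mean
    pairs = sorted(zip(xs, ys))
    total = sum(ys)
    best_score, best_k = float("-inf"), None
    left_sum = 0.0
    for k in range(1, n):
        left_sum += pairs[k - 1][1]
        # Skip splits that leave a child too small or that fall between tied x values.
        if k < min_leaf or n - k < min_leaf or pairs[k - 1][0] == pairs[k][0]:
            continue
        # Maximizing this score is equivalent to minimizing the children's summed squared error.
        score = left_sum ** 2 / k + (total - left_sum) ** 2 / (n - k)
        if score > best_score:
            best_score, best_k = score, k
    if best_k is None:
        return mean
    threshold = (pairs[best_k - 1][0] + pairs[best_k][0]) / 2
    lx, ly = zip(*pairs[:best_k])
    rx, ry = zip(*pairs[best_k:])
    return (threshold,
            fit_tree(lx, ly, depth - 1, min_leaf),
            fit_tree(rx, ry, depth - 1, min_leaf))


def predict_tree(tree, x):
    """Walk down the tree until a leaf (a float) is reached."""
    while isinstance(tree, tuple):
        threshold, left, right = tree
        tree = left if x <= threshold else right
    return tree


def fit_forest(xs, ys, n_trees, depth, rng):
    """Fit each tree on a bootstrap resample (sampling rows with replacement)."""
    n = len(xs)
    forest = []
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]
        forest.append(fit_tree([xs[i] for i in idx], [ys[i] for i in idx], depth))
    return forest


def predict_forest(forest, x):
    """Average the predictions of all trees."""
    return sum(predict_tree(t, x) for t in forest) / len(forest)


def fit_line(xs, ys):
    """Ordinary least squares for a single predictor; returns (intercept, slope)."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    return my - slope * mx, slope


def mse(preds, ys):
    return sum((p - y) ** 2 for p, y in zip(preds, ys)) / len(ys)


rng = random.Random(0)  # fixed seed so the simulation is reproducible


def simulate(n):
    # Hypothetical data: a U-shaped effect of x on y plus Gaussian noise.
    xs = [rng.uniform(-3, 3) for _ in range(n)]
    ys = [x * x + rng.gauss(0, 0.5) for x in xs]
    return xs, ys


train_x, train_y = simulate(200)
test_x, test_y = simulate(200)

intercept, slope = fit_line(train_x, train_y)
forest = fit_forest(train_x, train_y, n_trees=50, depth=5, rng=rng)

lin_mse = mse([intercept + slope * x for x in test_x], test_y)
rf_mse = mse([predict_forest(forest, x) for x in test_x], test_y)
print(f"linear regression test MSE: {lin_mse:.2f}")
print(f"bagged trees test MSE:      {rf_mse:.2f}")
```

Evaluating both models on held-out data, rather than on the data used for fitting, is what makes the comparison about predictive performance. In applied work one would more likely rely on an established RF implementation than on a hand-written one like this.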

Source
http://dx.doi.org/10.3758/s13428-022-01901-9

Similar Publications

Traditional drug discovery methods like high-throughput screening and molecular docking are slow and costly. This study introduces a machine learning framework to predict bioactivity (pIC₅₀) and identify key molecular properties and structural features for targeting Trypanothione reductase (TR), Protein kinase C theta (PKC-θ), and Cannabinoid receptor 1 (CB1) using data from the ChEMBL database. Molecular fingerprints, generated via PaDEL-Descriptor and RDKit, encoded structural features as binary vectors.

Utility and performance of cerebrospinal fluid cytology in discriminating central nervous system infections and brain tumors.

J Neurooncol

September 2025

Department of Neurology, Xiangya Hospital, Central South University, No.87 Xiangya Road, Kaifu District, Changsha, 410008, Hunan Province, China.

Background and Objective: Differentiating central nervous system infections (CNSIs) from brain tumors (BTs) is difficult because their features overlap and individual indicators are of limited value, and cerebrospinal fluid (CSF) cytology remains underutilized. To improve differential diagnosis, we developed a model based on 9 early, cost-effective CSF parameters, including CSF cytology.

Methods: Patients diagnosed with CNSIs or BTs at Xiangya Hospital of Central South University between October 1st, 2017 and March 31st, 2024 were enrolled and divided into the training set and the test set.

The increasing prevalence of diabetes mellitus (DM) and patients' lack of self-management awareness have led to a decline in health-related quality of life (HRQoL). Studies identifying potential risk factors for HRQoL in DM patients and presenting generalized models are relatively scarce. The study aimed to develop and evaluate a machine learning (ML)-based model to predict the HRQoL in adult diabetic patients and to examine the important factors affecting HRQoL.

This study aimed to develop and validate a machine learning-based predictive model for assessing the risk of fear of childbirth in pregnant women during late pregnancy. A cross-sectional observational study was conducted from November 2022 to July 2023, involving 406 pregnant women. Six machine learning algorithms, including Lasso-assisted logistic regression (LR), random forest (RF), eXtreme Gradient Boosting (XGB), support vector machine (SVM), Bayesian network (BN), and k-nearest neighbors (KNN), were used to construct the models with 10-fold cross-validation.

Background: Ovarian cancer (OC) remains the most lethal gynecological malignancy, largely due to its late-stage diagnosis and nonspecific early symptoms. Advances in biomarker identification and machine learning offer promising avenues for improving early detection and prognosis. This review evaluates the role of biomarker-driven ML models in enhancing the early detection, risk stratification, and treatment planning of OC.
