Interrater reliability for multilevel data: A generalizability theory approach.

Debby Ten Hove , Terrence D Jorgensen , L Andries van der Ark

Psychol Methods

Research Institute of Child Development and Education.

Published: August 2022

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

Current interrater reliability (IRR) coefficients ignore the nested structure of multilevel observational data, resulting in biased estimates of both subject- and cluster-level IRR. We used generalizability theory to provide a conceptualization and estimation method for IRR of continuous multilevel observational data. We explain how generalizability theory decomposes the variance of multilevel observational data into subject-, cluster-, and rater-related components, which can be estimated using Markov chain Monte Carlo (MCMC) estimation. We explain how IRR coefficients for each level can be derived from these variance components, and how they can be estimated as intraclass correlation coefficients (ICC). We assessed the quality of MCMC point and interval estimates with a simulation study, and showed that small numbers of raters were the main source of bias and inefficiency of the ICCs. In a follow-up simulation, we showed that a planned missing data design can diminish most estimation difficulties in these conditions, yielding a useful approach to estimating multilevel interrater reliability for most social and behavioral research. We illustrated the method using data on student-teacher relationships. All software code and data used for this article is available on the Open Science Framework: https://osf.io/bwk5t/. (PsycInfo Database Record (c) 2022 APA, all rights reserved).

Download full-text PDF	Source
http://dx.doi.org/10.1037/met0000391	DOI Listing

Publication Analysis

Top Keywords

interrater reliability

generalizability theory

multilevel observational

observational data

irr coefficients

components estimated

data

multilevel

reliability multilevel

multilevel data

Similar Publications

Validation of a standardised approach to collect sociodemographic and social needs data in Canadian primary care: cross-sectional study of the SPARK tool.

BMJ Open

September 2025

Upstream Lab, MAP Centre for Urban Health Solutions, Li Ka Shing Knowledge Institute, Unity Health Toronto, Toronto, Ontario, Canada

Leanne Kosowan , Alan Katz , Dana Howse , Itunuoluwa Adekoya , Alannah Delahunty-Pike

Objective: This study validates the previously tested Screening for Poverty And Related social determinants to improve Knowledge of and access to resources ('SPARK Tool') against comparison questions from well-established national surveys (Post Survey Questionnaire (PSQ)) to inform the development of a standardised tool to collect patients' demographic and social needs data in healthcare.

Design: Cross-sectional study.

Setting: Pan-Canadian study of participants from four Canadian provinces (SK, MB, ON and NL).

View Article and Find Full Text PDF

Similar Publications

Validation of the Applicability and Standard Revision of the Canadian Agility and Movement Skill Assessment in Chinese Children Aged 8-12.

Percept Mot Skills

September 2025

College of Physical Education, Shandong Normal University, Jinan, China.

Xiaojin Mao , Yunjiao Yang , Han Xie , Botian Wang , Wenhao Li

This study aims to assess the applicability of the Canadian Agility and Movement Skill Assessment (CAMSA) in Chinese children aged 8-12 and to undertake preliminary revisions for areas found to be unsuitable. A randomized sample of 911 children aged 8-12 underwent testing. The results showed that difficulty coefficients for time scores among 8-9-year-olds were relatively low (.

View Article and Find Full Text PDF

Similar Publications

Evaluating the completeness of postoperative endoscopic recurrence assessment in Crohn's disease patients with Kono-S anastomoses.

Eur J Gastroenterol Hepatol

August 2025

Department of Gastroenterology and Hepatology, Monash Health.

Nikita Parkash , Charlotte Keung , Sally J Bell , Gregory T Moore

Background And Aims: Despite therapeutic advances, resection rates in Crohn's disease remain high. Kono-S is a novel anastomosis for ileocolonic resections; however, its altered configuration may challenge standard endoscopic assessment, particularly in the absence of validated scoring tools. This study evaluated the endoscopic assessment of Kono-S anastomosis anatomy and recurrence stratification using Rutgeert's score.

View Article and Find Full Text PDF

Similar Publications

Development and preliminary inter-rater reliability of the new PROOF tool to measure fidelity of problem-solving therapy for depression delivered by non-specialists in a low-resource African setting.

Glob Ment Health (Camb)

July 2025

Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.

Lily Cooke , Tarisai Bere , Amelia Stanton , Walter Mangezi , Steven A Safren

Problem-solving therapy (PST) is a brief psychological intervention often implemented for depression. Currently, there are no tools with well-evidenced reliability to measure PST fidelity. This pilot study aimed to measure the inter-rater reliability and agreement of the blem-Slving Therapy idelity (PROOF) scale, comprising binary 14-item adherence and an 8-item competence subscales.

View Article and Find Full Text PDF

Similar Publications

Validation of the Croatian Visual Function Classification System and subtype-specific differences in cerebral palsy.

Int J Rehabil Res

September 2025

Visual Impairments, Faculty of Education and Rehabilitation Sciences, University of Zagreb, Zagreb, Croatia.

Ana Katušić , Sonja Alimović , Andrea Paulik

The Visual Function Classification System (VFCS) provides a standardised framework for grading visual functioning in children with cerebral palsy (CP). This study evaluated the reliability and construct validity of the Croatian VFCS, and its ability to distinguish visual functioning across CP subtypes and functional classifications. Ninety-five children with CP (mean age: 11.

View Article and Find Full Text PDF

Similar Publications