98%
921
2 minutes
20
Current interrater reliability (IRR) coefficients ignore the nested structure of multilevel observational data, resulting in biased estimates of both subject- and cluster-level IRR. We used generalizability theory to provide a conceptualization and estimation method for IRR of continuous multilevel observational data. We explain how generalizability theory decomposes the variance of multilevel observational data into subject-, cluster-, and rater-related components, which can be estimated using Markov chain Monte Carlo (MCMC) estimation. We explain how IRR coefficients for each level can be derived from these variance components, and how they can be estimated as intraclass correlation coefficients (ICC). We assessed the quality of MCMC point and interval estimates with a simulation study, and showed that small numbers of raters were the main source of bias and inefficiency of the ICCs. In a follow-up simulation, we showed that a planned missing data design can diminish most estimation difficulties in these conditions, yielding a useful approach to estimating multilevel interrater reliability for most social and behavioral research. We illustrated the method using data on student-teacher relationships. All software code and data used for this article is available on the Open Science Framework: https://osf.io/bwk5t/. (PsycInfo Database Record (c) 2022 APA, all rights reserved).
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1037/met0000391 | DOI Listing |
BMJ Open
September 2025
Upstream Lab, MAP Centre for Urban Health Solutions, Li Ka Shing Knowledge Institute, Unity Health Toronto, Toronto, Ontario, Canada
Objective: This study validates the previously tested Screening for Poverty And Related social determinants to improve Knowledge of and access to resources ('SPARK Tool') against comparison questions from well-established national surveys (Post Survey Questionnaire (PSQ)) to inform the development of a standardised tool to collect patients' demographic and social needs data in healthcare.
Design: Cross-sectional study.
Setting: Pan-Canadian study of participants from four Canadian provinces (SK, MB, ON and NL).
Percept Mot Skills
September 2025
College of Physical Education, Shandong Normal University, Jinan, China.
This study aims to assess the applicability of the Canadian Agility and Movement Skill Assessment (CAMSA) in Chinese children aged 8-12 and to undertake preliminary revisions for areas found to be unsuitable. A randomized sample of 911 children aged 8-12 underwent testing. The results showed that difficulty coefficients for time scores among 8-9-year-olds were relatively low (.
View Article and Find Full Text PDFEur J Gastroenterol Hepatol
August 2025
Department of Gastroenterology and Hepatology, Monash Health.
Background And Aims: Despite therapeutic advances, resection rates in Crohn's disease remain high. Kono-S is a novel anastomosis for ileocolonic resections; however, its altered configuration may challenge standard endoscopic assessment, particularly in the absence of validated scoring tools. This study evaluated the endoscopic assessment of Kono-S anastomosis anatomy and recurrence stratification using Rutgeert's score.
View Article and Find Full Text PDFGlob Ment Health (Camb)
July 2025
Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, UK.
Problem-solving therapy (PST) is a brief psychological intervention often implemented for depression. Currently, there are no tools with well-evidenced reliability to measure PST fidelity. This pilot study aimed to measure the inter-rater reliability and agreement of the blem-Slving Therapy idelity (PROOF) scale, comprising binary 14-item adherence and an 8-item competence subscales.
View Article and Find Full Text PDFInt J Rehabil Res
September 2025
Visual Impairments, Faculty of Education and Rehabilitation Sciences, University of Zagreb, Zagreb, Croatia.
The Visual Function Classification System (VFCS) provides a standardised framework for grading visual functioning in children with cerebral palsy (CP). This study evaluated the reliability and construct validity of the Croatian VFCS, and its ability to distinguish visual functioning across CP subtypes and functional classifications. Ninety-five children with CP (mean age: 11.
View Article and Find Full Text PDF