Evaluating the Influence of Role-Playing Prompts on ChatGPT's Misinformation Detection Accuracy: Quantitative Study.

JMIR Infodemiology

Global Health Program, Department of Anthropology, University of California, San Diego, La Jolla, CA, United States.

Published: September 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Background: During the COVID-19 pandemic, the rapid spread of misinformation on social media created significant public health challenges. Large language models (LLMs), pretrained on extensive textual data, have shown potential in detecting misinformation, but their performance can be influenced by factors such as prompt engineering (ie, modifying LLM requests to assess changes in output). One form of prompt engineering is role-playing, where, upon request, OpenAI's ChatGPT imitates specific social roles or identities. This research examines how ChatGPT's accuracy in detecting COVID-19-related misinformation is affected when it is assigned social identities in the request prompt. Understanding how LLMs respond to different identity cues can inform messaging campaigns, ensuring effective use in public health communications.

Objective: This study investigates the impact of role-playing prompts on ChatGPT's accuracy in detecting misinformation. This study also assesses differences in performance when misinformation is explicitly stated versus implied, based on contextual knowledge, and examines the reasoning given by ChatGPT for classification decisions.

Methods: Overall, 36 real-world tweets about COVID-19 collected in September 2021 were categorized into misinformation, sentiment (opinions aligned vs unaligned with public health guidelines), corrections, and neutral reporting. ChatGPT was tested with prompts incorporating different combinations of multiple social identities (ie, political beliefs, education levels, locality, religiosity, and personality traits), resulting in 51,840 runs. Two control conditions were used to compare results: prompts with no identities and those including only political identity.

Results: The findings reveal that including social identities in prompts reduces average detection accuracy, with a notable drop from 68.1% (SD 41.2%; no identities) to 29.3% (SD 31.6%; all identities included). Prompts with only political identity resulted in the lowest accuracy (19.2%, SD 29.2%). ChatGPT was also able to distinguish between sentiments expressing opinions not aligned with public health guidelines from misinformation making declarative statements. There were no consistent differences in performance between explicit and implicit misinformation requiring contextual knowledge. While the findings show that the inclusion of identities decreased detection accuracy, it remains uncertain whether ChatGPT adopts views aligned with social identities: when assigned a conservative identity, ChatGPT identified misinformation with nearly the same accuracy as it did when assigned a liberal identity. While political identity was mentioned most frequently in ChatGPT's explanations for its classification decisions, the rationales for classifications were inconsistent across study conditions, and contradictory explanations were provided in some instances.

Conclusions: These results indicate that ChatGPT's ability to classify misinformation is negatively impacted when role-playing social identities, highlighting the complexity of integrating human biases and perspectives in LLMs. This points to the need for human oversight in the use of LLMs for misinformation detection. Further research is needed to understand how LLMs weigh social identities in prompt-based tasks and explore their application in different cultural contexts.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11467603PMC
http://dx.doi.org/10.2196/60678DOI Listing

Publication Analysis

Top Keywords

social identities
24
public health
16
misinformation
12
detection accuracy
12
identities
11
role-playing prompts
8
prompts chatgpt's
8
misinformation detection
8
social
8
detecting misinformation
8

Similar Publications

"To love, to create, to express, to live"- a typology of aging poems by older men.

J Aging Stud

September 2025

Louis and Gabi Weisfeld School of Social Work, Bar Ilan University, Ramat Gan, Israel. Electronic address:

Poetry writing can serve as a means for personal expression of feelings, thoughts, and attitudes toward various subjects, as well as for a deeper understanding of lived experiences and identity. The present study examined the aging experiences of men over the age of 70 (N = 15), living in a continuing care retirement community in Israel, as reflected in the poems they wrote. The poems were analyzed using latent content analysis, resulting in a typology of three types of poems: a) Preparation for end-of-life poems, b) Positive aging poems, and c) Nostalgic poems.

View Article and Find Full Text PDF

Over the last decade, a range of research has demonstrated the detrimental impacts of policies criminalizing migration ("crimmigration") on Latinx mental health. In this study, we seek to examine youth perspectives on how crimmigration policies affect Latinx adolescents' connections to Latinx identity, culture, and communities and the implications for Latinx youth mental health. We explored how immigration enforcement policies affect Latinx youths' mental health using photovoice with ten youth in a high-deportation county in Atlanta in 2022.

View Article and Find Full Text PDF

Purpose: Food insecurity (FI) is a social determinant of health and health disparity that leads to increased risk of chronic health conditions. Despite the widespread implementation of FI screening in other settings, the role of the anesthesia team in FI screening is underused, increasing the chance of at-risk individuals not being identified. The anesthesia preoperative interview is an opportunity to identify patients experiencing FI and provide resources to improve outcomes.

View Article and Find Full Text PDF

Background: Linking cancer cohort participants to state cancer registries typically relies on personally identifiable information, including Social Security Numbers (SSN), which uniquely identify individuals. However, complete SSN collection can be limited due to privacy concerns. This study evaluates the sensitivity of cancer registry linkage using partial or missing SSN and examines differences by demographic characteristics.

View Article and Find Full Text PDF