98%
921
2 minutes
20
Background: Online health information is widely available, but a substantial portion of it is inaccurate or misleading, including exaggerated, incomplete, or unverified claims. Such misinformation can significantly influence public health decisions and pose serious challenges to health care systems. With advances in artificial intelligence and natural language processing, pretrained large language models (LLMs) have shown promise in identifying and distinguishing misleading health information, although their effectiveness in this area remains underexplored.
Objective: This study aimed to evaluate the performance of 4 mainstream LLMs (ChatGPT-3.5, ChatGPT-4, Ernie Bot, and iFLYTEK Spark) in the identification of online health information, providing empirical evidence for their practical application in this field.
Methods: Web scraping was used to collect data from rumor-refuting websites, resulting in 2708 samples of online health information, including both true and false claims. The 4 LLMs' application programming interfaces were used for authenticity verification, with expert results as benchmarks. Model performance was evaluated using semantic similarity, accuracy, recall, F-score, content analysis, and credibility.
Results: This study found that the 4 models performed well in identifying online health information. Among them, ChatGPT-4 achieved the highest accuracy at 87.27%, followed by Ernie Bot at 87.25%, iFLYTEK Spark at 87%, and ChatGPT-3.5 at 81.82%. Furthermore, text length and semantic similarity analysis showed that Ernie Bot had the highest similarity to expert texts, whereas ChatGPT-4 showed good overall consistency in its explanations. In addition, the credibility assessment results indicated that ChatGPT-4 provided the most reliable evaluations. Further analysis suggested that the highest misjudgment probabilities with respect to the LLMs occurred within the topics of food and maternal-infant nutrition management and nutritional science and food controversies. Overall, the research suggests that LLMs have potential in online health information identification; however, their understanding of certain specialized health topics may require further improvement.
Conclusions: The results demonstrate that, while these models show potential in providing assistance, their performance varies significantly in terms of accuracy, semantic understanding, and cultural adaptability. The principal findings highlight the models' ability to generate accessible and context-aware explanations; however, they fall short in areas requiring specialized medical knowledge or updated data, particularly for emerging health issues and context-sensitive scenarios. Significant discrepancies were observed in the models' ability to distinguish scientifically verified knowledge from popular misconceptions and in their stability when processing complex linguistic and cultural contexts. These challenges reveal the importance of refining training methodologies to improve the models' reliability and adaptability. Future research should focus on enhancing the models' capability to manage nuanced health topics and diverse cultural and linguistic nuances, thereby facilitating their broader adoption as reliable tools for online health information identification.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12120363 | PMC |
http://dx.doi.org/10.2196/70733 | DOI Listing |
Neurol Res Pract
September 2025
German Neurological Society, Berlin, Germany.
Background: Recreational nitrous oxide (NO) abuse has become increasingly prevalent, raising concerns about associated health risks. In Germany, the lack of reliable data on NO consumption patterns limits the development of effective public health interventions. This study aims to address this knowledge gap by examining trends, determinants, and health consequences of NO abuse in Germany.
View Article and Find Full Text PDFBMC Med Ethics
September 2025
Dept of Neurobiology, Care Sciences, and Society, Karolinska Institutet, Stockholm, Sweden.
BMC Public Health
September 2025
Department of Sociology and Work Science, University of Gothenburg, Gothenburg, Sweden.
Background: Mental health problems are common in the working-age population. More knowledge is needed on how to support work participation and reduce sickness absence. The objective of the study was to estimate the distribution of mental well-being and work capacity in women and men in a working population and assess the association between mental well-being and work capacity, while adjusting for sociodemographic characteristics, health status, and working positions.
View Article and Find Full Text PDFBMC Public Health
September 2025
Department of Dermatology and Allergy, TUM School of Medicine and Health, Technical University of Munich, Biedersteiner Str. 29, 80802, Munich, Germany.
Background: Psoriasis, a chronic inflammatory skin disorder, imposes a high burden on those affected, often leading to stigma and increased depression risk. With the increasing importance of digital media in medical contexts, there is a notable prevalence of misinformation and low-quality content. This study aims to explore the experiences of individuals affected by psoriasis regarding their disease-related digital media use.
View Article and Find Full Text PDF