
Article Abstract

Objective: This study evaluated the coherence, consistency, and diagnostic accuracy of eight AI-based chatbots in clinical scenarios related to dental implants.

Methods: A double-blind clinical experimental study was carried out between February and March 2025 to evaluate eight AI-based chatbots using six fictional cases simulating peri-implant mucositis and peri-implantitis. Each chatbot answered five standardized clinical questions across three independent runs per case, generating 720 binary outputs. Blinded investigators scored each response against a gold standard. Statistical analyses included chi-square and Fisher's exact tests, and Cohen's kappa was used to assess intra-model consistency, stability, and reliability for each AI chatbot.
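The design arithmetic and the agreement statistic named in the Methods can be illustrated with a minimal Python sketch. The abstract does not say how Cohen's kappa was aggregated over the three runs, so averaging the pairwise kappas between runs is an assumption made here for illustration. The function names and the example scores are hypothetical and are not taken from the study.

```python
from itertools import combinations

# Study design as reported in the abstract.
N_CHATBOTS, N_CASES, N_QUESTIONS, N_RUNS = 8, 6, 5, 3
total_outputs = N_CHATBOTS * N_CASES * N_QUESTIONS * N_RUNS  # 720 binary outputs
outputs_per_chatbot = total_outputs // N_CHATBOTS            # 90 per chatbot


def cohen_kappa(a, b):
    """Cohen's kappa for two equal-length sequences of binary (0/1) scores."""
    if len(a) != len(b) or not a:
        raise ValueError("score sequences must be non-empty and equally long")
    n = len(a)
    p_observed = sum(x == y for x, y in zip(a, b)) / n
    pa1, pb1 = sum(a) / n, sum(b) / n
    # Chance agreement: both score 1, or both score 0.
    p_expected = pa1 * pb1 + (1 - pa1) * (1 - pb1)
    if p_expected == 1.0:
        # Both runs are constant and identical; kappa is undefined, and
        # returning 1.0 here is a convention chosen for this sketch.
        return 1.0
    return (p_observed - p_expected) / (1 - p_expected)


def mean_pairwise_kappa(runs):
    """Average kappa over every pair of runs (an assumed aggregation)."""
    pairs = list(combinations(runs, 2))
    return sum(cohen_kappa(a, b) for a, b in pairs) / len(pairs)


# Hypothetical scores (1 = matches the gold standard) for one chatbot's runs.
example_runs = [
    [1, 1, 1, 0, 1, 0],
    [1, 1, 0, 0, 1, 0],
    [1, 1, 1, 0, 1, 1],
]
stability = mean_pairwise_kappa(example_runs)
```

A kappa near 1 indicates that a chatbot gives the same scored answer from run to run, while a value near 0 indicates agreement no better than chance.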

Results: GPT-4o demonstrated the highest diagnostic accuracy (88.8%), followed by Gemini (77.7%), OpenAI o3-mini (72.2%), OpenAI o3-mini-high (71.1%), Claude (66.6%), OpenAI o1 (60%), DeepSeek (55.5%), and Copilot (49.9%). GPT-4o also showed the highest intra-model stability (κ = 0.82) and consistency, while Copilot and DeepSeek showed the lowest reliability. Significant differences were observed only in the reference citation criterion (p < 0.001), for which Gemini was the only AI chatbot to achieve 100% compliance. Nevertheless, GPT-4o consistently outperformed the other AI chatbots across all evaluation domains.
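To show how a between-chatbot difference on a single binary criterion might be tested, the following is a minimal sketch of a two-sided Fisher's exact test on a 2x2 table. The abstract does not report which comparisons were run or the underlying counts, so the counts below are hypothetical and the function name is illustrative only.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more likely than the observed table.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Probability that the top-left cell equals x given fixed margins.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    lo, hi = max(0, col1 - row2), min(row1, col1)
    p_obs = prob(a)
    # Small relative tolerance guards against floating-point ties.
    p = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(1.0, p)


# Hypothetical counts: compliant vs. non-compliant responses for two chatbots.
p_value = fisher_exact_two_sided(18, 0, 9, 9)
```

Fisher's exact test is the usual choice over chi-square when some cell counts are small, as in the hypothetical table above where one cell is zero.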

Conclusion: GPT-4o demonstrated superior diagnostic accuracy and response consistency, reinforcing the influence of AI chatbot architecture and training on clinical reasoning performance. In contrast, Copilot showed lower reliability and higher variability, emphasizing the need for cautious, evidence-based adoption of AI tools in the diagnosis of peri-implant diseases.

Clinical Relevance: Understanding AI performance in peri-implant diagnosis supports evidence-based decision-making and the responsible clinical use of AI.


Source
http://dx.doi.org/10.1016/j.jdent.2025.106091

Publication Analysis

Top Keywords

clinical experimental: 8
experimental study: 8
diagnostic accuracy: 8
ai-based chatbots: 8
assessing diagnostic: 4
diagnostic treatment: 4
treatment accuracy: 4
accuracy large: 4
large language: 4
language models: 4

Similar Publications

Correction: Factors Affecting the Receptiveness of Chinese Internists and Surgeons Toward Artificial Intelligence-Driven Drug Prescription: Protocol for a Systematic Survey Study.

JMIR Res Protoc

September 2025

State Key Laboratory of Experimental Hematology, National Clinical Research Center for Blood Diseases, Haihe Laboratory of Cell Ecosystem, Institute of Hematology & Blood Diseases Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin, China.

[This corrects the article DOI: .].


Germline Findings From Tumor-Only Comprehensive Genomic Profiling in the RATIONAL Study: A Missed Opportunity?

JCO Precis Oncol

September 2025

Cell Biology and Biotherapy Unit, Istituto Nazionale Tumori IRCCS Fondazione G. Pascale, Napoli, Italy.

Purpose: Tumor comprehensive genomic profiling (CGP) may detect potential germline pathogenic/likely pathogenic (P/LP) alterations as secondary findings. We analyzed the frequency of potentially germline variants and large rearrangements (LRs) in the RATIONAL study, an Italian multicenter, observational clinical trial that collects next-generation sequencing-based tumor profiling data, and evaluated how these findings were managed by the enrolling centers.

Patients And Methods: Patients prospectively enrolled in the pathway-B of the RATIONAL study and undergoing CGP with the FoundationOne CDx assays were included in the analysis.


The hallmarks of mechanosensitive ion channels have been observed for half a century in various cell lines, although their mechanisms and molecular identities remained unknown until recently. Identification of the bona fide mammalian mechanosensory Piezo channels resulted in an explosion of research exploring the translation of mechanical cues into biochemical signals and dynamic cell morphology responses. One of the Piezo isoforms - Piezo1 - is integral in the erythrocyte (red blood cell; RBC) membrane.


EVOLVING TRENDS AND EMERGING THEMES IN GUT MICROBIOTA RESEARCH: A COMPREHENSIVE BIBLIOMETRIC ANALYSIS (2015-2024).

Arq Gastroenterol

September 2025

The Japanese Society of Internal Medicine, Editorial Department, Tokyo, Japan.

Background: This study aims to analyze research trends and emerging insights into gut microbiota studies from 2015 to 2024 through bibliometric analysis techniques. By examining bibliographic data from the Web of Science (WoS) Core Collection, it seeks to identify key research topics, evolving themes, and significant shifts in gut microbiota research. The study employs co-occurrence analysis, principal component analysis (PCA), and burst detection analysis to uncover latent patterns and the development trajectory of this rapidly expanding field.
