Publications by Cherodeep Goswami

Publications by authors named "Cherodeep Goswami"

Page 1 of 1

Development and validation of the provider documentation summarization quality instrument for large language models.

Emma Croxford , Yanjun Gao , Nicholas Pellegrino , Karen Wong , Graham Wills , Cherodeep Goswami

J Am Med Inform Assoc

June 2025

Objectives: As large language models (LLMs) are integrated into electronic health record (EHR) workflows, validated instruments are essential to evaluate their performance before implementation and as models and documentation practices evolve. Existing instruments for provider documentation quality are often unsuitable for the complexities of LLM-generated text and lack validation on real-world data. The Provider Documentation Summarization Quality Instrument (PDSQI-9) was developed to evaluate LLM-generated clinical summaries.

View Article and Find Full Text PDF

Automating Evaluation of AI Text Generation in Healthcare with a Large Language Model (LLM)-as-a-Judge.

Emma Croxford , Yanjun Gao , Elliot First , Nicholas Pellegrino , Miranda Schnier , Cherodeep Goswami

medRxiv

May 2025

Electronic Health Records (EHRs) store vast amounts of clinical information that are difficult for healthcare providers to summarize and synthesize relevant details to their practice. To reduce cognitive load on providers, generative AI with Large Language Models have emerged to automatically summarize patient records into clear, actionable insights and offload the cognitive burden for providers. However, LLM summaries need to be precise and free from errors, making evaluations on the quality of the summaries necessary.

View Article and Find Full Text PDF

Clinical implementation of AI-based screening for risk for opioid use disorder in hospitalized adults.

Majid Afshar , Felice Resnik , Cara Joyce , Madeline Oguss , Dmitriy Dligach , Cherodeep Goswami

Nat Med

June 2025

Adults with opioid use disorder (OUD) are at increased risk for opioid-related complications and repeated hospital admissions. Routine screening for patients at risk for an OUD to prevent complications is not standard practice in many hospitals, leading to missed opportunities for intervention. The adoption of electronic health records (EHRs) and advancements in artificial intelligence (AI) offer a scalable approach to systematically identify at-risk patients for evidence-based care.

View Article and Find Full Text PDF

Current and future state of evaluation of large language models for medical summarization tasks.

Emma Croxford , Yanjun Gao , Nicholas Pellegrino , Karen Wong , Graham Wills , Cherodeep Goswami

Npj Health Syst

February 2025

Large Language Models have expanded the potential for clinical Natural Language Generation (NLG), presenting new opportunities to manage the vast amounts of medical text. However, their use in such high-stakes environments necessitate robust evaluation workflows. In this review, we investigated the current landscape of evaluation metrics for NLG in healthcare and proposed a future direction to address the resource constraints of expert human evaluation while balancing alignment with human judgments.

View Article and Find Full Text PDF

A Novel Playbook for Pragmatic Trial Operations to Monitor and Evaluate Ambient Artificial Intelligence in Clinical Practice.

Majid Afshar , Felice Resnik , Mary Ryan Baumann , Josie Hintzke , Anne Gravel Sullivan , Cherodeep Goswami

medRxiv

January 2025

Background: Ambient artificial intelligence offers promise for improving documentation efficiency and reducing provider burden through clinical note generation. However, challenges persist in workflow integration, compliance, and widespread adoption. This study leveraged a Learning Health System (LHS) framework to align research and operations using a hybrid effectiveness-implementation protocol, embedded as pragmatic trial operations within the electronic health record (EHR).

View Article and Find Full Text PDF

Prompt engineering with a large language model to assist providers in responding to patient inquiries: a real-time implementation in the electronic health record.

Majid Afshar , Yanjun Gao , Graham Wills , Jason Wang , Matthew M Churpek , Cherodeep Goswami

JAMIA Open

October 2024

Background: Large language models (LLMs) can assist providers in drafting responses to patient inquiries. We examined a prompt engineering strategy to draft responses for providers in the electronic health record. The aim was to evaluate the change in usability after prompt engineering.

View Article and Find Full Text PDF