Publications by Vipina K Keloth | LitMetric

Publications by authors named "Vipina K Keloth"

Page 1 of 1

Scientific Writing in the Era of Large Language Models: A Computational Analysis of AI Versus Human-Created Content.

Rohan Khera , Aline F Pedroso , Vipina K Keloth , Hua Xu , Gisele S Silva

Stroke

August 2025

Background: Large language models (LLMs) are artificial intelligence (AI) tools that can generate human expert-like content and be used to accelerate the synthesis of scientific literature, but they can spread misinformation by producing misleading content. This study sought to characterize distinguishing linguistic features in differentiating AI-generated from human-authored scientific text and evaluate the performance of AI detection tools for this task.

Methods: We conducted a computational synthesis of 34 essays on cerebrovascular topics (12 generated by large language models [Generative Pre-trained Transformer 4, Generative Pre-trained Transformer 3.

View Article and Find Full Text PDF

Improving Large Language Models' Summarization Accuracy by Adding Highlights to Discharge Notes: Comparative Evaluation.

Mahshad Koohi Habibi Dehkordi , Yehoshua Perl , Fadi P Deek , Zhe He , Vipina K Keloth

JMIR Med Inform

July 2025

Background: The American Medical Association recommends that electronic health record (EHR) notes, often dense and written in nuanced language, be made readable for patients and laypeople, a practice we refer to as the simplification of discharge notes. Our approach to achieving the simplification of discharge notes involves a process of incremental simplification steps to achieve the ideal note. In this paper, we present the first step of this process.

View Article and Find Full Text PDF

Ontology enrichment using a large language model: Applying lexical, semantic, and knowledge network-based similarity for concept placement.

Navya Martin Kollapally , James Geller , Vipina Kuttichi Keloth , Zhe He , Julia Xu

J Biomed Inform

August 2025

Objective: Ontologies are essential for representing the knowledge of a domain. To make ontologies useful, they must encompass a comprehensive domain view. To achieve ontology enrichment, there is a need to discover new concepts to be added, either because they were missed in the first place, or the state-of-the-art has advanced to develop new real-world concepts.

View Article and Find Full Text PDF

Developing and sustaining inclusive language in biomedical informatics communications: an AMIA Board of Directors endorsed paper on the Inclusive Language and Context Style Guidelines.

Oliver Bear Don't Walk , Shefali Haldar , Duo Helen Wei , Hu Huang , Rebecca L Rivera , Vipina K Keloth

J Am Med Inform Assoc

August 2025

Objectives: In 2023, AMIA's Inclusive Language and Context Style Guidelines (the "Guidelines") were approved by the Board of Directors and made a publicly available resource. This work began in 2021 through AMIA's DEI Task Force and subsequent DEI Committee; many members provided input, feedback, and time to create the Guidelines. In this paper, the authors provide a transparent account of the origin, development, contents, and dissemination of the Guidelines and share plans for their future development and use.

View Article and Find Full Text PDF

Social determinants of health extraction from clinical notes across institutions using large language models.

Vipina K Keloth , Salih Selek , Qingyu Chen , Christopher Gilman , Sunyang Fu

NPJ Digit Med

May 2025

Detailed social determinants of health (SDoH) is often buried within clinical text in EHRs. Most current NLP efforts for SDoH have limitations, investigating limited factors, deriving data from a single institution, using specific patient cohorts/note types, with reduced focus on generalizability. We aim to address these issues by creating cross-institutional corpora and developing and evaluating the generalizability of classification models, including large language models (LLMs), for detecting SDoH factors using data from four institutions.

View Article and Find Full Text PDF

Benchmarking large language models for biomedical natural language processing applications and recommendations.

Qingyu Chen , Yan Hu , Xueqing Peng , Qianqian Xie , Qiao Jin , Vipina K Keloth

Nat Commun

April 2025

The rapid growth of biomedical literature poses challenges for manual knowledge curation and synthesis. Biomedical Natural Language Processing (BioNLP) automates the process. While Large Language Models (LLMs) have shown promise in general domains, their effectiveness in BioNLP tasks remains unclear due to limited benchmarks and practical guidelines.

View Article and Find Full Text PDF

The Development Landscape of Large Language Models for Biomedical Applications.

Zhiyuan Cao , Vipina K Keloth , Qianqian Xie , Lingfei Qian , Yuntian Liu

Annu Rev Biomed Data Sci

August 2025

Large language models (LLMs) have become powerful tools for biomedical applications, offering potential to transform healthcare and medical research. Since the release of ChatGPT in 2022, there has been a surge in LLMs for diverse biomedical applications. This review examines the landscape of text-based biomedical LLM development, analyzing model characteristics (e.

View Article and Find Full Text PDF

Detection of Gastrointestinal Bleeding With Large Language Models to Aid Quality Improvement and Appropriate Reimbursement.

Neil S Zheng , Vipina K Keloth , Kisung You , Daniel Kats , Darrick K Li

Gastroenterology

January 2025

Article Synopsis

The study focuses on using a generative AI pipeline to enhance the identification of overt gastrointestinal bleeding (GIB) in electronic health records, ultimately improving patient management and reimbursement accuracy.
The pipeline was developed using nursing notes from over 11,000 patients and demonstrated high accuracy in detecting various forms of bleeding, such as melena and hematochezia.
Results showed that the machine learning model for recurrent bleeding had exceptional diagnostic performance, and the reimbursement algorithm significantly increased average patient reimbursements by up to $3,247, resulting in millions of dollars in total reimbursement.

View Article and Find Full Text PDF

A Study of Biomedical Relation Extraction Using GPT Models.

Jeffrey Zhang , Maxwell Wibert , Huixue Zhou , Xueqing Peng , Qingyu Chen , Vipina K Keloth

AMIA Jt Summits Transl Sci Proc

May 2024

Relation Extraction (RE) is a natural language processing (NLP) task for extracting semantic relations between biomedical entities. Recent developments in pre-trained large language models (LLM) motivated NLP researchers to use them for various NLP tasks. We investigated GPT-3.

View Article and Find Full Text PDF

Large Language Models for Social Determinants of Health Information Extraction from Clinical Notes - A Generalizable Approach across Institutions.

Vipina K Keloth , Salih Selek , Qingyu Chen , Christopher Gilman , Sunyang Fu

medRxiv

May 2024

The consistent and persuasive evidence illustrating the influence of social determinants on health has prompted a growing realization throughout the health care sector that enhancing health and health equity will likely depend, at least to some extent, on addressing detrimental social determinants. However, detailed social determinants of health (SDoH) information is often buried within clinical narrative text in electronic health records (EHRs), necessitating natural language processing (NLP) methods to automatically extract these details. Most current NLP efforts for SDoH extraction have been limited, investigating on limited types of SDoH elements, deriving data from a single institution, focusing on specific patient cohorts or note types, with reduced focus on generalizability.

View Article and Find Full Text PDF

Ensemble pretrained language models to extract biomedical knowledge from literature.

Zhao Li , Qiang Wei , Liang-Chin Huang , Jianfu Li , Yan Hu , Vipina Kuttichi Keloth

J Am Med Inform Assoc

September 2024

Article Synopsis

The rapid growth of biomedical literature requires automated methods to understand relationships between concepts, leading to the LitCoin NLP challenge aimed at developing and benchmarking these techniques.
The study employed ensemble learning with specialized models like BioBERT and PubMedBERT for named entity recognition (NER), while also finetuning a large model for improved relation extraction tasks.
The developed NLP system achieved first place in NER and second in relation extraction, demonstrating that specialized models significantly outperform general models like ChatGPT in biomedical contexts, supporting future research initiatives.

View Article and Find Full Text PDF

Advancing entity recognition in biomedicine via instruction tuning of large language models.

Vipina K Keloth , Yan Hu , Qianqian Xie , Xueqing Peng , Yan Wang

Bioinformatics

March 2024

Motivation: Large Language Models (LLMs) have the potential to revolutionize the field of Natural Language Processing, excelling not only in text generation and reasoning tasks but also in their ability for zero/few-shot learning, swiftly adapting to new tasks with minimal fine-tuning. LLMs have also demonstrated great promise in biomedical and healthcare applications. However, when it comes to Named Entity Recognition (NER), particularly within the biomedical domain, LLMs fall short of the effectiveness exhibited by fine-tuned domain-specific models.

View Article and Find Full Text PDF

FedFSA: Hybrid and federated framework for functional status ascertainment across institutions.

Sunyang Fu , Heling Jia , Maria Vassilaki , Vipina K Keloth , Yifang Dang

J Biomed Inform

April 2024

Introduction: Patients' functional status assesses their independence in performing activities of daily living, including basic ADLs (bADL), and more complex instrumental activities (iADL). Existing studies have discovered that patients' functional status is a strong predictor of health outcomes, particularly in older adults. Depite their usefulness, much of the functional status information is stored in electronic health records (EHRs) in either semi-structured or free text formats.

View Article and Find Full Text PDF

Improving large language models for clinical named entity recognition via prompt engineering.

Yan Hu , Qingyu Chen , Jingcheng Du , Xueqing Peng , Vipina Kuttichi Keloth

J Am Med Inform Assoc

September 2024

Importance: The study highlights the potential of large language models, specifically GPT-3.5 and GPT-4, in processing complex clinical data and extracting meaningful information with minimal training data. By developing and refining prompt-based strategies, we can significantly enhance the models' performance, making them viable tools for clinical NER tasks and possibly reducing the reliance on extensive annotated datasets.

View Article and Find Full Text PDF

Integrating Commercial and Social Determinants of Health: A Unified Ontology for Non-Clinical Determinants of Health.

Navya Martin Kollapally , Vipina Kuttichi Keloth , Julia Xu , James Geller

AMIA Annu Symp Proc

January 2024

The pivotal impact of Social Determinants of Health (SDoH) on people's health and well-being has been widely recognized and researched. However, the effect of Commercial Determinants of Health (CDoH) is only now garnering increased attention. Developing an ontology for CDoH can offer a systematic approach to identifying and categorizing the diverse commercial factors affecting health.

View Article and Find Full Text PDF

Towards precise PICO extraction from abstracts of randomized controlled trials using a section-specific learning approach.

Yan Hu , Vipina K Keloth , Kalpana Raja , Yong Chen , Hua Xu

Bioinformatics

September 2023

Motivation: Automated extraction of participants, intervention, comparison/control, and outcome (PICO) from the randomized controlled trial (RCT) abstracts is important for evidence synthesis. Previous studies have demonstrated the feasibility of applying natural language processing (NLP) for PICO extraction. However, the performance is not optimal due to the complexity of PICO information in RCT abstracts and the challenges involved in their annotation.

View Article and Find Full Text PDF

Systematic design and data-driven evaluation of social determinants of health ontology (SDoHO).

Yifang Dang , Fang Li , Xinyue Hu , Vipina K Keloth , Meng Zhang

J Am Med Inform Assoc

August 2023

Objective: Social determinants of health (SDoH) play critical roles in health outcomes and well-being. Understanding the interplay of SDoH and health outcomes is critical to reducing healthcare inequalities and transforming a "sick care" system into a "health-promoting" system. To address the SDOH terminology gap and better embed relevant elements in advanced biomedical informatics, we propose an SDoH ontology (SDoHO), which represents fundamental SDoH factors and their relationships in a standardized and measurable way.

View Article and Find Full Text PDF

Representing and utilizing clinical textual data for real world studies: An OHDSI approach.

Vipina K Keloth , Juan M Banda , Michael Gurley , Paul M Heider , Georgina Kennedy

J Biomed Inform

June 2023

Article Synopsis

* The OHDSI consortium's NLP Working Group created methods and tools to improve the use of textual data in observational studies, detailing a framework for integrating this information into the OMOP Common Data Model (CDM).
* The authors also highlight the workflow for extracting and transforming data from clinical notes, share current applications of the NLP solution, and discuss challenges and lessons learned to aid other researchers in implementing NLP in their studies.

View Article and Find Full Text PDF

Mining of EHR for interface terminology concepts for annotating EHRs of COVID patients.

Vipina K Keloth , Shuxin Zhou , Luke Lindemann , Ling Zheng , Gai Elhanan

BMC Med Inform Decis Mak

February 2023

Background: Two years into the COVID-19 pandemic and with more than five million deaths worldwide, the healthcare establishment continues to struggle with every new wave of the pandemic resulting from a new coronavirus variant. Research has demonstrated that there are variations in the symptoms, and even in the order of symptom presentations, in COVID-19 patients infected by different SARS-CoV-2 variants (e.g.

View Article and Find Full Text PDF

Visual comprehension and orientation into the COVID-19 CIDO ontology.

Ling Zheng , Yehoshua Perl , Yongqun He , Christopher Ochs , James Geller , Vipina K Keloth

J Biomed Inform

August 2021

The current intensive research on potential remedies and vaccinations for COVID-19 would greatly benefit from an ontology of standardized COVID terms. The Coronavirus Infectious Disease Ontology (CIDO) is the largest among several COVID ontologies, and it keeps growing, but it is still a medium sized ontology. Sophisticated CIDO users, who need more than searching for a specific concept, require orientation and comprehension of CIDO.

View Article and Find Full Text PDF

Extending import detection algorithms for concept import from two to three biomedical terminologies.

Vipina K Keloth , James Geller , Yan Chen , Julia Xu

BMC Med Inform Decis Mak

December 2020

Background: While enrichment of terminologies can be achieved in different ways, filling gaps in the IS-A hierarchy backbone of a terminology appears especially promising. To avoid difficult manual inspection, we started a research program in 2014, investigating terminology densities, where the comparison of terminologies leads to the algorithmic discovery of potentially missing concepts in a target terminology. While candidate concepts have to be approved for import by an expert, the human effort is greatly reduced by algorithmic generation of candidates.

View Article and Find Full Text PDF

Alternative classification of identical concepts in different terminologies: Different ways to view the world.

Vipina K Keloth , Zhe He , Gai Elhanan , James Geller

J Biomed Inform

June 2019

In previous research, we have studied concepts that occur in pairs of medical terminologies and are known to be identical, because they have the same ID number in the Unified Medical Language System (UMLS). We observed that such concepts rarely have exactly the same sets of children (=subconcepts) in the two terminologies. The number of common children was found to vary widely.

View Article and Find Full Text PDF

Extended Analysis of Topological-Pattern-Based Ontology Enrichment.

Zhe He , Vipina Kuttichi Keloth , Yan Chen , James Geller

Proceedings (IEEE Int Conf Bioinformatics Biomed)

December 2018

Maintenance of biomedical ontologies is difficult. We have previously developed a topological-pattern-based method to deal with the problem of identifying concepts in a reference ontology that could be of interest for insertion into a target ontology. Assuming that both ontologies are parts of the Unified Medical Language System (UMLS), the method suggests approximate locations where the target ontology could be extended with new concepts from the reference ontology.

View Article and Find Full Text PDF

Leveraging Horizontal Density Differences between Ontologies to Identify Missing Child Concepts: A Proof of Concept.

Vipina K Keloth , Zhe He , Yan Chen , James Geller

AMIA Annu Symp Proc

October 2019

Previously, we investigated pairs of ontologies with local similarities where corresponding "is-a" paths are of different lengths. This indicated the possibility of importing concepts from one ontology into the other. We referred to such structures as diamonds of concepts.

View Article and Find Full Text PDF

How Sustainable are Biomedical Ontologies?

James Geller , Vipina K Keloth , Mark A Musen

AMIA Annu Symp Proc

September 2019

BioPortal is widely regarded to be the world's most comprehensive repository of biomedical ontologies. With a coverage of many biomedical subfields by 716 ontologies (June 27, 2018), BioPortal is an extremely diverse repository. BioPortal maintains easily accessible information about the ontologies submitted by ontology curators.

View Article and Find Full Text PDF