Natural Language Processing Methods for the Study of Protein-Ligand Interactions.

ArXiv

Department of Biology and Center for Biodiversity and Conservation Research, University of Mississippi, University, MS.

Published: October 2024


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Natural Language Processing (NLP) has revolutionized the way computers are used to study and interact with human languages and is increasingly influential in the study of protein and ligand binding, which is critical for drug discovery and development. This review examines how NLP techniques have been adapted to decode the "language" of proteins and small molecule ligands to predict protein-ligand interactions (PLIs). We discuss how methods such as long short-term memory (LSTM) networks, transformers, and attention mechanisms can leverage different protein and ligand data types to identify potential interaction patterns. Significant challenges are highlighted, including the scarcity of high-quality negative data, difficulties in interpreting model decisions, and sampling biases of existing datasets. We argue that focusing on improving data quality, enhancing model robustness, and fostering both collaboration and competition could catalyze future advances in machine-learning-based predictions of PLIs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11527106PMC

Publication Analysis

Top Keywords

natural language
8
language processing
8
protein-ligand interactions
8
protein ligand
8
processing methods
4
methods study
4
study protein-ligand
4
interactions natural
4
processing nlp
4
nlp revolutionized
4

Similar Publications

Integrating clinical anxiety scales with pre-trained language models for anxiety recognition on social media.

Health Inf Sci Syst

December 2025

Gansu Provincial Key Laboratory of Wearable Computing, School of Information Science and Engineering, Lanzhou University, Lanzhou, 730000 China.

Leveraging natural language processing to identify anxiety states from social media has been widely studied. However, existing research lacks deep user-level semantic modeling and effective anxiety feature extraction. Additionally, the absence of clinical domain knowledge in current models limits their interpretability and medical relevance.

View Article and Find Full Text PDF

Pragmatics: Exploring language use by younger generations in Pedi families.

S Afr J Commun Disord

August 2025

Department of Speech Pathology and Audiology, Faculty of Humanities, University of the Witwatersrand, Johannesburg, South Africa; and Department of Rehabilitative and Natural Sciences, Faculty of Health Sciences, University of Fort Hare, East London.

Background: The people of the Pedi culture place great value on, and take pride in, adhering to their culture, as reflected in the manner in which they communicate verbally and non-verbally. However, little is documented about the ways in which verbal and non-verbal language is used socially by the younger generations in the Pedi culture.

Objectives: This article examines how verbal and non-verbal social language skills and functions are used by the younger generations in Pedi families.

View Article and Find Full Text PDF

Purpose: Depression among college students is a growing concern that negatively affects academic performance, emotional well-being, and career planning. Existing diagnostic methods are often slow, subjective, and inaccessible, underscoring the need for automated systems that can detect depressive symptoms through digital behavior, particularly on social media platforms.

Method: This study proposes a novel natural language processing (NLP) framework that combines a RoBERTa-based Transformer with gated recurrent unit (GRU) layers and multimodal embeddings.

View Article and Find Full Text PDF

Urban planning in the era of large language models.

Nat Comput Sci

September 2025

Department of Electronic Engineering, Tsinghua University, Beijing, China.

City plans are the product of integrating human creativity with emerging technologies, which continuously evolve and reshape urban morphology and environments. Here we argue that large language models hold large untapped potential in addressing the growing complexities of urban planning and enabling a more holistic, innovative and responsive approach to city design. By harnessing their advanced generation and simulation capabilities, large language models can contribute as an intelligent assistant for human planners in synthesizing conceptual ideas, generating urban designs and evaluating the outcomes of planning efforts.

View Article and Find Full Text PDF

Objective: To examine the association between patient disability status and use of stigmatizing language in clinical notes from the hospital admission for birth.

Design: Cross-sectional study of electronic health record data.

Setting: Two urban hospitals in the northeastern United States.

View Article and Find Full Text PDF