Citations: 20

Article Abstract

Data augmentation can alleviate the limitations of small molecular datasets for generative deep learning by 'artificially inflating' the number of instances available for training. SMILES enumeration - wherein multiple valid SMILES strings are used to represent the same molecule - has become particularly beneficial to improve the quality of molecule design. Herein, we investigated whether rethinking SMILES augmentation techniques could further enhance the quality of design. To this end, we introduce four novel approaches for SMILES augmentation, drawing inspiration from natural language processing and chemistry insights: (a) token deletion, (b) atom masking, (c) bioisosteric substitution, and (d) self-training. Through systematic analysis, our results showed the promise of considering additional strategies for SMILES augmentation. Every strategy showed distinct advantages; for example, atom masking is particularly promising to learn desirable physico-chemical properties in very low-data regimes, and token deletion to create novel scaffolds. This new repertoire of SMILES augmentation strategies expands the available toolkit to design molecules with bespoke properties in low-data scenarios.
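The abstract names token deletion and atom masking but does not say how they are implemented. The sketch below is only an illustration under stated assumptions, not the authors' method: the simplified tokenizer, the function names (`tokenize`, `delete_tokens`, `mask_atoms`), the per-token probability `p`, and the use of `*` as the mask token are all hypothetical choices made here.

```python
import random
import re

# Simplified SMILES tokenizer (an assumption of this sketch): bracket atoms,
# Cl/Br, two-digit ring closures, single letters, digits, and common symbols.
TOKEN_RE = re.compile(r"\[[^\]]+\]|Br|Cl|%\d{2}|[A-Za-z]|\d|[()=#\-+\\/:.~@*$]")
# Tokens treated as atoms for masking (bracket atoms and organic-subset atoms).
ATOM_RE = re.compile(r"\[[^\]]+\]|Br|Cl|[BCNOPSFIbcnops]")


def tokenize(smiles):
    """Split a SMILES string into tokens; fail if any character is not covered."""
    tokens = TOKEN_RE.findall(smiles)
    if "".join(tokens) != smiles:
        raise ValueError(f"could not tokenize: {smiles!r}")
    return tokens


def delete_tokens(smiles, p=0.1, rng=None):
    """Drop each token independently with probability p.

    The result is not guaranteed to be a valid SMILES string; a validity
    check (for example with a cheminformatics toolkit) would be a separate step.
    """
    rng = rng or random.Random()
    return "".join(t for t in tokenize(smiles) if rng.random() >= p)


def mask_atoms(smiles, p=0.1, mask="*", rng=None):
    """Replace each atom token with a placeholder with probability p."""
    rng = rng or random.Random()
    return "".join(
        mask if ATOM_RE.fullmatch(t) and rng.random() < p else t
        for t in tokenize(smiles)
    )


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same strings
    aspirin = "CC(=O)Oc1ccccc1C(=O)O"
    print(delete_tokens(aspirin, p=0.15, rng=rng))
    print(mask_atoms(aspirin, p=0.15, rng=rng))
```

Passing a seeded `random.Random` keeps the augmented strings reproducible. Both functions act on tokens only, so structure tokens such as ring-closure digits and parentheses are never masked but can be deleted.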


Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12409607 (PMC)
http://dx.doi.org/10.1039/d5dd00028a (DOI)

Publication Analysis

Top Keywords

smiles augmentation: 16
smiles enumeration: 8
data augmentation: 8
atom masking: 8
properties low-data: 8
augmentation: 6
smiles: 6
going smiles: 4
enumeration data: 4
augmentation generative: 4

Similar Publications

Going beyond SMILES enumeration for data augmentation in generative drug discovery.

Digit Discov

August 2025

Institute for Complex Molecular Systems (ICMS), Eindhoven AI Systems Institute (EAISI), Department of Biomedical Engineering, Eindhoven University of Technology Eindhoven The Netherlands


Diabetes remains one of the critical health issues worldwide, and its prevalence is rising owing to factors such as obesity and a sedentary lifestyle. Traditional herbal medications and natural products, particularly inhibitors of enzymes such as alpha-glucosidase, serve as promising alternatives. This study attempted to identify potent alpha-glucosidase inhibitors by incorporating data augmentation into deep-learning modeling.


Cardiothoracic surgery remains one of the most challenging specialties to train in, particularly in resource-limited settings where traditional apprenticeship-based models remain impracticable. This article addresses how emerging digital technologies - virtual reality (VR) simulators, augmented reality (AR) platforms, artificial intelligence (AI) surgical training platforms, and tele-mentoring platforms - are transforming cardiothoracic surgical training globally. We present convincing evidence from African implementation studies demonstrating that these innovative approaches can effectively address critical training gaps.


Optimizing Hard and Soft-Tissue Esthetics With Anterior Cantilever Zirconia Ceramic Resin-Bonded Fixed Dental Prostheses.

J Esthet Restor Dent

August 2025

Department of Preventive and Restorative Sciences, School of Dental Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA.

Objective: The replacement of missing maxillary lateral incisors poses both functional and esthetic challenges, not only from a restorative but also from a periodontal aspect. This case report presents a step-by-step protocol for ideal hard and soft-tissue esthetics with cantilever zirconia ceramic resin-bonded fixed dental prostheses (RBFDPs).

Clinical Considerations: A 20-year-old female patient presented with congenitally missing maxillary lateral incisors, seeking treatment to enhance the esthetics of her smile.


Background: Antibiotic prophylaxis in dental implant surgery remains a contentious topic, with varying guidelines and clinical practices worldwide.

Aim: To report how Swedish dentists use antibiotic prophylaxis during dental implant surgery, based on a review of patient records.

Method: This retrospective, cross-sectional study evaluated the antibiotic prophylaxis habits of Swedish dentists, focusing on the relationship between surgical complexity and antibiotic use.
