Enabling data-limited chemical bioactivity predictions through deep neural network transfer learning.

J Comput Aided Mol Des

Department of Defense Biotechnology High Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Development Command, FCMR-TT, 504 Scott Street, Fort Detrick, MD, 21702-5012, USA.

Published: December 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The main limitation in developing deep neural network (DNN) models to predict bioactivity properties of chemicals is the lack of sufficient assay data to train the network's classification layers. Focusing on feedforward DNNs that use atom- and bond-based structural fingerprints as input, we examined whether layers of a fully trained DNN based on large amounts of data to predict one property could be used to develop DNNs to predict other related or unrelated properties based on limited amounts of data. Hence, we assessed if and under what conditions the dense layers of a pre-trained DNN could be transferred and used for the development of another DNN associated with limited training data. We carried out a quantitative study employing more than 400 pairs of assay datasets, where we used fully trained layers from a large dataset to augment the training of a small dataset. We found that the higher the correlation r between two assay datasets, the more efficient the transfer learning is in reducing prediction errors associated with the smaller dataset DNN predictions. The reduction in mean squared prediction errors ranged from 10 to 20% for every 0.1 increase in r between the datasets, with the bulk of the error reductions associated with transfers of the first dense layer. Transfer of other dense layers did not result in additional benefits, suggesting that deeper, dense layers conveyed more specialized and assay-specific information. Importantly, depending on the dataset correlation, training sample size could be reduced by up to tenfold without any loss of prediction accuracy.

Download full-text PDF

Source
http://dx.doi.org/10.1007/s10822-022-00486-xDOI Listing

Publication Analysis

Top Keywords

dense layers
12
deep neural
8
neural network
8
transfer learning
8
fully trained
8
amounts data
8
assay datasets
8
prediction errors
8
layers
6
dnn
5

Similar Publications

Core-Periphery Detection in Multilayer Networks.

Phys Rev Lett

August 2025

Gran Sasso Science Institute, The University of Edinburgh, School of Mathematics, Edinburgh EH93FD, United Kingdom and School of Mathematics, 67100 L'Aquila, Italy.

Multilayer networks provide a powerful framework for modeling complex systems that capture different types of interactions between the same set of entities across multiple layers. Core-periphery detection involves partitioning the nodes of a network into core nodes, which are highly connected across the network, and peripheral nodes, which are densely connected to the core but sparsely connected among themselves. In this paper, we propose a new model of core-periphery structure in multilayer networks and a nonlinear spectral method that simultaneously detects the corresponding core and periphery structures of both nodes and layers in weighted and directed multilayer networks.

View Article and Find Full Text PDF

Hybrid two-stage CNN for detection and staging of periodontitis on panoramic radiographs.

J Oral Biol Craniofac Res

August 2025

Neura Integrasi Solusi, Jl. Kebun Raya No. 73, Rejowinangun, Kotagede, Yogyakarta, 55171, Indonesia.

Background: Periodontal disease is an inflammatory condition causing chronic damage to the tooth-supporting connective tissues, leading to tooth loss in adults. Diagnosing periodontitis requires clinical and radiographic examinations, with panoramic radiographs crucial in identifying and assessing its severity and staging. Convolutional Neural Networks (CNNs), a deep learning method for visual data analysis, and Dense Convolutional Networks (DenseNet), which utilize direct feed-forward connections between layers, enable high-performance computer vision tasks with reduced computational demands.

View Article and Find Full Text PDF

The paraventricular thalamic nucleus (PVT) integrates subcortical signals related to arousal, stress, addiction, and anxiety with top-down cortical influences. Increases or decreases in PVT activity exert profound, long-lasting effects on behavior related to motivation, addiction and homeostasis. Yet the sources of its subcortical excitatory and inhibitory afferents, their distribution within the PVT, and their integration with layer-specific cortical inputs remain unclear.

View Article and Find Full Text PDF

Fluorinated Imidazolidinium Cations as a Fluorine-Lean Interface Repairing Agent for Li-Metal Batteries.

ACS Appl Mater Interfaces

September 2025

Department of Chemistry and Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Institute of New Energy, iChEM (Collaborative Innovation Center of Chemistry for Energy Materials), Fudan University, Shanghai 200433, China.

Li-metal batteries promise ultrahigh energy density, but their application is limited by Li-dendrite growth. Theoretically, fluorine-containing anions such as bis(fluorosulfonyl)imide (FSI) in electrolytes can be reduced to form LiF-rich solid-electrolyte interphases (SEIs) with high Young's modulus and ionic conductivity that can suppress dendrites. However, the anions migrate toward the cathode during the charging process, accompanied by a decrease in the concentration of interfacial anions near the anode surface.

View Article and Find Full Text PDF

High-strength Janus cellulose/MXene composite paper from deep eutectic solvent-carboxymethylated eucalyptus fibers for electromagnetic shielding.

Int J Biol Macromol

September 2025

Plant Fiber Material Science Research Center, State Key Laboratory of Advanced Papermaking and Paper-based Materials, South China University of Technology, Guangzhou, 510640, China.

The development of cellulose-based electromagnetic shielding materials is critical for the advancement of sustainable, lightweight, and flexible electronic devices. Most high-performance composites rely on nanocellulose, which is expensive and energy-intensive to produce. In this work, we employ chemically modified conventional eucalyptus pulp fibers (non-nano) to fabricate Janus-structured cellulose/MXene composite papers.

View Article and Find Full Text PDF