Large-scale comparison of machine learning algorithms for target prediction of natural products.

Brief Bioinform

State Key Laboratory of Medicinal Chemical Biology, College of Pharmacy and Tianjin Key Laboratory of Molecular Drug Research, Nankai University, Haihe Education Park, 38 Tongyan Road, Tianjin 300353, China.

Published: September 2022


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Natural products (NPs) and their derivatives are important resources for drug discovery. There are many in silico target prediction methods that have been reported, however, very few of them distinguish NPs from synthetic molecules. Considering the fact that NPs and synthetic molecules are very different in many characteristics, it is necessary to build specific target prediction models of NPs. Therefore, we collected the activity data of NPs and their derivatives from the public databases and constructed four datasets, including the NP dataset, the NPs and its first-class derivatives dataset, the NPs and all its derivatives and the ChEMBL26 compounds dataset. Conditions, including activity thresholds and input features, were explored to access the performance of eight machine learning methods of target prediction of NPs, including support vector machines (SVM), extreme gradient boosting, random forests, K-nearest neighbor, naive Bayes, feedforward neural networks (FNN), convolutional neural networks and recurrent neural networks. As a result, the NPs and all their derivatives datasets were selected to build the best NP-specific models. Furthermore, the consensus models, as well as the voting models, were additionally applied to improve the prediction performance. More evaluations were made on the external validation set and the results demonstrated that (1) the NP-specific model performed better on the target prediction of NPs than the traditional models training on the whole compounds of ChEMBL26. (2) The consensus model of FNN + SVM possessed the best overall performance, and the voting model can significantly improve recall and specificity.

Download full-text PDF

Source
http://dx.doi.org/10.1093/bib/bbac359DOI Listing

Publication Analysis

Top Keywords

target prediction
20
nps derivatives
16
neural networks
12
nps
10
machine learning
8
natural products
8
nps synthetic
8
synthetic molecules
8
dataset nps
8
prediction nps
8

Similar Publications

Analyzing the toxicological effects of PET-MPs on male infertility: Insights from network toxicology, mendelian randomization, and transcriptomics.

Reprod Biol

September 2025

Department of Obstetrics and Gynecology, The First Affiliated Hospital of Anhui Medical University, Hefei 230022, China; Engineering Research Center of Biopreservation and Artificial Organs, Ministry of Education, No 218 Jixi Road, Hefei Anhui230022, China; Key Laboratory of Population Health Across

Current research indicates that polyethylene terephthalate microplastics (PET-MPs) may significantly impair male reproductive function. This study aimed to investigate the potential molecular mechanisms underlying this impairment. Potential gene targets of PET-MPs were predicted via the SwissTargetPrediction database.

View Article and Find Full Text PDF

Objective: Many students who need mental health support do not receive it. We examined associations between perceived barriers and university mental health service access. Participants: First-year Oxford University undergraduates ( = 443) with unmet mental health needs.

View Article and Find Full Text PDF

Objective: To describe the sociodemographic and clinical characteristics of individuals exposed to smoking or biomass smoke and followed at primary health care (PHC) centers across three states in Brazil.

Methods: This was a cross-sectional multicenter study including patients followed at any of four PHC centers in Brazil. Patients ≥ 35 years of age who were smokers or former smokers, or were exposed to biomass smoke were included, the exception being those with physical/mental disabilities and those who were pregnant.

View Article and Find Full Text PDF

This Letter presents an investigation of low-energy electron-neutrino interactions in the Fermilab Booster Neutrino Beam by the MicroBooNE experiment, motivated by the excess of electron-neutrino-like events observed by the MiniBooNE experiment. This is the first measurement to use data from all five years of operation of the MicroBooNE experiment, corresponding to an exposure of 1.11×10^{21} protons on target, a 70% increase on past results.

View Article and Find Full Text PDF

Effectively motivating public action on climate change remains a central challenge for science communicators. This study investigated how message and messenger attributes shape viewers' motivation to act on climate change, and whether these effects vary as a function of political orientation. Using a policy-capturing design, 581 U.

View Article and Find Full Text PDF