98%
921
2 minutes
20
In this study, count-based Morgan fingerprints (CMF) were employed to represent the fundamental chemical structures of contaminants, and a neural network model (R² = 0.76) was developed to predict acute fish toxicity (AFT) of organic compounds. Models based on CMF consistently outperformed those based on binary Morgan fingerprints (BMF), likely due to the latter's inefficiency in describing homologous structures. The similarity of CMF was calculated using an improved method based on Tanimoto distance, which was used for calculation of dataset partition and application domain. The similarity-based dataset partitioning method ensured structural diversity within the training set and improved performance on the validation set, demonstrating its potential for toxicological structure analysis and priority screening. Toxic substructures identified by Shapley additive explanation (SHAP) method were substituted benzenes, long carbon chains, unsaturated carbons and halogen atoms. By incorporating K and monitoring shifts in feature importance, the influence of substructures on AFT was further delineated, revealing their roles in facilitating exposure (e.g.: long carbon chains) and reactive toxicity (e.g.: methyl). Additionally, we compared the toxicity of similar substructures and the same substructure in different chemical environments as well. To address SHAP's insensitivity to low-variance features, this study introduced a novel metric termed the toxicity index (TI), designed to pinpoint substructures that are present in minimal quantities yet potentially exhibit high toxicity. With TI, we identified several important substructures, such as parathion and polycyclic substituents. Finally, prevalent toxic substructures and potential highly toxic substances were identified in two external datasets.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.jhazmat.2025.137917 | DOI Listing |
ACS Omega
August 2025
Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China.
Regulating chemicals to protect the environment based on ecotoxicological assessments is a major challenge. However, experimental ecotoxicity tests are time-consuming and expensive, which underscores the need for accurate prediction methods. In this study, we conducted a comprehensive analysis on the application of machine learning and graph-based learning techniques for the ecotoxicological prediction of chemicals.
View Article and Find Full Text PDFJ Cheminform
August 2025
Bioinformatics Group, Wageningen University & Research, Droevendaalsesteeg 4, 6708 PB, Wageningen, the Netherlands.
Natural products provide a rich source of bioactive molecules for a variety of applications. Molecular fingerprints are the tool of choice for systematic large-scale studies of their structures. However, current molecular fingerprints insufficiently represent characteristic features of natural products inherently, decreasing the interpretability of natural product-specific predictions.
View Article and Find Full Text PDFComput Struct Biotechnol J
August 2025
Entelos Institute, Nicosia 2102, Cyprus.
Assessing chemical toxicity is essential for understanding potential risks to human health. However, ethical, financial, and scientific challenges have driven the demand for non-animal testing methods. This study introduces a computational framework that leverages diverse molecular representations, including MACCS keys, Morgan fingerprints, and Mordred descriptors, to predict skin sensitization, irritation/corrosion, and acute dermal toxicity.
View Article and Find Full Text PDFBMC Genomics
August 2025
Faculty of Computer Science and Control Engineering, Shenzhen University of Advanced Technology, Shenzhen, 518107, China.
Background: Drug development is a time-consuming and costly endeavor, and utilizing computer-aided methods to predict drug-target affinity (DTA) can significantly accelerate this process. The key to accurate DTA prediction lies in selecting appropriate computational models to effectively extract features from drug molecular structures and target protein structures. Existing methods usually ignore the features of the protein three-dimensional structure.
View Article and Find Full Text PDFMikrochim Acta
August 2025
Third Hospital of Shanxi Medical University, Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences Tongji Shanxi Hospital, Shanxi Medical University, Taiyuan, 030032, China.
Granzyme B (GrmB) is a key biomarker for immune activation and tumor cell eradication, as well as the therapeutic target for autoimmune and chronic inflammatory disorders. Recent bioinformatic methods have been extensively applied to discover new sites of action, thereby enabling the screening of corresponding inhibitors. However, verification of silico predictions requires efficient experimental tests in vitro, and this work aims to provide an efficient assay method that enables the colorimetric detection of GrmB activity with high specificity and sensitivity (0.
View Article and Find Full Text PDF