Prediction of acute toxicity of organic contaminants to fish: Model development and a novel approach to identify reactive substructures.

J Hazard Mater

School of Environmental Science and Engineering, Tianjin University, Tianjin 300072, China. Electronic address:

Published: July 2025


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In this study, count-based Morgan fingerprints (CMF) were employed to represent the fundamental chemical structures of contaminants, and a neural network model (R² = 0.76) was developed to predict acute fish toxicity (AFT) of organic compounds. Models based on CMF consistently outperformed those based on binary Morgan fingerprints (BMF), likely due to the latter's inefficiency in describing homologous structures. The similarity of CMF was calculated using an improved method based on Tanimoto distance, which was used for calculation of dataset partition and application domain. The similarity-based dataset partitioning method ensured structural diversity within the training set and improved performance on the validation set, demonstrating its potential for toxicological structure analysis and priority screening. Toxic substructures identified by Shapley additive explanation (SHAP) method were substituted benzenes, long carbon chains, unsaturated carbons and halogen atoms. By incorporating K and monitoring shifts in feature importance, the influence of substructures on AFT was further delineated, revealing their roles in facilitating exposure (e.g.: long carbon chains) and reactive toxicity (e.g.: methyl). Additionally, we compared the toxicity of similar substructures and the same substructure in different chemical environments as well. To address SHAP's insensitivity to low-variance features, this study introduced a novel metric termed the toxicity index (TI), designed to pinpoint substructures that are present in minimal quantities yet potentially exhibit high toxicity. With TI, we identified several important substructures, such as parathion and polycyclic substituents. Finally, prevalent toxic substructures and potential highly toxic substances were identified in two external datasets.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.jhazmat.2025.137917DOI Listing

Publication Analysis

Top Keywords

morgan fingerprints
8
toxic substructures
8
long carbon
8
carbon chains
8
substructures
7
toxicity
6
prediction acute
4
acute toxicity
4
toxicity organic
4
organic contaminants
4

Similar Publications

A Comparative Evaluation of Machine Learning and Deep Graph Learning for Chemical Ecotoxicological Prediction.

ACS Omega

August 2025

Centre in Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China.

Regulating chemicals to protect the environment based on ecotoxicological assessments is a major challenge. However, experimental ecotoxicity tests are time-consuming and expensive, which underscores the need for accurate prediction methods. In this study, we conducted a comprehensive analysis on the application of machine learning and graph-based learning techniques for the ecotoxicological prediction of chemicals.

View Article and Find Full Text PDF

Natural products provide a rich source of bioactive molecules for a variety of applications. Molecular fingerprints are the tool of choice for systematic large-scale studies of their structures. However, current molecular fingerprints insufficiently represent characteristic features of natural products inherently, decreasing the interpretability of natural product-specific predictions.

View Article and Find Full Text PDF

Assessing chemical toxicity is essential for understanding potential risks to human health. However, ethical, financial, and scientific challenges have driven the demand for non-animal testing methods. This study introduces a computational framework that leverages diverse molecular representations, including MACCS keys, Morgan fingerprints, and Mordred descriptors, to predict skin sensitization, irritation/corrosion, and acute dermal toxicity.

View Article and Find Full Text PDF

Background: Drug development is a time-consuming and costly endeavor, and utilizing computer-aided methods to predict drug-target affinity (DTA) can significantly accelerate this process. The key to accurate DTA prediction lies in selecting appropriate computational models to effectively extract features from drug molecular structures and target protein structures. Existing methods usually ignore the features of the protein three-dimensional structure.

View Article and Find Full Text PDF

Efficient colorimetric assay promotes the virtual screen of granzyme B inhibitors.

Mikrochim Acta

August 2025

Third Hospital of Shanxi Medical University, Shanxi Bethune Hospital, Shanxi Academy of Medical Sciences Tongji Shanxi  Hospital, Shanxi Medical University, Taiyuan, 030032, China.

Granzyme B (GrmB) is a key biomarker for immune activation and tumor cell eradication, as well as the therapeutic target for autoimmune and chronic inflammatory disorders. Recent bioinformatic methods have been extensively applied to discover new sites of action, thereby enabling the screening of corresponding inhibitors. However, verification of silico predictions requires efficient experimental tests in vitro, and this work aims to provide an efficient assay method that enables the colorimetric detection of GrmB activity with high specificity and sensitivity (0.

View Article and Find Full Text PDF