Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Marine dissolved organic matter (DOM) is an extremely complex mixture of organic compounds that plays a crucial role in the global carbon cycle. In the Arctic, climate change accelerates the release of terrestrial organic carbon. Since chemical information is the only way to track DOM sources and fate, it is essential to improve analytical and data science approaches to assess the DOM composition. Here, we compare random forest (RF), support vector machines, and generalized linear models (GLM) to predict a fluorescence-derived proxy for terrestrial DOM based on molecular formula data from liquid chromatography coupled with Fourier transform mass spectrometry (LC-FTMS). We systematically evaluate different data preprocessing, normalization, and ML techniques to optimize prediction accuracy and computational efficiency. Our results show that a generalized linear model (GLM) with sum normalization provides the most accurate and efficient predictions, achieving a normalized root-mean-square error (NRMSE) of 5.7%close to the precision of the fluorescence measurement. The prediction based on RF regression was slightly less accurate and required significantly more computation time compared to GLM, but it was most robust against data preprocessing and independent of linear correlations. Feature selection significantly improved the performance of all models, with robust predictions obtained using only ca.  2000 of the ca.  70,000 molecular features per sample. Additionally, we assessed the impact of chromatographic retention time on prediction accuracy and explored the key molecular features contributing to terrestrial DOM signatures using Shapley values and permutation importance (for RFs). Our study is a blueprint for the application of ML to enhance the analysis of high-resolution mass spectrometry data, offering a scalable approach for predicting information important for the understanding of marine DOM chemistry.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12268369PMC
http://dx.doi.org/10.1021/acsomega.5c02849DOI Listing

Publication Analysis

Top Keywords

dissolved organic
8
organic matter
8
generalized linear
8
terrestrial dom
8
mass spectrometry
8
data preprocessing
8
prediction accuracy
8
molecular features
8
data
6
dom
6

Similar Publications

Effects of Imidacloprid on Afrotropical Aquatic Ecosystems: A South African Microcosm Study.

Integr Environ Assess Manag

September 2025

Water Research Group, Unit for Environmental Sciences and Management, North-West University, Potchefstroom, South Africa.

Pesticides are widely used to meet the food demands of a growing population, with various types used to control pests depending on the crops grown. Rainfall, overspray, and runoff from agricultural fields can wash these insecticides into water bodies, posing documented environmental risks. Imidacloprid is commonly used in Afrotropical regions such as South Africa, yet limited information is available on its toxicity to aquatic ecosystems within this climate region.

View Article and Find Full Text PDF

Several micro- and nanoplastic particle (MNP) traits, like polymer type, size, and shape, have been shown to influence MNP toxicity. However, the direction and strength of these moderating effects are often unclear, and generalizations from single studies are challenging to establish. Meta-analyses increase generalizability and derive more accurate and precise effect size estimates by combining measurements from published studies.

View Article and Find Full Text PDF

Introduction: Soil dissolved organic matter (DOM) regulates nutrient cycling and carbon sequestration, yet how cropping systems (rotation vs. monoculture) shape the vertical distribution and molecular traits of DOM remains unclear.

Methods: We leveraged a long-term experiment (est.

View Article and Find Full Text PDF

Efficient and low-cost removal of dissolved organic phosphorus by visible light-enhanced Ti electrocoagulation with self-generated rutile photocatalysts.

Water Res

August 2025

State Key Laboratory of Environmental Aquatic Chemistry, Key Laboratory of Drinking Water Science and Technology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China.

Phosphorus is recognized as a major pollutant in municipal and domestic wastewater, but the effective removal of organic phosphorus (OP) using conventional wastewater treatment technologies is difficult. Herein, a novel visible light-enhanced Ti electrocoagulation (EC) technology was proposed for the removal of OP using 2-amino-ethyl phosphonic acid (AEP) as a model compound to elucidate the removal efficiency and mechanisms. The results showed that the irradiation under visible light (670 Lux) effectively enhanced the removal of AEP by Ti EC.

View Article and Find Full Text PDF

Adsorption-desorption behavior of difenoconazole onto soils: Kinetics, isotherms, and influencing factors.

Pestic Biochem Physiol

November 2025

National Engineering Laboratory for Efficient Utilization of Soil and Fertilizer Resources, Key Laboratory of Agricultural Environment in Universities of Shandong, College of Resources and Environment, Shandong Agricultural University, 61 Daizong Road, Tai'an 271018, PR China. Electronic address: wj

Difenoconazole (DFC) is a commonly used triazole fungicide known for its high efficiency and environmental persistence. A thorough understanding of its environmental behavior, particularly sorption in soil, is critical to obtain a comprehensive assessment of the ecological risk of DFC. In this study, three soils with distinct physicochemical properties (brown soil, cinnamon soil, and fluvo-aquic soil) were used to elucidate the adsorption mechanisms of DFC on soil.

View Article and Find Full Text PDF