Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

Inference of molecules with desired activities/properties is one of the key and challenging issues in cheminformatics and bioinformatics. For that purpose, our research group has recently developed a state-of-the-art framework mol-infer for molecular inference. This framework first constructs a prediction function for a fixed property using machine learning models, which is then simulated by mixed-integer linear programming to infer desired molecules. The accuracy of the framework heavily relies on the representation power of the descriptors. In this study, we highlight a typical class of non-isomorphic chemical graphs with reasonably different property values that cannot be distinguished by the standard "two-layered (2L) model" of mol-infer. To address this distinguishability problem of the 2L model, we propose a novel family of descriptors, named cycle-configuration (CC), which captures the notion of ortho/meta/para patterns that appear in aromatic rings, which was impossible in the framework so far. Extensive computational experiments show that with the new descriptors, we can construct prediction functions with similar or better performance for all 44 tested chemical properties, including 27 regression datasets and 17 classification datasets comparing with our previous studies, confirming the effectiveness of the CC descriptors. For inference, we also provide a system of linear constraints to formulate the CC descriptors as linear constraints. We demonstrate that a chemical graph with up to 50 non-hydrogen vertices can be inferred within a practical time frame.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12360023PMC
http://dx.doi.org/10.1186/s13321-025-01042-zDOI Listing

Publication Analysis

Top Keywords

molecular inference
8
linear constraints
8
descriptors
5
cycle-configuration descriptors
4
descriptors novel
4
novel graph-theoretic
4
graph-theoretic approach
4
approach enhancing
4
enhancing molecular
4
inference
4

Similar Publications

DNA fecal metabarcoding has revolutionized the field of herbivore diet analyses, offering deeper insight into plant-herbivore interactions and more reliable ecological inferences. However, due to PCR amplification bias, primer selection has a major impact on the validity of these inferences and insights. Using two pooling approaches on four mock communities and a case study examining diets of four large mammalian herbivores (LMH), we evaluated the efficacy of two primer pairs targeting the internal transcribed spacer 2 (ITS2) region: the widely used ITS-S2F/ITS4 pair and the UniPlant F/R pair, designed specifically for DNA metabarcoding.

View Article and Find Full Text PDF

Detecting Introgression in Shallow Phylogenies: How Minor Molecular Clock Deviations Lead to Major Inference Errors.

Mol Biol Evol

September 2025

Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing 100875, China.

Recent theoretical and algorithmic advances in introgression detection, coupled with the growing availability of genome-scale data, have highlighted the widespread occurrence of interspecific gene flow across the tree of life. However, current methods largely depend on the molecular clock assumption-a questionable premise given empirical evidence of substitution rate variation across lineages. While such rate heterogeneity is known to compromise gene flow detection among divergent lineages, its impact on closely related taxa at shallow evolutionary timescales remains poorly understood, likely because these taxa are often assumed to adhere to a molecular clock.

View Article and Find Full Text PDF

Recently photoinduced dynamic ligation in a metal-organic frameworks (MOFs) was reported, where a long-lived charge-transfer excited state (ca. 30 μs) featuring partial dissociation between the carboxylate linker and metal-based node was probed by time-resolved infrared (TRIR) spectroscopy. The study offers a new mechanistic perspective to evaluate the potential contribution from the excited state molecular configuration to the performance of MOF photocatalysts.

View Article and Find Full Text PDF

High-throughput phytoplankton monitoring and screening of harmful and bloom-forming algae in coastal waters with updated functional screening database.

Mar Pollut Bull

September 2025

Department of Science and Environmental Studies, The Education University of Hong Kong, New Territories, Hong Kong; State Key Laboratory of Marine Environmental Health, City University of Hong Kong, Kowloon, Hong Kong. Electronic address:

Climate change and anthropogenic pressures alter phytoplankton phenology, distribution, and bloom frequency. Healthy phytoplankton communities are crucial for biogeochemical processes, blue carbon sequestration, and climate change mitigation. By employing high-throughput 18S V4 rRNA metabarcoding, we addressed the need for profiling phytoplankton community and response mechanisms in urbanized coastal ecosystems.

View Article and Find Full Text PDF

Programmable self-assembly has recently enabled the creation of complex structures through precise control of the interparticle interactions and the particle geometries. Targeting ever more structurally complex, dynamic, and functional assemblies necessitates going beyond the design of the structure itself, to the measurement and control of the local flexibility of the intersubunit connections and its impact on the collective mechanics of the entire assembly. In this study, we demonstrate a method to infer the mechanical properties of multisubunit assemblies using cryogenic electron microscopy (cryo-EM) and RELION's multi-body refinement.

View Article and Find Full Text PDF