J Chem Inf Model
December 2024
The recent SmartGraph platform facilitates the execution of complex drug-discovery workflows with ease in the network-pharmacology paradigm. However, at the time of its publication we identified the need for the development of an Application Programming Interface (API) that could promote biomedical data integration and hypothesis generation in an automated manner. This need was magnified at the time of the COVID-19 pandemic.
View Article and Find Full Text PDFNucleic Acids Res
January 2025
CORUM (https://mips.helmholtz-muenchen.de/corum/) is a public database that offers comprehensive information about mammalian protein complexes, including their subunits, functions and associations with human diseases.
View Article and Find Full Text PDFTIN-X (Target Importance and Novelty eXplorer) is an interactive visualization tool for illuminating associations between diseases and potential drug targets and is publicly available at newdrugtargets.org. TIN-X uses natural language processing to identify disease and protein mentions within PubMed content using previously published tools for named entity recognition (NER) of gene/protein and disease names.
View Article and Find Full Text PDFMotivation: Graph representation learning is a family of related approaches that learn low-dimensional vector representations of nodes and other graph elements called embeddings. Embeddings approximate characteristics of the graph and can be used for a variety of machine-learning tasks such as novel edge prediction. For many biomedical applications, partial knowledge exists about positive edges that represent relationships between pairs of entities, but little to no knowledge is available about negative edges that represent the explicit lack of a relationship between two nodes.
View Article and Find Full Text PDFDrug Discov Today
March 2024
The Knowledge Management Center (KMC) for the Illuminating the Druggable Genome (IDG) project aims to aggregate, update, and articulate protein-centric data knowledge for the entire human proteome, with emphasis on the understudied proteins from the three IDG protein families. KMC collates and analyzes data from over 70 resources to compile the Target Central Resource Database (TCRD), which is the web-based informatics platform (Pharos). These data include experimental, computational, and text-mined information on protein structures, compound interactions, and disease and phenotype associations.
View Article and Find Full Text PDFJ Comput Aided Mol Des
December 2023
Annu Rev Pharmacol Toxicol
January 2024
Drug discovery is adapting to novel technologies such as data science, informatics, and artificial intelligence (AI) to accelerate effective treatment development while reducing costs and animal experiments. AI is transforming drug discovery, as indicated by increasing interest from investors, industrial and academic scientists, and legislators. Successful drug discovery requires optimizing properties related to pharmacodynamics, pharmacokinetics, and clinical outcomes.
View Article and Find Full Text PDFJ Comput Aided Mol Des
December 2023
DrugCentral, accessible at https://drugcentral.org , is an open-access online drug information repository. It covers over 4950 drugs, incorporating structural, physicochemical, and pharmacological details to support drug discovery, development, and repositioning.
View Article and Find Full Text PDFMolecular complexity (MC) lacks a universal definition, but various studies address it in contexts ranging from ligand-receptor interactions to DNA sequencing, with the overarching emphasis being its significance in synthetic organic chemistry and pharmaceutical research. Efforts to quantify MC in drug discovery have been numerous, but a unified approach remains challenging. Strategies based on graph theory, information theory, and substructural feature counts employed to gauge MC are often correlated to molecular weight (MW).
View Article and Find Full Text PDFCommun Med (Lond)
July 2023
The patent literature is a potentially valuable source of bioactivity data. In this article we describe a process to prioritise 3.7 million life science relevant patents obtained from the SureChEMBL database (https://www.
View Article and Find Full Text PDFNucleic Acids Res
January 2023
The Illuminating the Druggable Genome (IDG) project aims to improve our understanding of understudied proteins and our ability to study them in the context of disease biology by perturbing them with small molecules, biologics, or other therapeutic modalities. Two main products from the IDG effort are the Target Central Resource Database (TCRD) (http://juniper.health.
View Article and Find Full Text PDFNucleic Acids Res
January 2023
DrugCentral monitors new drug approvals and standardizes drug information. The current update contains 285 drugs (131 for human use). New additions include: (i) the integration of veterinary drugs (154 for animal use only), (ii) the addition of 66 documented off-label uses and iii) the identification of adverse drug events from pharmacovigilance data for pediatric and geriatric patients.
View Article and Find Full Text PDFWe report the main conclusions of the first Chemoinformatics and Artificial Intelligence Colloquium, Mexico City, June 15-17, 2022. Fifteen lectures were presented during a virtual public event with speakers from industry, academia, and non-for-profit organizations. Twelve hundred and ninety students and academics from more than 60 countries.
View Article and Find Full Text PDFTranslating human genetic findings (genome-wide association studies [GWAS]) to pathobiology and therapeutic discovery remains a major challenge for Alzheimer's disease (AD). We present a network topology-based deep learning framework to identify disease-associated genes (NETTAG). We leverage non-coding GWAS loci effects on quantitative trait loci, enhancers and CpG islands, promoter regions, open chromatin, and promoter flanking regions under the protein-protein interactome.
View Article and Find Full Text PDFMol Biol Cell
December 2022
Experimental and computational studies pinpoint rate-limiting step(s) in metastasis governed by Rac1. Using ovarian cancer cell and animal models, Rac1 expression was manipulated, and quantitative measurements of cell-cell and cell-substrate adhesion, cell invasion, mesothelial clearance, and peritoneal tumor growth discriminated the tumor behaviors most highly influenced by Rac1. The experimental data were used to parameterize an agent-based computational model simulating peritoneal niche colonization, intravasation, and hematogenous metastasis to distant organs.
View Article and Find Full Text PDFNat Rev Chem
April 2022
One aspirational goal of computational chemistry is to predict potent and drug-like binders for any protein, such that only those that bind are synthesized. In this Roadmap, we describe the launch of Critical Assessment of Computational Hit-finding Experiments (CACHE), a public benchmarking project to compare and improve small molecule hit-finding algorithms through cycles of prediction and experimental testing. Participants will predict small molecule binders for new and biologically relevant protein targets representing different prediction scenarios.
View Article and Find Full Text PDFThe scientific knowledge about which genes are involved in which diseases grows rapidly, which makes it difficult to keep up with new publications and genetics datasets. The DISEASES database aims to provide a comprehensive overview by systematically integrating and assigning confidence scores to evidence for disease-gene associations from curated databases, genome-wide association studies (GWAS) and automatic text mining of the biomedical literature. Here, we present a major update to this resource, which greatly increases the number of associations from all these sources.
View Article and Find Full Text PDFThe Biopharmaceutics Drug Disposition Classification system (BDDCS) is a four-class approach based on water solubility and extent of metabolism/permeability rate. Based on the BDDCS class to which a drug is assigned, it is possible to predict the role of metabolic enzymes and transporters on the drug disposition of a new molecular entity (NME) prior to its administration to animals or humans. Here, we report a total of 1475 drugs and active metabolites to which the BDDCS is applied.
View Article and Find Full Text PDF