Publications by authors named "Tudor I Oprea"

The recent SmartGraph platform facilitates the execution of complex drug-discovery workflows with ease in the network-pharmacology paradigm. However, at the time of its publication we identified the need for the development of an Application Programming Interface (API) that could promote biomedical data integration and hypothesis generation in an automated manner. This need was magnified at the time of the COVID-19 pandemic.

View Article and Find Full Text PDF

CORUM (https://mips.helmholtz-muenchen.de/corum/) is a public database that offers comprehensive information about mammalian protein complexes, including their subunits, functions and associations with human diseases.

View Article and Find Full Text PDF

TIN-X (Target Importance and Novelty eXplorer) is an interactive visualization tool for illuminating associations between diseases and potential drug targets and is publicly available at newdrugtargets.org. TIN-X uses natural language processing to identify disease and protein mentions within PubMed content using previously published tools for named entity recognition (NER) of gene/protein and disease names.

View Article and Find Full Text PDF

Motivation: Graph representation learning is a family of related approaches that learn low-dimensional vector representations of nodes and other graph elements called embeddings. Embeddings approximate characteristics of the graph and can be used for a variety of machine-learning tasks such as novel edge prediction. For many biomedical applications, partial knowledge exists about positive edges that represent relationships between pairs of entities, but little to no knowledge is available about negative edges that represent the explicit lack of a relationship between two nodes.

View Article and Find Full Text PDF

The Knowledge Management Center (KMC) for the Illuminating the Druggable Genome (IDG) project aims to aggregate, update, and articulate protein-centric data knowledge for the entire human proteome, with emphasis on the understudied proteins from the three IDG protein families. KMC collates and analyzes data from over 70 resources to compile the Target Central Resource Database (TCRD), which is the web-based informatics platform (Pharos). These data include experimental, computational, and text-mined information on protein structures, compound interactions, and disease and phenotype associations.

View Article and Find Full Text PDF

Drug discovery is adapting to novel technologies such as data science, informatics, and artificial intelligence (AI) to accelerate effective treatment development while reducing costs and animal experiments. AI is transforming drug discovery, as indicated by increasing interest from investors, industrial and academic scientists, and legislators. Successful drug discovery requires optimizing properties related to pharmacodynamics, pharmacokinetics, and clinical outcomes.

View Article and Find Full Text PDF

DrugCentral, accessible at https://drugcentral.org , is an open-access online drug information repository. It covers over 4950 drugs, incorporating structural, physicochemical, and pharmacological details to support drug discovery, development, and repositioning.

View Article and Find Full Text PDF

Molecular complexity (MC) lacks a universal definition, but various studies address it in contexts ranging from ligand-receptor interactions to DNA sequencing, with the overarching emphasis being its significance in synthetic organic chemistry and pharmaceutical research. Efforts to quantify MC in drug discovery have been numerous, but a unified approach remains challenging. Strategies based on graph theory, information theory, and substructural feature counts employed to gauge MC are often correlated to molecular weight (MW).

View Article and Find Full Text PDF
Article Synopsis
  • - Birth defects affect about 1 in 33 births in the U.S., stemming from genetic and various environmental factors, but their exact causes are often unknown.
  • - Researchers created the Reproductive Toxicity Knowledge Graph (ReproTox-KG) to analyze connections between drugs, genes, and birth defects, gathering data from multiple scientific sources.
  • - This tool scored over 30,000 small molecules for their potential to cause birth defects and identifies over 500 relevant associations, providing a valuable resource for understanding the underlying mechanisms of drug-induced birth defects.
View Article and Find Full Text PDF

The patent literature is a potentially valuable source of bioactivity data. In this article we describe a process to prioritise 3.7 million life science relevant patents obtained from the SureChEMBL database (https://www.

View Article and Find Full Text PDF

The Illuminating the Druggable Genome (IDG) project aims to improve our understanding of understudied proteins and our ability to study them in the context of disease biology by perturbing them with small molecules, biologics, or other therapeutic modalities. Two main products from the IDG effort are the Target Central Resource Database (TCRD) (http://juniper.health.

View Article and Find Full Text PDF

DrugCentral monitors new drug approvals and standardizes drug information. The current update contains 285 drugs (131 for human use). New additions include: (i) the integration of veterinary drugs (154 for animal use only), (ii) the addition of 66 documented off-label uses and iii) the identification of adverse drug events from pharmacovigilance data for pediatric and geriatric patients.

View Article and Find Full Text PDF

We report the main conclusions of the first Chemoinformatics and Artificial Intelligence Colloquium, Mexico City, June 15-17, 2022. Fifteen lectures were presented during a virtual public event with speakers from industry, academia, and non-for-profit organizations. Twelve hundred and ninety students and academics from more than 60 countries.

View Article and Find Full Text PDF

Translating human genetic findings (genome-wide association studies [GWAS]) to pathobiology and therapeutic discovery remains a major challenge for Alzheimer's disease (AD). We present a network topology-based deep learning framework to identify disease-associated genes (NETTAG). We leverage non-coding GWAS loci effects on quantitative trait loci, enhancers and CpG islands, promoter regions, open chromatin, and promoter flanking regions under the protein-protein interactome.

View Article and Find Full Text PDF

Experimental and computational studies pinpoint rate-limiting step(s) in metastasis governed by Rac1. Using ovarian cancer cell and animal models, Rac1 expression was manipulated, and quantitative measurements of cell-cell and cell-substrate adhesion, cell invasion, mesothelial clearance, and peritoneal tumor growth discriminated the tumor behaviors most highly influenced by Rac1. The experimental data were used to parameterize an agent-based computational model simulating peritoneal niche colonization, intravasation, and hematogenous metastasis to distant organs.

View Article and Find Full Text PDF

One aspirational goal of computational chemistry is to predict potent and drug-like binders for any protein, such that only those that bind are synthesized. In this Roadmap, we describe the launch of Critical Assessment of Computational Hit-finding Experiments (CACHE), a public benchmarking project to compare and improve small molecule hit-finding algorithms through cycles of prediction and experimental testing. Participants will predict small molecule binders for new and biologically relevant protein targets representing different prediction scenarios.

View Article and Find Full Text PDF
Article Synopsis
  • Scientists are looking at how confidence scores help check if a computer program called AlphaFold (AF2) is good at predicting the shapes of proteins, which is important for making new medicines.!
  • They found that ignoring low confidence scores (below 80) helps them get better predictions because those scores can be messy due to some proteins being disordered.!
  • Out of the latest crystal structures they studied, 95% probably have the correct shapes, and AF2 could help find useful info about many human proteins that are not well understood yet.!
View Article and Find Full Text PDF

The scientific knowledge about which genes are involved in which diseases grows rapidly, which makes it difficult to keep up with new publications and genetics datasets. The DISEASES database aims to provide a comprehensive overview by systematically integrating and assigning confidence scores to evidence for disease-gene associations from curated databases, genome-wide association studies (GWAS) and automatic text mining of the biomedical literature. Here, we present a major update to this resource, which greatly increases the number of associations from all these sources.

View Article and Find Full Text PDF

The Biopharmaceutics Drug Disposition Classification system (BDDCS) is a four-class approach based on water solubility and extent of metabolism/permeability rate. Based on the BDDCS class to which a drug is assigned, it is possible to predict the role of metabolic enzymes and transporters on the drug disposition of a new molecular entity (NME) prior to its administration to animals or humans. Here, we report a total of 1475 drugs and active metabolites to which the BDDCS is applied.

View Article and Find Full Text PDF