98%
921
2 minutes
20
Background: In recent decades, several life science resources have structured data using the same framework and made these accessible using the same query language to facilitate interoperability. Knowledge graphs have seen increased adoption in bioinformatics due to their advantages for representing data in a generic graph format. For example, yummydata.org catalogs more than 60 knowledge graphs accessible through SPARQL, a technical query language. Although SPARQL allows powerful, expressive queries, even across physically distributed knowledge graphs, formulating such queries is a challenge for most users. Therefore, to guide users in retrieving the relevant data, many of these resources provide representative examples. These examples can also be an important source of information for machine learning (for example, machine-learning algorithms for translating natural language questions to SPARQL), if a sufficiently large number of examples are provided and published in a common, machine-readable, and standardized format across different resources.
Findings: We introduce a large collection of human-written natural language questions and their corresponding SPARQL queries over federated bioinformatics knowledge graphs (KGs) collected for several years across different research groups at the SIB Swiss Institute of Bioinformatics. The collection comprises more than 1,000 example questions and queries, including almost 100 federated queries. We propose a methodology to uniformly represent the examples with minimal metadata, based on existing standards. Furthermore, we introduce an extensive set of open-source applications, including query graph visualizations and smart query editors, easily reusable by KG maintainers who adopt the proposed methodology.
Conclusions: We encourage the community to adopt and extend the proposed methodology, towards richer KG metadata and improved Semantic Web services. URL: https://github.com/sib-swiss/sparql-examples.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12083453 | PMC |
http://dx.doi.org/10.1093/gigascience/giaf045 | DOI Listing |
Neurobiol Dis
September 2025
Department of Neurology, The First Affiliated Hospital of Guangxi Medical University, Nanning, Guangxi Zhuang Autonomous Region, China. Electronic address:
The effect of recurrent seizures on the gradual deterioration of the white matter structural network and the potential molecular mechanisms that underlie the baseline and longitudinal changes in network topology in temporal lobe epilepsy (TLE) remain unclear. Therefore, we used diffusion tensor imaging (DTI) scans and neuropsychiatric assessments for 28 patients with unilateral TLE at baseline and follow-up, and for 28 healthy controls (HC). The topological properties of the structural network were calculated using graph theoretical analyses.
View Article and Find Full Text PDFNeuro Endocrinol Lett
September 2025
Department of Neurosurgery, PLA 960th Hospital, Jinan, Shandong, 250031, China.
Objective: To analyze the hotspots and frontiers in the field of subarachnoid hemorrhage using the bibliometrics method and providing references for academic research.
Methods: All published studies related to subarachnoid hemorrhage published in the Web of Science core database from 1 January 2016 to 25 September 2021 were retrospectively identified using VOSviewer and CiteSpace software. Visualization VOSviewer and CiteSpace software were used to perform statistical and cluster analyses on authors, countries, institutions, keywords, and co-cited documents.
IEEE J Biomed Health Inform
September 2025
The tumor microenvironment is a dynamic eco system where cellular interactions drive cancer progression. However, inferring cell-cell communication from non-spatial scRNA-seq data remains challenging due to incomplete li gand-receptor databases and noisy cell type annotations. H ere, we propose scGraphDap, a graph neural network frame work that integrates functional state pseudo-labels and graph structure learning to improve both cell type annotation an d CCC inference.
View Article and Find Full Text PDFMol Divers
September 2025
Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, Nanjing, 211198, China.
Drug absorption significantly influences pharmacokinetics. Accurately predicting human oral bioavailability (HOB) is essential for optimizing drug candidates and improving clinical success rates. The traditional method based on experiment is a common way to obtain HOB, but the experimental method is time-consuming and costly.
View Article and Find Full Text PDFJ Chem Inf Model
September 2025
Songshan Lake Materials Laboratory, Dongguan 523808, PR China.
Large language models (LLMs) have demonstrated transformative potential for materials discovery in condensed matter systems, but their full utility requires both broader application scenarios and integration with ab initio crystal structure prediction (CSP), density functional theory (DFT) methods and domain knowledge to benefit future inverse material design. Here, we develop an integrated computational framework combining language model-guided materials screening with genetic algorithm (GA) and graph neural network (GNN)-based CSP methods to predict new photovoltaic material. This LLM + CSP + DFT approach successfully identifies a previously overlooked oxide material with unexpected photovoltaic potential.
View Article and Find Full Text PDF