Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number of communities, resulting in a higher number of submissions and more heterogeneous data. The BioSamples database now has a valuable set of features and processes to improve data quality in BioSamples, and in particular enriching metadata content and following FAIR principles. In this manuscript, we describe how BioSamples in 2021 handles requirements from our community of users through exemplar use cases: increased findability of samples and improved data management practices support the goals of the ReSOLUTE project, how the plant community benefits from being able to link genotypic to phenotypic information, and we highlight how cumulatively those improvements contribute to more complex multi-omics data integration supporting COVID-19 research. Finally, we present underlying technical features used as pillars throughout those use cases and how they are reused for expanded engagement with communities such as FAIRplus and the Global Alliance for Genomics and Health. Availability: The BioSamples database is freely available at http://www.ebi.ac.uk/biosamples. Content is distributed under the EMBL-EBI Terms of Use available at https://www.ebi.ac.uk/about/terms-of-use. The BioSamples code is available at https://github.com/EBIBioSamples/biosamples-v4 and distributed under the Apache 2.0 license.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728232PMC
http://dx.doi.org/10.1093/nar/gkab1046DOI Listing

Publication Analysis

Top Keywords

biosamples database
16
data management
8
biosamples
7
data
5
database fairer
4
fairer samples
4
samples metadata
4
metadata accelerate
4
accelerate data
4
management biosamples
4

Similar Publications

Background And Objectives: Renal angiomyolipomas (AMLs) affect 80% of people with tuberous sclerosis complex (TSC) during their lifetime. We aimed to determine the diagnostic accuracy of blood biomarkers in identifying the presence and size of renal AMLs in people with TSC.

Methods: We collected clinical data and serum samples from individuals followed at 1 TSC clinic (Centre hospitalier de l'Université de Montréal [CHUM] cohort).

View Article and Find Full Text PDF

Extraction of biological terms using large language models enhances the usability of metadata in the BioSample database.

Gigascience

January 2025

Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Univ. of Tokyo Kashiwanoha-campus Station Satellite 6F. 178-4-4 Wakashiba, Kashiwa-shi, Chiba 277-0871, JAPAN.

BioSample is a repository of experimental sample metadata. It is a comprehensive archive that enables searches of experiments, regardless of type. However, there is substantial variability in the submitted metadata due to the difficulty in defining comprehensive rules for describing them and the limited user awareness of best practices in creating them.

View Article and Find Full Text PDF

Introduction: The ORIGINS Project ("ORIGINS") is a longitudinal, population-level birth cohort with data and biosample collections that aim to facilitate research to reduce non-communicable diseases (NCDs) and encourage 'a healthy start to life'. ORIGINS has gathered millions of datapoints and over 400,000 biosamples over 15 timepoints, antenatally through to five years of age, from mothers, non-birthing partners and the child, across four health and wellness domains: 'Growth and development', 'Medical, biological and genetic', 'Biopsychosocial and cognitive', 'Lifestyle, environment and nutrition'.

Methods: Mothers, non-birthing partners and their offspring were recruited antenatally (between 18 and 38 weeks' gestation) from the Joondalup and Wanneroo communities of Perth, Western Australia from 2017 to 2024.

View Article and Find Full Text PDF

Introduction: Black men are diagnosed with high-grade prostate cancer (PCa; Gleason sum ≥7) at greater rates than White men. This persistent disparity has led to mortality rates among Black men that are twice the rate of White men. Risk prediction tools can aid clinical decision making for PCa screening, biopsy, and treatment.

View Article and Find Full Text PDF

Background: The gut microbiome functions as a metabolic organ, producing numerous enzymes that influence host health; however, their substrates and metabolites remain largely unknown.

Results: We present MicrobeRX, an enzyme-based metabolite prediction tool that employs 5487 human reactions and 4030 unique microbial reactions from 6286 genome-scale models, as well as 3650 drug metabolic reactions from the DrugBank database (v.5.

View Article and Find Full Text PDF