Citations: 20

Article Abstract

Background: Next-generation sequencing has created many new technological challenges in organizing and distributing genomics datasets, which can now routinely reach petabyte scales. With the arrival of data-hungry artificial intelligence and machine learning applications, findable, accessible, interoperable, and reusable (FAIR) genomics datasets have never been more valuable. Major archives like the Genomic Data Commons, Sequence Read Archive, and European Genome-Phenome Archive have improved researchers' ability to share and reuse data, and general-purpose repositories such as Zenodo and Figshare provide valuable platforms for research data publication, but the diversity of genomics research precludes any one-size-fits-all approach. In many cases, bespoke solutions are required. Although funding agencies and journals increasingly mandate reusable data practices, researchers still lack the technical support needed to meet the multifaceted challenges of data reuse.

Findings: Overture bridges this gap by providing open-source software for building and deploying customizable genomics data platforms. Its architecture consists of modular microservices, each generalized and given a narrow set of responsibilities, which combine to form complete data management systems. These systems enable researchers to organize, share, and explore their genomics data at any scale. Through Overture, researchers can connect their data to both humans and machines, fostering reproducibility and enabling new insights through controlled data sharing and reuse.

Conclusions: By making these tools freely available, we can help the research community develop reliable genomic data management quickly, flexibly, and at multiple scales. Overture is an open-source project licensed under AGPLv3.0, with all source code publicly available from https://github.com/overture-stack and documentation on development, deployment, and usage available from www.overture.bio.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12020472
DOI: http://dx.doi.org/10.1093/gigascience/giaf038

Publication Analysis

Top Keywords

genomics data: 16
data: 12
overture open-source: 8
genomics datasets: 8
data management: 8
genomics: 7
overture: 4
open-source genomics: 4
data platform: 4
platform background: 4

Similar Publications

Background: Laboratory animal veterinarians play a crucial role as a bridge between the ethical use of laboratory animals and the advancement of scientific and medical knowledge in biomedical research. They alleviate pain and reduce distress through veterinary care of laboratory animals. Additionally, they enhance animal welfare by creating environments that mimic natural habitats through environmental enrichment and social associations.

Background: Recent advances in high-throughput sequencing technologies have enabled the collection and sharing of massive amounts of omics data, along with the associated metadata: descriptive information that contextualizes the data, including phenotypic traits and experimental design. Enhancing metadata availability is critical to ensure data reusability and reproducibility and to facilitate novel biomedical discoveries through effective data reuse. Yet, incomplete metadata accompanying public omics data may hinder reproducibility and reusability and limit secondary analyses.

Whole genome sequence analysis of low-density lipoprotein cholesterol across 246 K individuals.

Genome Biol

September 2025

Center for Genomic Medicine, Cardiovascular Research Center, Massachusetts General Hospital Simches Research Center, 185 Cambridge Street, CPZN 5.238, Boston, MA, 02114, USA.

Background: Rare genetic variation captured by whole genome sequence datasets remains relatively underexplored for its contributions to human traits. Meta-analysis of sequencing data offers advantages by integrating larger sample sizes from diverse cohorts, thereby increasing the likelihood of discovering novel insights into complex traits. Emerging methods in genome-wide rare variant association testing further improve power and interpretability.

The global surge in the population of people 60 years and older, including that in China, challenges healthcare systems with rising age-related diseases. To address this demographic change, the Aging Biomarker Consortium (ABC) has launched the X-Age Project to develop a comprehensive aging evaluation system tailored to the Chinese population. Our goal is to identify robust biomarkers and construct composite aging clocks that capture biological age, defined as an individual's physiological and molecular state, across diverse Chinese cohorts.

Beyond their classical functions as redox cofactors, recent fundamental and clinical research has expanded our understanding of the diverse roles of nicotinamide adenine dinucleotide (NAD) and nicotinamide adenine dinucleotide phosphate (NADP) in signaling pathways, epigenetic regulation and energy homeostasis. Moreover, NAD and NADP influence numerous diseases as well as the processes of aging, and are emerging as targets for clinical intervention. Here, we summarize safety, bioavailability and efficacy data from NAD-related clinical trials, focusing on aging and neurodegenerative diseases.
