Publications by Tony Burdett | LitMetric

Publications by authors named "Tony Burdett"

MorPhiC Consortium: towards functional characterization of all human genes.

Mazhar Adli, Laralynne Przybyla, Tony Burdett, Paul W Burridge, Pilar Cacheiro

Nature

February 2025

Recent advances in functional genomics and human cellular models have substantially enhanced our understanding of the structure and regulation of the human genome. However, our grasp of the molecular functions of human genes remains incomplete and biased towards specific gene classes. The Molecular Phenotypes of Null Alleles in Cells (MorPhiC) Consortium aims to address this gap by creating a comprehensive catalogue of the molecular and cellular phenotypes associated with null alleles of all human genes using in vitro multicellular systems.

Building a FAIR data ecosystem for incorporating single-cell transcriptomics data into agricultural genome to phenome research.

Muskan Kapoor, Enrique Sapena Ventura, Amy Walsh, Alexey Sokolov, Nancy George, Tony Burdett

Front Genet

November 2024

Introduction: The agriculture genomics community has numerous data submission standards available, but the standards for describing and storing single-cell (SC, e.g., scRNA-seq) data are comparatively underdeveloped.

The European Nucleotide Archive in 2024.

Colman O'Cathail, Alisha Ahamed, Josephine Burgin, Carla Cummins, Rajkumar Devaraj, Tony Burdett

Nucleic Acids Res

January 2025

The European Nucleotide Archive (ENA, https://www.ebi.ac.

The international nucleotide sequence database collaboration (INSDC): enhancing global participation.

Ilene Karsch-Mizrachi, Masanori Arita, Tony Burdett, Guy Cochrane, Yasukazu Nakamura

Nucleic Acids Res

January 2025

The members of the International Nucleotide Sequence Database Collaboration (INSDC; https://insdc.org) have built systems to collect, archive and disseminate sequence data for more than four decades. The three collaborating organizations, the National Library of Medicine, National Center for Biotechnology Information (NLM-NCBI) in the United States; the Research Organization of Information and Systems, National Institute of Genetics (ROIS-NIG) in Japan; and the European Molecular Biology Laboratory-European Bioinformatics Institute (EMBL-EBI), formalized their relationship through the adoption of an arrangement which documents their commitment to free and open access to genomic sequences.

Mobilisation and analyses of publicly available SARS-CoV-2 data for pandemic responses.

Nadim Rahman, Colman O'Cathail, Ahmad Zyoud, Alexey Sokolov, Bas Oude Munnink, Tony Burdett

Microb Genom

February 2024

The COVID-19 pandemic has seen large-scale pathogen genomic sequencing efforts, becoming part of the toolbox for surveillance and epidemic research. This resulted in an unprecedented level of data sharing to open repositories, which has actively supported the identification of SARS-CoV-2 structure, molecular interactions, mutations and variants, and facilitated vaccine development and drug reuse studies and design. The European COVID-19 Data Platform was launched to support this data sharing, and has resulted in the deposition of several million SARS-CoV-2 raw reads.

Toward a common standard for data and specimen provenance in life sciences.

Rudolf Wittner, Petr Holub, Cecilia Mascia, Francesca Frexia, Heimo Müller, Tony Burdett

Learn Health Syst

January 2024

Article Synopsis

The importance of openly sharing and reusing specimens and data in life sciences research is highlighted, as it directly affects the quality of findings and knowledge.
Accurate documentation of pre-analytical conditions, analytical procedures, and data processing is crucial to validate research results, but current information on sample and data provenance is often inadequate.
The publication discusses a standardization effort aimed at creating reliable machine-actionable documentation for data lineage and specimens, inviting experts from biotechnology and biomedical fields to contribute to this initiative.

Expression Atlas update: insights from sequencing data at both bulk and single cell level.

Nancy George, Silvie Fexova, Alfonso Munoz Fuentes, Pedro Madrigal, Yalan Bi, Tony Burdett

Nucleic Acids Res

January 2024

Expression Atlas (www.ebi.ac.

The European Nucleotide Archive in 2023.

David Yuan, Alisha Ahamed, Josephine Burgin, Carla Cummins, Rajkumar Devraj, Tony Burdett

Nucleic Acids Res

January 2024

The European Nucleotide Archive (ENA; https://www.ebi.ac.

The Translational Data Catalog - discoverable biomedical datasets.

Danielle Welter, Philippe Rocca-Serra, Valentin Grouès, Nirmeen Sallam, François Ancien, Tony Burdett

Sci Data

July 2023

The discoverability of datasets resulting from the diverse range of translational and biomedical projects remains sporadic. It is especially difficult for datasets emerging from pre-competitive projects, often due to the legal constraints of data-sharing agreements, and the different priorities of the private and public sectors. The Translational Data Catalog is a single discovery point for the projects and datasets produced by a number of major research programmes funded by the European Commission.

The FAIR Cookbook - the essential resource for and by FAIR doers.

Philippe Rocca-Serra, Wei Gu, Vassilios Ioannidis, Tooba Abbassi-Daloii, Salvador Capella-Gutierrez, Tony Burdett

Sci Data

May 2023

The notion that data should be Findable, Accessible, Interoperable and Reusable, according to the FAIR Principles, has become a global norm for good data stewardship and a prerequisite for reproducibility. Nowadays, FAIR guides data policy actions and professional practices in the public and private sectors. Despite such global endorsements, however, the FAIR Principles are aspirational, remaining elusive at best, and intimidating at worst.

FAIR in action - a flexible framework to guide FAIRification.

Danielle Welter, Nick Juty, Philippe Rocca-Serra, Fuqi Xu, David Henderson, Tony Burdett

Sci Data

May 2023

The COVID-19 pandemic has highlighted the need for FAIR (Findable, Accessible, Interoperable, and Reusable) data more than any other scientific challenge to date. We developed a flexible, multi-level, domain-agnostic FAIRification framework, providing practical guidance to improve the FAIRness for both existing and future clinical and molecular datasets. We validated the framework in collaboration with several major public-private partnership projects, demonstrating and delivering improvements across all aspects of FAIR and across a variety of datasets and their contexts.

MGnify Genomes: A Resource for Biome-specific Microbial Genome Catalogues.

Tatiana A Gurbich, Alexandre Almeida, Martin Beracochea, Tony Burdett, Josephine Burgin

J Mol Biol

July 2023

An increasingly common output arising from the analysis of shotgun metagenomic datasets is the generation of metagenome-assembled genomes (MAGs), with tens of thousands of MAGs now described in the literature. However, the discovery and comparison of these MAG collections is hampered by the lack of uniformity in their generation, annotation and storage. To address this, we have developed MGnify Genomes, a growing collection of biome-specific non-redundant microbial genome catalogues generated using MAGs and publicly available isolate genomes.

MGnify: the microbiome sequence data analysis resource in 2023.

Lorna Richardson, Ben Allen, Germana Baldi, Martin Beracochea, Maxwell L Bileschi, Tony Burdett

Nucleic Acids Res

January 2023

The MGnify platform (https://www.ebi.ac.

The European Nucleotide Archive in 2022.

Josephine Burgin, Alisha Ahamed, Carla Cummins, Rajkumar Devraj, Khadim Gueye, Tony Burdett

Nucleic Acids Res

January 2023

The European Nucleotide Archive (ENA; https://www.ebi.ac.

Toward a data infrastructure for the Plant Cell Atlas.

Noah Fahlgren, Muskan Kapoor, Galabina Yordanova, Irene Papatheodorou, Jamie Waese, Tony Burdett

Plant Physiol

January 2023

We review how a data infrastructure for the Plant Cell Atlas might be built using existing infrastructure and platforms. The Human Cell Atlas has developed an extensive infrastructure for human and mouse single cell data, while the European Bioinformatics Institute has developed a Single Cell Expression Atlas that currently houses several plant data sets. We discuss issues related to appropriate ontologies for describing a plant single cell experiment.

From biomedical cloud platforms to microservices: next steps in FAIR data and analysis.

Nathan C Sheffield, Vivien R Bonazzi, Philip E Bourne, Tony Burdett, Timothy Clark

Sci Data

September 2022

The biomedical research community is investing heavily in biomedical cloud platforms. Cloud computing holds great promise for addressing challenges with big data and ensuring reproducibility in biology. However, despite their advantages, cloud platforms in and of themselves do not automatically support FAIRness.

ELIXIR biovalidator for semantic validation of life science metadata.

Isuru Liyanage, Tony Burdett, Bert Droesbeke, Karoly Erdos, Rolando Fernandez

Bioinformatics

May 2022

Summary: To advance biomedical research, increasingly large amounts of complex data need to be discovered and integrated. This requires syntactic and semantic validation to ensure shared understanding of relevant entities. This article describes the ELIXIR biovalidator, which extends the syntactic validation of the widely used AJV library with ontology-based validation of JSON documents.

GA4GH: International policies and standards for data sharing across genomic research and healthcare.

Heidi L Rehm, Angela J H Page, Lindsay Smith, Jeremy B Adams, Gil Alterovitz, Tony Burdett

Cell Genom

November 2021

The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution.

The European Nucleotide Archive in 2021.

Carla Cummins, Alisha Ahamed, Raheela Aslam, Josephine Burgin, Rajkumar Devraj, Tony Burdett

Nucleic Acids Res

January 2022

The European Nucleotide Archive (ENA, https://www.ebi.ac.

Expression Atlas update: gene and protein expression in multiple species.

Pablo Moreno, Silvie Fexova, Nancy George, Jonathan R Manning, Zhichiao Miao, Tony Burdett

Nucleic Acids Res

January 2022

The EMBL-EBI Expression Atlas is an added value knowledge base that enables researchers to answer the question of where (tissue, organism part, developmental stage, cell type) and under which conditions (disease, treatment, gender, etc) a gene or protein of interest is expressed. Expression Atlas brings together data from >4500 expression studies from >65 different species, across different conditions and tissues. It makes these data freely available in an easy to visualise form, after expert curation to accurately represent the intended experimental design, re-analysed via standardised pipelines that rely on open-source community developed tools.

The Data Use Ontology to streamline responsible access to human biomedical datasets.

Jonathan Lawson, Moran N Cabili, Giselle Kerry, Tiffany Boughtwood, Adrian Thorogood, Tony Burdett

Cell Genom

November 2021

Human biomedical datasets that are critical for research and clinical studies to benefit human health also often contain sensitive or potentially identifying information of individual participants. Thus, care must be taken when they are processed and made available to comply with ethical and regulatory frameworks and informed consent data conditions. To enable and streamline data access for these biomedical datasets, the Global Alliance for Genomics and Health (GA4GH) Data Use and Researcher Identities (DURI) work stream developed and approved the Data Use Ontology (DUO) standard.

BioSamples database: FAIRer samples metadata to accelerate research data management.

Mélanie Courtot, Dipayan Gupta, Isuru Liyanage, Fuqi Xu, Tony Burdett

Nucleic Acids Res

January 2022

The BioSamples database at EMBL-EBI is the central institutional repository for sample metadata storage and connection to EMBL-EBI archives and other resources. The technical improvements to our infrastructure described in our last update have enabled us to scale and accommodate an increasing number of communities, resulting in a higher number of submissions and more heterogeneous data. The BioSamples database now has a valuable set of features and processes to improve data quality in BioSamples, and in particular enriching metadata content and following FAIR principles.

A compendium of uniformly processed human gene expression and splicing quantitative trait loci.

Nurlan Kerimov, James D Hayhurst, Kateryna Peikova, Jonathan R Manning, Peter Walter, Tony Burdett

Nat Genet

September 2021

Many gene expression quantitative trait locus (eQTL) studies have published their summary statistics, which can be used to gain insight into complex human traits by downstream analyses, such as fine mapping and co-localization. However, technical differences between these datasets are a barrier to their widespread use. Consequently, target genes for most genome-wide association study (GWAS) signals have still not been identified.

The European Nucleotide Archive in 2020.

Peter W Harrison, Alisha Ahamed, Raheela Aslam, Blaise T F Alako, Josephine Burgin, Tony Burdett

Nucleic Acids Res

January 2021

The European Nucleotide Archive (ENA; https://www.ebi.ac.

Open Targets Genetics: systematic identification of trait-associated genes using large-scale genetics and functional genomics.

Maya Ghoussaini, Edward Mountjoy, Miguel Carmona, Gareth Peat, Ellen M Schmidt, Tony Burdett

Nucleic Acids Res

January 2021

Article Synopsis

Open Targets Genetics is an open-access platform that combines GWAS and functional genomics data to link genetic variants to potential causal genes and traits.
The resource allows users to search and analyze genetic variants, genes, and disease associations, providing tools for prioritizing potential causal variants across various traits and tissues.
It offers data visualizations and is accessible through a web portal, bulk downloads, and a GraphQL API, supporting applications in drug discovery and repurposing.
