Publications by Brian D O'Connor | LitMetric

Publications by authors named "Brian D O'Connor"

Page 1 of 1

A Framework for the Interoperability of Cloud Platforms: Towards FAIR Data in SAFE Environments.

Robert L Grossman , Rebecca R Boyles , Brandi N Davis-Dusenbery , Amanda Haddock , Allison P Heath , Brian D O'Connor

Sci Data

February 2024

As the number of cloud platforms supporting scientific research grows, there is an increasing need to support interoperability between two or more cloud platforms. A well accepted core concept is to make data in cloud platforms Findable, Accessible, Interoperable and Reusable (FAIR). We introduce a companion concept that applies to cloud-based computing environments that we call a ecure and uthorized AIR nvironment (SAFE).

View Article and Find Full Text PDF

Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space.

Michael C Schatz , Anthony A Philippakis , Enis Afgan , Eric Banks , Vincent J Carey , Brian D O'Connor

Cell Genom

January 2022

Article Synopsis

AnVIL is a cloud-based platform designed to help researchers effectively store, manage, and analyze genomic data within a unified environment.
It enhances data sharing and collaboration by allowing researchers to work with various analysis tools (like Terra and Galaxy) without needing to move data around, ensuring better security.
The platform supports large-scale genomic studies and continuous improvements in features to facilitate responsible data sharing and accessibility for researchers.

View Article and Find Full Text PDF

GA4GH: International policies and standards for data sharing across genomic research and healthcare.

Heidi L Rehm , Angela J H Page , Lindsay Smith , Jeremy B Adams , Gil Alterovitz , Brian D O'Connor

Cell Genom

November 2021

The Global Alliance for Genomics and Health (GA4GH) aims to accelerate biomedical advances by enabling the responsible sharing of clinical and genomic data through both harmonized data aggregation and federated approaches. The decreasing cost of genomic sequencing (along with other genome-wide molecular assays) and increasing evidence of its clinical utility will soon drive the generation of sequence data from tens of millions of humans, with increasing levels of diversity. In this perspective, we present the GA4GH strategies for addressing the major challenges of this data revolution.

View Article and Find Full Text PDF

Author Correction: Mutations in PYCR1 cause cutis laxa with progeroid features.

Bruno Reversade , Nathalie Escande-Beillard , Aikaterini Dimopoulou , Björn Fischer , Serene C Chng , Brian D O'Connor

Nat Genet

February 2022

View Article and Find Full Text PDF

The Dockstore: enhancing a community platform for sharing reproducible and accessible computational protocols.

Denis Yuen , Louise Cabansay , Andrew Duncan , Gary Luu , Gregory Hogue , Brian D O'Connor

Nucleic Acids Res

July 2021

Dockstore (https://dockstore.org/) is an open source platform for publishing, sharing, and finding bioinformatics tools and workflows. The platform has facilitated large-scale biomedical research collaborations by using cloud technologies to increase the Findability, Accessibility, Interoperability and Reusability (FAIR) of computational resources, thereby promoting the reproducibility of complex bioinformatics analyses.

View Article and Find Full Text PDF

A user guide for the online exploration and visualization of PCAWG data.

Mary J Goldman , Junjun Zhang , Nuno A Fonseca , Isidro Cortés-Ciriano , Qian Xiang , Brian D O'Connor

Nat Commun

July 2020

The Pan-Cancer Analysis of Whole Genomes (PCAWG) project generated a vast amount of whole-genome cancer sequencing resource data. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumor types, we provide a user's guide to the five publicly available online data exploration and visualization tools introduced in the PCAWG marker paper. These tools are ICGC Data Portal, UCSC Xena, Chromothripsis Explorer, Expression Atlas, and PCAWG-Scout.

View Article and Find Full Text PDF

Organizing and running bioinformatics hackathons within Africa: The H3ABioNet cloud computing experience.

Azza E Ahmed , Phelelani T Mpangase , Sumir Panji , Shakuntala Baichoo , Yassine Souilmi , Brian D O'Connor

AAS Open Res

August 2019

The need for portable and reproducible genomics analysis pipelines is growing globally as well as in Africa, especially with the growth of collaborative projects like the Human Health and Heredity in Africa Consortium (H3Africa). The Pan-African H3Africa Bioinformatics Network (H3ABioNet) recognized the need for portable, reproducible pipelines adapted to heterogeneous computing environments, and for the nurturing of technical expertise in workflow languages and containerization technologies. Building on the network's Standard Operating Procedures (SOPs) for common genomic analyses, H3ABioNet arranged its first Cloud Computing and Reproducible Workflows Hackathon in 2016, with the purpose of translating those SOPs into analysis pipelines able to run on heterogeneous computing environments and meeting the needs of H3Africa research projects.

View Article and Find Full Text PDF

Developing reproducible bioinformatics analysis workflows for heterogeneous computing environments to support African genomics.

Shakuntala Baichoo , Yassine Souilmi , Sumir Panji , Gerrit Botha , Ayton Meintjes , Brian D O'Connor

BMC Bioinformatics

November 2018

Background: The Pan-African bioinformatics network, H3ABioNet, comprises 27 research institutions in 17 African countries. H3ABioNet is part of the Human Health and Heredity in Africa program (H3Africa), an African-led research consortium funded by the US National Institutes of Health and the UK Wellcome Trust, aimed at using genomics to study and improve the health of Africans. A key role of H3ABioNet is to support H3Africa projects by building bioinformatics infrastructure such as portable and reproducible bioinformatics workflows for use on heterogeneous African computing environments.

View Article and Find Full Text PDF

Correction: U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line.

Michael James Clark , Nils Homer , Brian D O'Connor , Zugen Chen , Ascia Eskin

PLoS Genet

May 2018

[This corrects the article DOI: 10.1371/journal.pgen.

View Article and Find Full Text PDF

The Dockstore: enabling modular, community-focused sharing of Docker-based genomics tools and workflows.

Brian D O'Connor , Denis Yuen , Vincent Chung , Andrew G Duncan , Xiang Kun Liu

F1000Res

January 2017

As genomic datasets continue to grow, the feasibility of downloading data to a local organization and running analysis on a traditional compute environment is becoming increasingly problematic. Current large-scale projects, such as the ICGC PanCancer Analysis of Whole Genomes (PCAWG), the Data Platform for the U.S.

View Article and Find Full Text PDF

New insights into the Tyrolean Iceman's origin and phenotype as inferred by whole-genome sequencing.

Andreas Keller , Angela Graefen , Markus Ball , Mark Matzas , Valesca Boisguerin , Brian D O'Connor

Nat Commun

February 2012

The Tyrolean Iceman, a 5,300-year-old Copper age individual, was discovered in 1991 on the Tisenjoch Pass in the Italian part of the Ötztal Alps. Here we report the complete genome sequence of the Iceman and show 100% concordance between the previously reported mitochondrial genome sequence and the consensus sequence generated from our genomic data. We present indications for recent common ancestry between the Iceman and present-day inhabitants of the Tyrrhenian Sea, that the Iceman probably had brown eyes, belonged to blood group O and was lactose intolerant.

View Article and Find Full Text PDF

Multiple self-healing squamous epithelioma is caused by a disease-specific spectrum of mutations in TGFBR1.

David R Goudie , Mariella D'Alessandro , Barry Merriman , Hane Lee , Ildikó Szeverényi , Brian D O'Connor

Nat Genet

February 2011

Multiple self-healing squamous epithelioma (MSSE), also known as Ferguson-Smith disease (FSD), is an autosomal-dominant skin cancer condition characterized by multiple squamous-carcinoma-like locally invasive skin tumors that grow rapidly for a few weeks before spontaneously regressing, leaving scars. High-throughput genomic sequencing of a conservative estimate (24.2 Mb) of the disease locus on chromosome 9 using exon array capture identified independent mutations in TGFBR1 in three unrelated families.

View Article and Find Full Text PDF

SeqWare Query Engine: storing and searching sequence data in the cloud.

Brian D O'Connor , Barry Merriman , Stanley F Nelson

BMC Bioinformatics

December 2010

Background: Since the introduction of next-generation DNA sequencers the rapid increase in sequencer throughput, and associated drop in costs, has resulted in more than a dozen human genomes being resequenced over the last few years. These efforts are merely a prelude for a future in which genome resequencing will be commonplace for both biomedical research and clinical applications. The dramatic increase in sequencer output strains all facets of computational infrastructure, especially databases and query interfaces.

View Article and Find Full Text PDF

U87MG decoded: the genomic sequence of a cytogenetically aberrant human cancer cell line.

Michael James Clark , Nils Homer , Brian D O'Connor , Zugen Chen , Ascia Eskin

PLoS Genet

January 2010

U87MG is a commonly studied grade IV glioma cell line that has been analyzed in at least 1,700 publications over four decades. In order to comprehensively characterize the genome of this cell line and to serve as a model of broad cancer genome sequencing, we have generated greater than 30x genomic sequence coverage using a novel 50-base mate paired strategy with a 1.4kb mean insert library.

View Article and Find Full Text PDF

Improving the efficiency of genomic loci capture using oligonucleotide arrays for high throughput resequencing.

Hane Lee , Brian D O'Connor , Barry Merriman , Vincent A Funari , Nils Homer

BMC Genomics

December 2009

Background: The emergence of next-generation sequencing technology presents tremendous opportunities to accelerate the discovery of rare variants or mutations that underlie human genetic disorders. Although the complete sequencing of the affected individuals' genomes would be the most powerful approach to finding such variants, the cost of such efforts make it impractical for routine use in disease gene research. In cases where candidate genes or loci can be defined by linkage, association, or phenotypic studies, the practical sequencing target can be made much smaller than the whole genome, and it becomes critical to have capture methods that can be used to purify the desired portion of the genome for shotgun short-read sequencing without biasing allelic representation or coverage.

View Article and Find Full Text PDF

Pathogenicity of a disease-associated human IL-4 receptor allele in experimental asthma.

Raffi Tachdjian , Clinton Mathias , Shadi Al Khatib , Paul J Bryce , Hong S Kim , Brian D O'Connor

J Exp Med

September 2009

Polymorphisms in the interleukin-4 receptor alpha chain (IL-4R alpha) have been linked to asthma incidence and severity, but a causal relationship has remained uncertain. In particular, a glutamine to arginine substitution at position 576 (Q576R) of IL-4R alpha has been associated with severe asthma, especially in African Americans. We show that mice carrying the Q576R polymorphism exhibited intense allergen-induced airway inflammation and remodeling.

View Article and Find Full Text PDF

Mutations in PYCR1 cause cutis laxa with progeroid features.

Bruno Reversade , Nathalie Escande-Beillard , Aikaterini Dimopoulou , Björn Fischer , Serene C Chng , Brian D O'Connor

Nat Genet

September 2009

Autosomal recessive cutis laxa (ARCL) describes a group of syndromal disorders that are often associated with a progeroid appearance, lax and wrinkled skin, osteopenia and mental retardation. Homozygosity mapping in several kindreds with ARCL identified a candidate region on chromosome 17q25. By high-throughput sequencing of the entire candidate region, we detected disease-causing mutations in the gene PYCR1.

View Article and Find Full Text PDF

GMODWeb: a web framework for the Generic Model Organism Database.

Brian D O'Connor , Allen Day , Scott Cain , Olivier Arnaiz , Linda Sperling

Genome Biol

August 2008

The Generic Model Organism Database (GMOD) initiative provides species-agnostic data models and software tools for representing curated model organism data. Here we describe GMODWeb, a GMOD project designed to speed the development of model organism database (MOD) websites. Sites created with GMODWeb provide integration with other GMOD tools and allow users to browse and search through a variety of data types.

View Article and Find Full Text PDF

Celsius: a community resource for Affymetrix microarray data.

Allen Day , Marc R J Carlson , Jun Dong , Brian D O'Connor , Stanley F Nelson

Genome Biol

February 2008

Celsius is a data warehousing system to aggregate Affymetrix CEL files and associated metadata. It provides mechanisms for importing, storing, querying, and exporting large volumes of primary and pre-processed microarray data. Celsius contains ten billion assay measurements and affiliated metadata.

View Article and Find Full Text PDF

Utilizing logical relationships in genomic data to decipher cellular processes.

Peter M Bowers , Brian D O'Connor , Shawn J Cokus , Einat Sprinzak , Todd O Yeates

FEBS J

October 2005

The wealth of available genomic data has spawned a corresponding interest in computational methods that can impart biological meaning and context to these experiments. Traditional computational methods have drawn relationships between pairs of proteins or genes based on notions of equality or similarity between their patterns of occurrence or behavior. For example, two genes displaying similar variation in expression, over a number of experiments, may be predicted to be functionally related.

View Article and Find Full Text PDF

The genomics of disulfide bonding and protein stabilization in thermophiles.

Morgan Beeby , Brian D O'Connor , Carsten Ryttersgaard , Daniel R Boutz , L Jeanne Perry

PLoS Biol

September 2005

Thermophilic organisms flourish in varied high-temperature environmental niches that are deadly to other organisms. Recently, genomic evidence has implicated a critical role for disulfide bonds in the structural stabilization of intracellular proteins from certain of these organisms, contrary to the conventional view that structural disulfide bonds are exclusively extracellular. Here both computational and structural data are presented to explore the occurrence of disulfide bonds as a protein-stabilization method across many thermophilic prokaryotes.

View Article and Find Full Text PDF

GDAP: a web tool for genome-wide protein disulfide bond prediction.

Brian D O'Connor , Todd O Yeates

Nucleic Acids Res

July 2004

The Genomic Disulfide Analysis Program (GDAP) provides web access to computationally predicted protein disulfide bonds for over one hundred microbial genomes, including both bacterial and achaeal species. In the GDAP process, sequences of unknown structure are mapped, when possible, to known homologous Protein Data Bank (PDB) structures, after which specific distance criteria are applied to predict disulfide bonds. GDAP also accepts user-supplied protein sequences and subsequently queries the PDB sequence database for the best matches, scans for possible disulfide bonds and returns the results to the client.

View Article and Find Full Text PDF