While environmental exposures are known to play a significant role in human disease, these effects are understudied compared to genomic and molecular components of disease. One example is the effect of chronic, low-level exposures to cyanobacteria toxins on health outcomes. Here we perform a retrospective analysis on real-world data in the National Clinical Cohort Collaborative (N3C) COVID Enclave, examining the possible impact of chronic exposure to cyanobacteria toxin on the severity of COVID-19 outcomes in patients from Colorado.
View Article and Find Full Text PDFSingle-cell omics technologies have transformed our understanding of cellular diversity by enabling high-resolution profiling of individual cells. However, the unprecedented scale and heterogeneity of these datasets demand robust frameworks for data integration and annotation. The Cell Ontology (CL) has emerged as a pivotal resource for achieving FAIR (Findable, Accessible, Interoperable, and Reusable) data principles by providing standardized, species-agnostic terms for canonical cell types-forming a core component of a wide range of platforms and tools.
View Article and Find Full Text PDFPurpose: Stargardt disease, also called ABCA4-related retinopathy (ABCA4R), is the most common form of juvenile-onset macular dystrophy and yet lacks an FDA approved treatment. Substantial progress has been made through landmark studies like that of the Progression of Atrophy Secondary to Stargardt Disease (ProgStar), but tasks like image segmentation and phenotyping still pose major challenges in terms of monitoring disease progression and categorizing patient subgroups. Furthermore, these methods are subjective and laborious.
View Article and Find Full Text PDFBackground: Electronic health records (EHRs) provide valuable insights to address clinical and epidemiological research concerning HIV, including the disproportionate impact of the COVID-19 pandemic on people living with HIV. To identify this population, most studies using EHR or claims databases start with diagnostic codes, which can result in misclassification without further refinement using drug or laboratory data. Furthermore, given that antiretrovirals now have indications for both HIV and COVID-19 (ie, ritonavir in nirmatrelvir/ritonavir), new phenotyping methods are needed to better capture people living with HIV.
View Article and Find Full Text PDFMotivation: Structured representations of clinical data can support computational analysis of individuals and cohorts, and ontologies representing disease entities and phenotypic abnormalities are now commonly used for translational research. The Medical Action Ontology (MAxO) provides a computational representation of treatments and other actions taken for clinical management. Currently, manual biocuration is used to annotate MAxO terms to rare diseases.
View Article and Find Full Text PDFOntologies are structured frameworks for representing knowledge by systematically defining concepts, categories, and their relationships. While widely adopted in biomedicine, ontologies remain largely absent in mental health research and clinical care, where the field continues to rely heavily on existing classification systems (e.g.
View Article and Find Full Text PDFBackground: Limited universally-adopted data standards in veterinary medicine hinder data interoperability and therefore integration and comparison; this ultimately impedes the application of existing information-based tools to support advancement in diagnostics, treatments, and precision medicine.
Hypothesis/objectives: A single, coherent, logic-based standard for documenting breed names in health, production, and research-related records will improve data use capabilities in veterinary and comparative medicine.
Animals: No live animals were used.
Background: Postpartum depression (PPD) is a mood disorder affecting 1 in 7 women after childbirth that is often underscreened and underdetected. If not diagnosed and treated, PPD is associated with long-term developmental challenges in the child and maternal morbidity. Wearable technologies, such as smartwatches and fitness trackers (eg, Fitbit), offer continuous and longitudinal digital phenotyping for mood disorder diagnosis and monitoring, with device wear time being an important yet understudied aspect.
View Article and Find Full Text PDFWhile Research Electronic Data Capture (REDCap) has been widely adopted in rare disease research, its unconstrained data format often leads to implementations that lack native interoperability with global health data standards, limiting secondary data use. To address this, we developed and validated , an open-source framework implementing our previously-published ontology-based rare disease common data model, enabling standardised data exchange between REDCap, international registries, and downstream analysis tools. Its preconfigured pipelines interact with the local REDCap application programming interface and enable semi-automatic import or export of data to the Global Alliance for Genomics and Health (GA4GH) Phenopackets and Health Level 7 (HL7) Fast Healthcare Interoperability Resources (FHIR) instances, conforming to the HL7 International Patient Summary and Genomics Reporting profiles.
View Article and Find Full Text PDFArtificial intelligence (AI) is rapidly transforming numerous aspects of daily life, including clinical practice and biomedical research. In light of this rapid transformation, and in the context of medical genetics, we assembled a group of leaders in the field to respond to the question about how AI is affecting, and especially how AI will affect, medical genetics. The authors who contributed to this collection of essays intentionally represent different areas of expertise, career stages, and geographies, and include diverse types of clinicians, computer scientists, and researchers.
View Article and Find Full Text PDFObjective -: Significant opportunities for understanding the co-occurrence of conditions across species in coincident households remain untapped. We determined the feasibility of creating a Companion Care Registry (CCR) for analysis of health data from the University of Colorado Health (UCHealth) patients and their companion animals who received veterinary care at the geographically-adjacent Colorado State University Veterinary Teaching Hospital (CSU-VTH).
Materials And Methods -: Using a hybrid deterministic and probabilistic record linkage method, non-medical Personally Identifiable Information was securely matched to determine the total number of UCHealth patients within the HIPAA-compliant Health Data Compass Research Data Warehouse (2015-2024) who took a companion animal to the CSU-VTH (2019-2024).
Comprehensively characterizing genotype-phenotype correlations (GPCs) in Mendelian disease would create new opportunities for improving clinical management and understanding disease biology. However, heterogeneous approaches to data sharing, reuse, and analysis have hindered progress in the field. We developed Genotype Phenotype Evaluation of Statistical Association (GPSEA), a software package that leverages the Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema to represent case-level clinical and genetic data about individuals.
View Article and Find Full Text PDFBackground: Large language models (LLMs) are increasingly used in the medical field for diverse applications including differential diagnostic support. The estimated training data used to create LLMs such as the Generative Pretrained Transformer (GPT) predominantly consist of English-language texts, but LLMs could be used across the globe to support diagnostics if language barriers could be overcome. Initial pilot studies on the utility of LLMs for differential diagnosis in languages other than English have shown promise, but a large-scale assessment on the relative performance of these models in a variety of European and non-European languages on a comprehensive corpus of challenging rare-disease cases is lacking.
View Article and Find Full Text PDFPhenotypic data are critical for understanding biological mechanisms and consequences of genomic variation, and are pivotal for clinical use cases such as disease diagnostics and treatment development. For over a century, vast quantities of phenotype data have been collected in many different contexts covering a variety of organisms. The emerging field of phenomics focuses on integrating and interpreting these data to inform biological hypotheses.
View Article and Find Full Text PDFNPJ Genom Med
December 2024
Whole genome sequencing has transformed rare disease research; however, 50-80% of rare disease patients remain undiagnosed after such testing. Regular reanalysis can identify new diagnoses, especially in newly discovered disease-gene associations, but efficient tools are required to support clinical interpretation. Exomiser, a phenotype-driven variant prioritisation tool, fulfils this role; within the 100,000 Genomes Project (100kGP), diagnoses were identified after reanalysis in 463 (2%) of 24,015 unsolved patients after previous analysis for variants in known disease genes.
View Article and Find Full Text PDFPost-Acute Sequelae of SARS-CoV-2 infection (PASC), also known as Long-COVID, encompasses a variety of complex and varied outcomes following COVID-19 infection that are still poorly understood. We clustered over 600 million condition diagnoses from 14 million patients available through the National COVID Cohort Collaborative (N3C), generating hundreds of highly detailed clinical phenotypes. Assessing patient clinical trajectories using these clusters allowed us to identify individual conditions and phenotypes strongly increased after acute infection.
View Article and Find Full Text PDFPostpartum depression (PPD) is a mood disorder affecting one in seven women after childbirth that is often under-screened and under-detected. If not diagnosed and treated, PPD is associated with long-term developmental challenges in the child and maternal morbidity. Wearable technologies, such as smartwatches and fitness trackers (e.
View Article and Find Full Text PDFJ Biomed Semantics
October 2024
Background: A wealth of clinically relevant information is only obtainable within unstructured clinical narratives, leading to great interest in clinical natural language processing (NLP). While a multitude of approaches to NLP exist, current algorithm development approaches have limitations that can slow the development process. These limitations are exacerbated when the task is emergent, as is the case currently for NLP extraction of signs and symptoms of COVID-19 and postacute sequelae of SARS-CoV-2 infection (PASC).
View Article and Find Full Text PDFStructured representations of clinical data can support computational analysis of individuals and cohorts, and ontologies representing disease entities and phenotypic abnormalities are now commonly used for translational research. The Medical Action Ontology (MAxO) provides a computational representation of treatments and other actions taken for the clinical management of patients. Currently, manual biocuration is used to assign MAxO terms to rare diseases, enabling clinical management of rare diseases to be described computationally for use in clinical decision support and mechanism discovery.
View Article and Find Full Text PDFOtol Neurotol Open
June 2024
Background –: Limited universally-adopted data standards in veterinary medicine hinder data interoperability and therefore integration and comparison; this ultimately impedes the application of existing information-based tools to support advancement in diagnostics, treatments, and precision medicine.
Hypothesis/objectives –: A single, coherent, logic-based standard for documenting breed names in health, production, and research-related records will improve data use capabilities in veterinary and comparative medicine.
Animals –: No live animals were used.