Atrial fibrillation (AF) is a prevalent and morbid abnormality of the heart rhythm with a strong genetic component. Here, we meta-analyzed genome and exome sequencing data from 36 studies that included 52,416 AF cases and 277,762 controls. In burden tests of rare coding variation, we identified novel associations between AF and the genes MYBPC3, LMNA, PKP2, FAM189A2 and KDM5B.
Heart failure (HF) is a major contributor to global morbidity and mortality. While distinct clinical subtypes, defined by etiology and left ventricular ejection fraction, are well recognized, their genetic determinants remain inadequately understood. In this study, we report a genome-wide association study of HF and its subtypes in a sample of 1.
We sought to characterize cellular composition across the cardiovascular system of the healthy Wistar rat, an important model in preclinical cardiovascular research. We performed single-nucleus RNA sequencing (snRNA-seq) in 78 samples in 10 distinct regions, including the four chambers of the heart, ventricular septum, sinoatrial node, atrioventricular node, aorta, pulmonary artery, and pulmonary veins, which produced 505,835 nuclei. We identified 26 distinct cell types and additional subtypes, with different cellular composition across cardiac regions and tissue-specific transcription for each cell type.
Droplet-based single-cell assays, including single-cell RNA sequencing (scRNA-seq), single-nucleus RNA sequencing (snRNA-seq) and cellular indexing of transcriptomes and epitopes by sequencing (CITE-seq), generate considerable background noise counts, the hallmark of which is nonzero counts in cell-free droplets and off-target gene expression in unexpected cell types. Such systematic background noise can lead to batch effects and spurious differential gene expression results. Here we develop a deep generative model based on the phenomenology of noise generation in droplet-based assays.
Mapping gene networks requires large amounts of transcriptomic data to learn the connections between genes, which impedes discoveries in settings with limited data, including rare diseases and diseases affecting clinically inaccessible tissues. Recently, transfer learning has revolutionized fields such as natural language understanding and computer vision by leveraging deep learning models pretrained on large-scale general datasets that can then be fine-tuned towards a vast array of downstream tasks with limited task-specific data. Here, we developed a context-aware, attention-based deep learning model, Geneformer, pretrained on a large-scale corpus of about 30 million single-cell transcriptomes to enable context-specific predictions in settings with limited data in network biology.
Background: As the largest conduit vessel, the aorta is responsible for the conversion of phasic systolic inflow from ventricular ejection into more continuous peripheral blood delivery. Systolic distention and diastolic recoil conserve energy and are enabled by the specialized composition of the aortic extracellular matrix. Aortic distensibility decreases with age and vascular disease.
Large-scale gene sequencing studies for complex traits have the potential to identify causal genes with therapeutic implications. We performed gene-based association testing of blood lipid levels with rare (minor allele frequency < 1%) predicted damaging coding variation by using sequence data from >170,000 individuals from multiple ancestries: 97,493 European, 30,025 South Asian, 16,507 African, 16,440 Hispanic/Latino, 10,420 East Asian, and 1,182 Samoan. We identified 35 genes associated with circulating lipid levels; some of these genes have not been previously associated with lipid levels when using rare coding variation from population-based samples.
Enlargement or aneurysm of the aorta predisposes to dissection, an important cause of sudden death. We trained a deep learning model to evaluate the dimensions of the ascending and descending thoracic aorta in 4.6 million cardiac magnetic resonance images from the UK Biobank.
Life Sci Alliance
December 2021
Extracellular vesicles (EVs) mediate intercellular signaling by transferring their cargo to recipient cells, but the functional consequences of signaling are not fully appreciated. RBC-derived EVs are abundant in circulation and have been implicated in regulating immune responses. Here, we use a transgenic mouse model for fluorescence-based mapping of RBC-EV recipient cells to assess the role of this intercellular signaling mechanism in heart disease.
ESC Heart Fail
December 2021
Background: Alterations in electrocardiographic (ECG) intervals are well-known markers for arrhythmia and sudden cardiac death (SCD) risk. While the genetics of arrhythmia syndromes have been studied, relations between electrocardiographic intervals and rare genetic variation at a population level are poorly understood.
Methods: Using a discovery sample of 29 000 individuals with whole-genome sequencing from Trans-Omics in Precision Medicine and replication in nearly 100 000 with whole-exome sequencing from the UK Biobank and MyCode, we examined associations between low-frequency and rare coding variants with 5 routinely measured electrocardiographic traits (RR, P-wave, PR, and QRS intervals and corrected QT interval).
The electrocardiographic PR interval reflects atrioventricular conduction, and is associated with conduction abnormalities, pacemaker implantation, atrial fibrillation (AF), and cardiovascular mortality. Here we report a multi-ancestry (N = 293,051) genome-wide association meta-analysis for the PR interval, discovering 202 loci of which 141 have not previously been reported. Variants at identified loci increase the percentage of heritability explained, from 33.
Heart failure (HF) is a leading cause of morbidity and mortality worldwide. A small proportion of HF cases are attributable to monogenic cardiomyopathies and existing genome-wide association studies (GWAS) have yielded only limited insights, leaving the observed heritability of HF largely unexplained. We report results from a GWAS meta-analysis of HF comprising 47,309 cases and 930,014 controls.
High-throughput metabolomics using liquid chromatography and mass spectrometry (LC/MS) provides a useful method to identify biomarkers of disease and explore biological systems. However, the majority of metabolic features detected from untargeted metabolomics experiments have unknown ion signatures, making it critical that data should be thoroughly quality controlled to avoid analyzing false signals. Here, we present a postalignment method relying on intermittent pooled study samples to separate genuine metabolic features from potential measurement artifacts.
In the version of this article originally published, there were two errors in the text of the second paragraph of the Methods section. In the sentence "To identify genetic variants that contribute to doctor-diagnosed asthma and allergic diseases (detailed phenotype information described in the Supplementary Note) and link them with other conditions, we performed GWASs using phenotype measures in UK Biobank participants (N = 487,409)" the number of participants should have been 150,509. In the sentence "Thus, a total of 110,361 European descendants with high-quality genotyping and complete phenotype/covariate data were used for these analyses, including 25,685 allergic diseases subjects (hay fever/allergic rhinitis or eczema, without doctor-diagnosed asthma), 14,085 asthma subjects and 76,768 controls for the analysis" the phrase "without doctor-diagnosed asthma" should have read "some with doctor-diagnosed asthma."
Atrial fibrillation (AF) affects more than 33 million individuals worldwide and has a complex heritability. We conducted the largest meta-analysis of genome-wide association studies (GWAS) for AF to date, consisting of more than half a million individuals, including 65,446 with AF. In total, we identified 97 loci significantly associated with AF, including 67 that were novel in a combined-ancestry analysis, and 3 that were novel in a European-specific analysis.
Clinical and epidemiological data suggest that asthma and allergic diseases are associated and may share a common genetic etiology. We analyzed genome-wide SNP data for asthma and allergic diseases in 33,593 cases and 76,768 controls of European ancestry from UK Biobank. Two publicly available independent genome-wide association studies were used for replication.
Marine Group I (MGI) Thaumarchaeota are one of the most abundant and cosmopolitan chemoautotrophs within the global dark ocean. To date, no representatives of this archaeal group retrieved from the dark ocean have been successfully cultured. We used single cell genomics to investigate the genomic and metabolic diversity of thaumarchaea within the mesopelagic of the subtropical North Pacific and South Atlantic Ocean.