Publications by authors named "Doruk Beyter"

Obesity is associated with adverse effects on health and quality of life. Improved understanding of its underlying pathophysiology is essential for developing counteractive measures. To search for sequence variants with large effects on BMI, we perform a multi-ancestry meta-analysis of 13 genome-wide association studies on BMI, including data derived from 1,534,555 individuals of European ancestry, 339,657 of Asian ancestry, and 130,968 of African ancestry.

View Article and Find Full Text PDF
Article Synopsis
  • Researchers analyzed genetic data from nearly 130,000 cancer patients and over 730,000 healthy controls to identify variants linked to cancer risk across 22 cancer types.
  • Four high-risk genes were found: BIK (prostate cancer), ATG12 (colorectal cancer), TG (thyroid cancer), and CMTR2 (lung cancer and melanoma).
  • Additionally, two genes, AURKB (general cancer risk) and PPP1R15A (breast cancer), were associated with decreased cancer risk, indicating potential pathways for cancer prevention strategies.
View Article and Find Full Text PDF

Gene promoter and enhancer sequences are bound by transcription factors and are depleted of methylated CpG sites (cytosines preceding guanines in DNA). The absence of methylated CpGs in these sequences typically correlates with increased gene expression, indicating a regulatory role for methylation. We used nanopore sequencing to determine haplotype-specific methylation rates of 15.

View Article and Find Full Text PDF

Background: Long-read sequencing can enable the detection of base modifications, such as CpG methylation, in single molecules of DNA. The most commonly used methods for long-read sequencing are nanopore developed by Oxford Nanopore Technologies (ONT) and single molecule real-time (SMRT) sequencing developed by Pacific Bioscience (PacBio). In this study, we systematically compare the performance of CpG methylation detection from long-read sequencing.

View Article and Find Full Text PDF

Importance: Recurrent pericarditis is a treatment challenge and often a debilitating condition. Drugs inhibiting interleukin 1 cytokines are a promising new treatment option, but their use is based on scarce biological evidence and clinical trials of modest sizes, and the contributions of innate and adaptive immune processes to the pathophysiology are incompletely understood.

Objective: To use human genomics, transcriptomics, and proteomics to shed light on the pathogenesis of pericarditis.

View Article and Find Full Text PDF
Article Synopsis
  • Migraine is a complicated neurovascular condition with varying symptoms, traditionally studied as a single type in genome-wide association studies (GWAS), but this research focuses on two main subtypes: migraine with aura (MA) and migraine without aura (MO).
  • The study analyzed large datasets from six European populations, identifying four new gene variants associated with MA and classifying 13 variants for MO, highlighting a significant frameshift variant in PRRT2 linked to MA and epilepsy.
  • Additionally, testing on rare variants showed that loss-of-function mutations in SCN11A provide strong protection against migraine, while another variant affecting KCNK5 offers large protection against both migraine and brain aneurysms, suggesting new avenues for treatment.
View Article and Find Full Text PDF

Urticaria is a skin disorder characterized by outbreaks of raised pruritic wheals. In order to identify sequence variants associated with urticaria, we performed a meta-analysis of genome-wide association studies for urticaria with a total of 40,694 cases and 1,230,001 controls from Iceland, the UK, Finland, and Japan. We also performed transcriptome- and proteome-wide analyses in Iceland and the UK.

View Article and Find Full Text PDF

Microsatellites are polymorphic tracts of short tandem repeats with one to six base-pair (bp) motifs and are some of the most polymorphic variants in the genome. Using 6084 Icelandic parent-offspring trios we estimate 63.7 (95% CI: 61.

View Article and Find Full Text PDF

Genotypes causing pregnancy loss and perinatal mortality are depleted among living individuals and are therefore difficult to find. To explore genetic causes of recessive lethality, we searched for sequence variants with deficit of homozygosity among 1.52 million individuals from six European populations.

View Article and Find Full Text PDF
Article Synopsis
  • Genome-wide association studies have found thousands of genetic variations affecting blood traits, but the impact of structural variants on these traits was previously unclear.
  • A study using whole genome sequencing from a diverse group of nearly 50,700 participants identified 21 significant structural variants linked to red and white blood cell traits, with most findings confirmed in other datasets.
  • Experimental evidence showed that a specific deletion linked to lower monocyte counts disrupts an enhancer for the S1PR3 gene, leading to reduced S1PR3 expression.
View Article and Find Full Text PDF

Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank.

View Article and Find Full Text PDF

Long-read sequencing (LRS) promises to improve the characterization of structural variants (SVs). We generated LRS data from 3,622 Icelanders and identified a median of 22,636 SVs per individual (a median of 13,353 insertions and 9,474 deletions). We discovered a set of 133,886 reliably genotyped SV alleles and imputed them into 166,281 individuals to explore their effects on diseases and other traits.

View Article and Find Full Text PDF

Thousands of genomic structural variants (SVs) segregate in the human population and can impact phenotypic traits and diseases. Their identification in whole-genome sequence data of large cohorts is a major computational challenge. Most current approaches identify SVs in single genomes and afterwards merge the identified variants into a joint call set across many genomes.

View Article and Find Full Text PDF

A major challenge to long read sequencing data is their high error rate of up to 15%. We present Ratatosk, a method to correct long reads with short read data. We demonstrate on 5 human genome trios that Ratatosk reduces the error rate of long reads 6-fold on average with a median error rate as low as 0.

View Article and Find Full Text PDF

Despite the important role that monozygotic twins have played in genetics research, little is known about their genomic differences. Here we show that monozygotic twins differ on average by 5.2 early developmental mutations and that approximately 15% of monozygotic twins have a substantial number of these early developmental mutations specific to one of them.

View Article and Find Full Text PDF

Analysis of sequence diversity in the human genome is fundamental for genetic studies. Structural variants (SVs) are frequently omitted in sequence analysis studies, although each has a relatively large impact on the genome. Here, we present GraphTyper2, which uses pangenome graphs to genotype SVs and small variants using short-reads.

View Article and Find Full Text PDF

Shotgun metaproteomics has the potential to reveal the functional landscape of microbial communities but lacks appropriate methods for complex samples with unknown compositions. In the absence of prior taxonomic information, tandem mass spectra would be searched against large pan-microbial databases, which requires heavy computational workload and reduces sensitivity. We present ProteoStorm, an efficient database search framework for large-scale metaproteomics studies, which identifies high-confidence peptide-spectrum matches (PSMs) while achieving a two-to-three orders-of-magnitude speedup over popular tools.

View Article and Find Full Text PDF

We describe Strelka2 ( https://github.com/Illumina/strelka ), an open-source small-variant-calling method for research and clinical germline and somatic sequencing applications. Strelka2 introduces a novel mixture-model-based estimation of insertion/deletion error parameters from each sample, an efficient tiered haplotype-modeling strategy, and a normal sample contamination model to improve liquid tumor analysis.

View Article and Find Full Text PDF
Article Synopsis
  • Human cells typically have 23 pairs of chromosomes, but in cancer, genes can be found in both chromosomes and circular extrachromosomal DNA (ecDNA).
  • A study of 17 cancer types revealed that ecDNA appears in nearly half of all human cancers, often amplifying driver oncogenes, which leads to higher levels of these genes.
  • Mathematical models and quantitative analyses support the idea that ecDNA plays a significant role in cancer evolution by increasing genetic diversity and oncogene copy numbers more effectively than traditional chromosomal changes.
View Article and Find Full Text PDF
Article Synopsis
  • Managing ecosystems to support biodiversity can enhance stability and productivity for bioenergy production; however, its effectiveness in industrial ecosystems remains untested.
  • A year-long study utilized tag-sequencing to analyze the diversity of bacterial and eukaryotic communities in a bioenergy-producing pond, revealing that high eukaryotic diversity correlated with stable biomass productivity.
  • The study found that bacterial diversity and eukaryotic diversity were inversely related, suggesting that varying responses to temperature may influence their respective dynamics, highlighting the importance of maintaining diverse microbial communities for sustainable bioenergy ecosystems.
View Article and Find Full Text PDF